Examines various speech technologies deployed in healthcare service robots to maximize the robot's ability to interpret user input. Demonstrates how anthropomorphic robot features and etiquette in robot behavior promote positive user emotions, acceptance of robots, and compliance with robot requests. Analyzes how multimodal medical-service robots and other cyber-physical systems can reduce mistakes and mishaps in the operating room. Evaluates various input methods for improving acceptance of robots in the older adult population. Presents case studies of cognitively and socially engaging robots in the long-term care setting for helping older adults with activities of daily living and in the pediatric setting for helping children with autism spectrum conditions and metabolic disorders. Speech and Automata in Health Care breaks new ground by closely analyzing how three separate disciplines - speech technology, robotics, and medical/surgical/assistive care - intersect with one another, resulting in an innovative way of diagnosing and treating both juvenile and adult illnesses and conditions. This includes the use of speech-enabled robotics to help the elderly population cope with common age-related problems caused by the diminution of their sensory, auditory, and motor capabilities. By examining the emerging nexus of speech, automata, and health care, the authors demonstrate the exciting potential of automata, both speech-driven and multimodal, to affect the healthcare delivery system so that it better meets the needs of the populations it serves. This book provides both empirical research findings and incisive literature reviews that demonstrate some of the more novel uses of speech-enabled and multimodal automata in the operating room, hospital ward, long-term care facility, and home. Studies backed by major universities, research institutes, and EU-funded collaborative projects debut in this volume.
This volume provides a wealth of timely material for industrial engineers, speech scientists, computational linguists, and for signal processing and intelligent systems design experts. Topics include: Spoken Interaction with Healthcare Robots; Service Robot Feature Effects on Patient Acceptance/Emotional Response; Designing Embodied and Virtual Agents for the Operating Room; The Emerging Role of Robotics for Personal Health Management in the Older-Adult Population; Why Input Methods for Robots that Serve the Older Adult Are Critical for Usability; Socially and Cognitively Engaging Robots in the Long-Term Care Setting; Voice-Enabled Assistive Robots for Managing Autism Spectrum Conditions; ASR and TTS for Voice-Controlled Robot Interactions in Treating Children with Metabolic Disorders.
Editors Amy Neustein and Judith A. Markowitz have recruited a talented group of contributors to introduce the next generation of natural language technologies to resolve some of the most vexing natural-language problems that compromise the performance of speech systems today. This fourteen-chapter anthology consists of contributions from industry scientists and from academicians working at major universities in North America and Europe. They include researchers who have played a central role in DARPA-funded programs and developers who craft real-world solutions for corporations. This anthology is aimed at speech engineers, system developers, computer scientists, AI researchers, and others interested in utilizing natural-language technology in both spoken and text-based applications.
Text-to-speech (TTS) aims to synthesize intelligible and natural speech from a given text. It is a hot topic in language, speech, and machine learning research and has broad applications in industry. This book introduces neural network-based TTS in the era of deep learning, aiming to provide a good understanding of neural TTS, current research and applications, and future research trends. The book first introduces the history of TTS technologies, gives an overview of neural TTS, and provides preliminary knowledge on language and speech processing, neural networks and deep learning, and deep generative models. It then introduces neural TTS from the perspective of key components (text analyses, acoustic models, vocoders, and end-to-end models) and advanced topics (expressive and controllable, robust, model-efficient, and data-efficient TTS). It also points out some future research directions and collects some resources related to TTS. This book is the first to introduce neural TTS in a comprehensive and easy-to-understand way and can serve both academic researchers and industry practitioners working on TTS.
The two-volume set CCIS 1516 and 1517 constitutes the thoroughly refereed short papers presented at the 28th International Conference on Neural Information Processing, ICONIP 2021, held in Sanur, Bali, Indonesia, in December 2021.* The volume also presents papers from the workshop on Artificial Intelligence and Cyber Security, held during ICONIP 2021. The 176 short and workshop papers presented in this volume were carefully reviewed and selected for publication out of 1093 submissions. The papers are organized in topical sections as follows: theory and algorithms; AI and cybersecurity; cognitive neurosciences; human centred computing; advances in deep and shallow machine learning algorithms for biomedical data and imaging; reliable, robust, and secure machine learning algorithms; theory and applications of natural computing paradigms; applications. * The conference was held virtually due to the COVID-19 pandemic.
This book constitutes the refereed proceedings of the 16th International Conference on Speech and Computer, SPECOM 2014, held in Novi Sad, Serbia. The 56 revised full papers, presented together with 3 invited talks, were carefully reviewed and selected from 100 initial submissions. SPECOM is a conference with a long tradition that attracts researchers in the area of computer speech processing (recognition, synthesis, understanding, etc.) and related domains, including signal processing, language and text processing, multimodal speech processing, and human-computer interaction.
This book contains the revised, selected, and refereed papers from the IFIP World Conference on Computers in Education, WCCE 2022, on the theme Towards a Collaborative Society through Creative Learning, held in Hiroshima, Japan, August 20-24, 2022. A total of 61 papers (54 full papers and 7 short papers) were carefully reviewed and selected from 131 submissions. They are organized in topical sections as follows: Digital Education and Computing in Schools, Digital Education and Computing in Higher Education, National Policies and Plans for Digital Competence.