Acoustical and Environmental Robustness in Automatic Speech Recognition

Acoustical and Environmental Robustness in Automatic Speech Recognition

Author: A. Acero

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 197

ISBN-13: 1461531225

DOWNLOAD EBOOK

The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received only a small fraction of the attention devoted to speaker independence, even speech recognition systems that are designed to be speaker independent frequently perform very poorly when they are tested using a different type of microphone or acoustical environment from the one with which they were trained. The use of microphones other than a "close talking" headset also tends to severely degrade speech recognition -performance. Even in relatively quiet office environments, speech is degraded by additive noise from fans, slamming doors, and other conversations, as well as by the effects of unknown linear filtering arising reverberation from surface reflections in a room, or spectral shaping by microphones or the vocal tracts of individual speakers. Speech-recognition systems designed for long-distance telephone lines, or applications deployed in more adverse acoustical environments such as motor vehicles, factory floors, oroutdoors demand far greaterdegrees ofenvironmental robustness. There are several different ways of building acoustical robustness into speech recognition systems. Arrays of microphones can be used to develop a directionally-sensitive system that resists intelference from competing talkers and other noise sources that are spatially separated from the source of the desired speech signal.


Acoustical and Environmental Robustness in Automatic Speech Recognition

Acoustical and Environmental Robustness in Automatic Speech Recognition

Author: Alex Acero

Publisher: Springer

Published: 2013-07-13

Total Pages: 186

ISBN-13: 9781461363668

DOWNLOAD EBOOK

The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received only a small fraction of the attention devoted to speaker independence, even speech recognition systems that are designed to be speaker independent frequently perform very poorly when they are tested using a different type of microphone or acoustical environment from the one with which they were trained. The use of microphones other than a "close talking" headset also tends to severely degrade speech recognition -performance. Even in relatively quiet office environments, speech is degraded by additive noise from fans, slamming doors, and other conversations, as well as by the effects of unknown linear filtering arising reverberation from surface reflections in a room, or spectral shaping by microphones or the vocal tracts of individual speakers. Speech-recognition systems designed for long-distance telephone lines, or applications deployed in more adverse acoustical environments such as motor vehicles, factory floors, oroutdoors demand far greaterdegrees ofenvironmental robustness. There are several different ways of building acoustical robustness into speech recognition systems. Arrays of microphones can be used to develop a directionally-sensitive system that resists intelference from competing talkers and other noise sources that are spatially separated from the source of the desired speech signal.


Robust Automatic Speech Recognition

Robust Automatic Speech Recognition

Author: Jinyu Li

Publisher: Academic Press

Published: 2015-10-30

Total Pages: 308

ISBN-13: 0128026162

DOWNLOAD EBOOK

Robust Automatic Speech Recognition: A Bridge to Practical Applications establishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise-and reverberation robust techniques that have been developed over the past thirty years, with an emphasis on practical methods that have been proven to be successful and which are likely to be further developed for future applications.The strengths and weaknesses of robustness-enhancing speech recognition techniques are carefully analyzed. The book covers noise-robust techniques designed for acoustic models which are based on both Gaussian mixture models and deep neural networks. In addition, a guide to selecting the best methods for practical applications is provided.The reader will: - Gain a unified, deep and systematic understanding of the state-of-the-art technologies for robust speech recognition - Learn the links and relationship between alternative technologies for robust speech recognition - Be able to use the technology analysis and categorization detailed in the book to guide future technology development - Be able to develop new noise-robust methods in the current era of deep learning for acoustic modeling in speech recognition - The first book that provides a comprehensive review on noise and reverberation robust speech recognition methods in the era of deep neural networks - Connects robust speech recognition techniques to machine learning paradigms with rigorous mathematical treatment - Provides elegant and structural ways to categorize and analyze noise-robust speech recognition techniques - Written by leading researchers who have been actively working on the subject matter in both industrial and academic organizations for many years


Techniques for Noise Robustness in Automatic Speech Recognition

Techniques for Noise Robustness in Automatic Speech Recognition

Author: Tuomas Virtanen

Publisher: John Wiley & Sons

Published: 2012-11-28

Total Pages: 514

ISBN-13: 1119970881

DOWNLOAD EBOOK

Automatic speech recognition (ASR) systems are finding increasing use in everyday life. Many of the commonplace environments where the systems are used are noisy, for example users calling up a voice search system from a busy cafeteria or a street. This can result in degraded speech recordings and adversely affect the performance of speech recognition systems. As the use of ASR systems increases, knowledge of the state-of-the-art in techniques to deal with such problems becomes critical to system and application engineers and researchers who work with or on ASR technologies. This book presents a comprehensive survey of the state-of-the-art in techniques used to improve the robustness of speech recognition systems to these degrading external influences. Key features: Reviews all the main noise robust ASR approaches, including signal separation, voice activity detection, robust feature extraction, model compensation and adaptation, missing data techniques and recognition of reverberant speech. Acts as a timely exposition of the topic in light of more widespread use in the future of ASR technology in challenging environments. Addresses robustness issues and signal degradation which are both key requirements for practitioners of ASR. Includes contributions from top ASR researchers from leading research units in the field


Robustness in Automatic Speech Recognition

Robustness in Automatic Speech Recognition

Author: Jean-Claude Junqua

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 457

ISBN-13: 1461312973

DOWNLOAD EBOOK

Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-the-art linear prediction synthesizer. This caused a speech technology fever among the electronics industry. Particularly. applications of automatic speech recognition were rigorously attempt ed by many companies. some of which were start-ups founded just for this purpose. Unfortunately. it did not take long before they realized that automatic speech rec ognition technology was not mature enough to satisfy the need of customers. The fever gradually faded away. In the meantime. constant efforts have been made by many researchers and engi neers to improve the automatic speech recognition technology. Hardware capabilities have advanced impressively since that time. In the past few years. we have been witnessing and experiencing the advent of the "Information Revolution." What might be called the second surge of interest to com mercialize speech technology as a natural interface for man-machine communication began in much better shape than the first one. With computers much more powerful and faster. many applications look realistic this time. However. there are still tremendous practical issues to be overcome in order for speech to be truly the most natural interface between humans and machines.


Robust Speech Recognition in Embedded Systems and PC Applications

Robust Speech Recognition in Embedded Systems and PC Applications

Author: Jean-Claude Junqua

Publisher: Springer Science & Business Media

Published: 2006-04-18

Total Pages: 193

ISBN-13: 0306470276

DOWNLOAD EBOOK

Robust Speech Recognition in Embedded Systems and PC Applications provides a link between the technology and the application worlds. As speech recognition technology is now good enough for a number of applications and the core technology is well established around hidden Markov models many of the differences between systems found in the field are related to implementation variants. We distinguish between embedded systems and PC-based applications. Embedded applications are usually cost sensitive and require very simple and optimized methods to be viable. Robust Speech Recognition in Embedded Systems and PC Applications reviews the problems of robust speech recognition, summarizes the current state of the art of robust speech recognition while providing some perspectives, and goes over the complementary technologies that are necessary to build an application, such as dialog and user interface technologies. Robust Speech Recognition in Embedded Systems and PC Applications is divided into five chapters. The first one reviews the main difficulties encountered in automatic speech recognition when the type of communication is unknown. The second chapter focuses on environment-independent/adaptive speech recognition approaches and on the mainstream methods applicable to noise robust speech recognition. The third chapter discusses several critical technologies that contribute to making an application usable. It also provides some design recommendations on how to design prompts, generate user feedback and develop speech user interfaces. The fourth chapter reviews several techniques that are particularly useful for embedded systems or to decrease computational complexity. It also presents some case studies for embedded applications and PC-based systems. Finally, the fifth chapter provides a future outlook for robust speech recognition, emphasizing the areas that the author sees as the most promising for the future. Robust Speech Recognition in Embedded Systems and PC Applications serves as a valuable reference and although not intended as a formal University textbook, contains some material that can be used for a course at the graduate or undergraduate level. It is a good complement for the book entitled Robustness in Automatic Speech Recognition: Fundamentals and Applications co-authored by the same author.


The Application of Hidden Markov Models in Speech Recognition

The Application of Hidden Markov Models in Speech Recognition

Author: Mark Gales

Publisher: Now Publishers Inc

Published: 2008

Total Pages: 125

ISBN-13: 1601981201

DOWNLOAD EBOOK

The Application of Hidden Markov Models in Speech Recognition presents the core architecture of a HMM-based LVCSR system and proceeds to describe the various refinements which are needed to achieve state-of-the-art performance.


Proceedings of the International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA)

Proceedings of the International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA)

Author: Suresh Chandra Satapathy

Publisher: Springer Science & Business Media

Published: 2012-12-14

Total Pages: 749

ISBN-13: 3642353142

DOWNLOAD EBOOK

The volume contains the papers presented at FICTA 2012: International Conference on Frontiers in Intelligent Computing: Theory and Applications held on December 22-23, 2012 in Bhubaneswar engineering College, Bhubaneswar, Odissa, India. It contains 86 papers contributed by authors from the globe. These research papers mainly focused on application of intelligent techniques which includes evolutionary computation techniques like genetic algorithm, particle swarm optimization techniques, teaching-learning based optimization etc for various engineering applications such as data mining, image processing, cloud computing, networking etc.


Springer Handbook of Speech Processing

Springer Handbook of Speech Processing

Author: Jacob Benesty

Publisher: Springer

Published: 2007-11-22

Total Pages: 1170

ISBN-13: 3540491279

DOWNLOAD EBOOK

This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.