Robust Adaptation to Non-Native Accents in Automatic Speech Recognition

Author: Silke Goronzy

Publisher: Springer

Published: 2003-07-01

Total Pages: 135

ISBN-13: 3540362908

Speech recognition technology is being increasingly employed in human-machine interfaces. A remaining problem, however, is the robustness of this technology to non-native accents, which still cause considerable difficulties for current systems. In this book, methods to overcome this problem are described. A speaker adaptation algorithm based on the MLLR principle, capable of adapting to the current speaker with just a few words of speaker-specific data, is developed and combined with confidence measures that focus on phone durations as well as on acoustic features. Furthermore, a specific pronunciation modelling technique that allows the automatic derivation of non-native pronunciations without using non-native data is described and combined with the previous techniques to produce a robust adaptation to non-native accents in an automatic speech recognition system.

Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition

Author: Xiaodong He

Publisher:

Published: 2003

Total Pages: 222

ISBN-13:

Rapid globalization requires speech recognition systems to handle not only speech spoken by native speakers, but also speech spoken by foreign speakers. Currently, most American English speech recognition systems are built from speech data of native speakers of American English. Although these systems work very well for native speakers, their performance degrades dramatically on foreign-accented speech. Moreover, because of the wide variety of foreign accents, the differing levels of English proficiency, and the limited data available, it is generally difficult to train a specific acoustic model for each foreign accent. A practically feasible way to improve nonnative speech recognition is therefore fast model adaptation. In this dissertation, the problem of adapting acoustic models of native English speech to nonnative speakers is addressed from the perspective of adaptive model selection. The goal is to dynamically select the optimal model for each nonnative talker so as to balance model robustness to pronunciation variations against the model detail needed to discriminate speech sounds. A maximum expected likelihood (MEL) based technique is proposed for reliable model selection when adaptation data is sparse: the expectation of the log-likelihood (EL) of the adaptation data is computed from the distributions of mismatch biases between model and data, and the model that maximizes EL is selected. Moreover, to obtain reliable results when the available data is very limited, an improved prior-knowledge-guided MEL (P-MEL) approach is also proposed, which uses maximum a posteriori (MAP) estimation of the bias distributions. These model selection methods are further combined with maximum likelihood linear regression (MLLR) to enable adaptation of both the structure and the parameters of acoustic models. Experiments were performed on data from speakers with a wide range of foreign accents.

Results show that MEL-based model selection can dynamically select a proper model according to the available adaptation data, and that the P-MEL approach achieves good performance even when the amount of data is very small. Compared with standard MLLR, the MEL+MLLR and P-MEL+MLLR methods led to consistent and significant improvements in recognition accuracy on nonnative speakers, without performance degradation on native speakers.

Speaker Classification I

Author: Christian Müller

Publisher: Springer

Published: 2007-08-28

Total Pages: 363

ISBN-13: 354074200X

This volume and its companion volume LNAI 4441 constitute a state-of-the-art survey in the field of speaker classification. Together they address such intriguing issues as how speaker characteristics are manifested in voice and speaking behavior. The nineteen contributions in this volume are organized into topical sections covering fundamentals, characteristics, applications, methods, and evaluation.

Speech Recognition

Author: France Mihelič

Publisher: BoD – Books on Demand

Published: 2008-11-01

Total Pages: 580

ISBN-13: 953761929X

Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation for speech signals and the methods for speech-features extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition. The last part of the book is devoted to other speech processing applications that can use the information from automatic speech recognition for speaker identification and tracking, for prosody modeling in emotion-detection systems and in other speech processing applications that are able to operate in real-world environments, like mobile communication services and smart homes.

Technology-Enhanced Language Learning for Specialized Domains

Author: Elena Martín-Monje

Publisher: Routledge

Published: 2016-03-10

Total Pages: 309

ISBN-13: 131731090X

Technology-Enhanced Language Learning for Specialized Domains provides an exploration of the latest developments in technology-enhanced learning and the processing of languages for specific purposes. It combines theoretical and applied research from an interdisciplinary angle, covering general issues related to learning languages with computers, assessment, mobile-assisted language learning, the new language massive open online courses, corpus-based research and computer-assisted aspects of translation. The chapters in this collection include contributions from a number of international experts in the field with a wide range of experience in the use of technologies to enhance the language learning process. The essays have been brought together precisely in recognition of the demand for this kind of specialised tuition, offering state-of-the-art technological and methodological innovation and practical applications. The topics covered revolve around the practical consequences of the current possibilities of mobility for both learners and teachers, as well as the applicability of updated technological advances to language learning and teaching, particularly in specialized domains. This is achieved through the description and discussion of practical examples of those applications in a variety of educational contexts. At the beginning of each thematic section, readers will find an introductory chapter which contextualises the topic and links the different examples discussed. Drawing together rich primary research and empirical studies related to specialized tuition and the processing of languages, Technology-Enhanced Language Learning for Specialized Domains will be an invaluable resource for academics, researchers and postgraduate students in the fields of education, computer assisted language learning, languages and linguistics, and language teaching.

Advances in Robotics, Automation and Control

Author: Jesús Arámburo-Lizárraga

Publisher: BoD – Books on Demand

Published: 2008-10-01

Total Pages: 482

ISBN-13: 9537619168

The book presents an overview of recent developments in the different areas of Robotics, Automation and Control. Through its 24 chapters, this book presents topics related to control and robot design; it also introduces new mathematical tools and techniques devoted to improving system modeling and control. An important point is the use of rational agents and heuristic techniques to cope with the computational complexity required for controlling complex systems. The book also covers navigation and vision algorithms, automatic handwritten comprehension and speech recognition systems that will be included in the next generation of productive systems.

Audiovisual Speech Processing

Author: Gérard Bailly

Publisher: Cambridge University Press

Published: 2012-04-26

Total Pages: 507

ISBN-13: 1107006821

This book presents a complete overview of all aspects of audiovisual speech including perception, production, brain processing and technology.

Frontiers of Language and Teaching, Vol.2: Proceedings of the 2011 International Online Language Conference (IOLC 2011)

Author:

Publisher: Universal-Publishers

Published:

Total Pages: 544

ISBN-13: 1612335594

Modeling Variability in Speech Recognition

Author: Georg Stemmer

Publisher:

Published: 2005

Total Pages: 261

ISBN-13: 9783832509453

Computational Science and Its Applications - ICCSA 2008

Author: Osvaldo Gervasi

Publisher: Springer

Published: 2008-06-28

Total Pages: 1297

ISBN-13: 3540698485

This two-volume set is assembled following the 2008 International Conference on Computational Science and Its Applications, ICCSA 2008, a premium international event held in Perugia, Italy, from June 30 to July 3, 2008. The collection of fully refereed high-quality original works accepted as theme papers for presentation at ICCSA 2008 are published in this LNCS proceedings set. This outstanding collection complements the volume of workshop papers, traditionally published by IEEE Computer Society. The continuous support of computational science researchers has helped ICCSA to become a firmly established forum in the area of scientific computing and the conference itself become a recurring scientific and professional meeting that cannot be given up. The computational science field, based on fundamental disciplines such as mathematics, physics, and chemistry, is finding new computational approaches to foster the human progress in heterogeneous and fundamental areas such as aerospace and automotive industries, bioinformatics and nanotechnology studies, networks and grid computing, computational geometry and biometrics, computer education, virtual reality, and art. Due to the growing complexity of many challenges in computational science, the use of sophisticated algorithms and emerging technologies is inevitable. Together, these far-reaching scientific areas help to shape this conference in the areas of state-of-the-art computational science research and applications, encompassing the facilitating theoretical foundations and the innovative applications of such results in other areas.
