Speech Technology

Author: Fang Chen

Publisher: Springer Science & Business Media

Published: 2010-07-01

Total Pages: 349

ISBN-13: 0387738193

This book gives an overview of research on and applications of speech technologies in different areas. The authors take a broad, multidisciplinary view of the research areas involved. The book emphasizes application: user experience, human factors, and usability issues are its focus.


Multilingual Speech Processing

Author: Tanja Schultz

Publisher: Elsevier

Published: 2006-06-12

Total Pages: 540

ISBN-13: 0080457622

Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech processing from a multilingual perspective. By taking this all-inclusive approach to speech processing, the editors have included theories, algorithms, and techniques that are required to support spoken input and output in a large variety of languages. Multilingual Speech Processing presents a comprehensive introduction to research problems and solutions, from both a theoretical and a practical perspective, and highlights technology that addresses the increasing need for multilingual applications in our global community. Current challenges of speech processing and the feasibility of sharing data and system components across different languages guide contributors in their discussions of trends, prognoses, and open research issues. This includes automatic speech recognition and speech synthesis, but also speech-to-speech translation, dialog systems, automatic language identification, and handling non-native speech. The book is complemented by an overview of multilingual resources, important research trends, and actual speech processing systems that are being deployed in multilingual human-human and human-machine interfaces. Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives.

State-of-the-art research with a global perspective by authors from the USA, Asia, Europe, and South Africa.

The only comprehensive introduction to multilingual speech processing currently available.

Detailed presentation of technological advances integral to security, financial, cellular, and commercial applications.


Talker Variability in Speech Processing

Author: Keith Johnson

Publisher:

Published: 1997

Total Pages: 264

ISBN-13:

In this text, the editors aim to cover the mapping of speech patterns into mental representations. They cover theories of perception and cognition, issues in clinical speech pathology, and the practical concerns of speech technology.


Artificial Intelligence and Speech Technology

Author: Amita Dev

Publisher: CRC Press

Published: 2021-06-29

Total Pages: 522

ISBN-13: 1000472906

The 2nd International Conference on Artificial Intelligence and Speech Technology (AIST2020) was organized by Indira Gandhi Delhi Technical University for Women, Delhi, India on November 19–20, 2020. AIST2020 is dedicated to cutting-edge research that addresses the scientific needs of academic researchers and industrial professionals exploring new horizons of knowledge related to artificial intelligence and speech technologies. AIST2020 includes high-quality paper presentation sessions revealing the latest research findings, along with engaging participant discussions. The main focus is on novel contributions that open new opportunities for better, low-cost solutions that benefit society. These include the use of new AI-based approaches such as deep learning, CNNs, RNNs, GANs, and others in speech-related problems such as speech synthesis and speech recognition.


Statistical Methods for Speech Recognition

Author: Frederick Jelinek

Publisher: MIT Press

Published: 2022-11-01

Total Pages: 307

ISBN-13: 0262546604

This book reflects decades of important research on the mathematical foundations of speech recognition. It focuses on underlying statistical techniques such as hidden Markov models, decision trees, the expectation-maximization algorithm, information-theoretic goodness criteria, maximum entropy probability estimation, parameter and data clustering, and smoothing of probability distributions. The author's goal is to present these principles clearly in the simplest setting, to show the advantages of self-organization from real data, and to enable the reader to apply the techniques.

Bradford Books imprint


Designing Human Interface in Speech Technology

Author: Fang Chen

Publisher: Springer Science & Business Media

Published: 2006

Total Pages: 416

ISBN-13: 9780387241555

Bridging the gap between the needs of the technical engineer and cognitive researchers related to speech technology applications.

Systematic approach focusing on the utility of speech-related product design.

Designed to respond to the growing need for specific theories, tools, and methods for designing, testing, and evaluating speech-related human-system interfaces.

Targeted at designers, engineers, and decision makers working in the area of speech technology research.


Essential Speech and Language Technology for Dutch

Author: Peter Spyns

Publisher: Springer Science & Business Media

Published: 2013-02-26

Total Pages: 414

ISBN-13: 3642309100

The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of the art of HLT for Dutch in the areas covered, but, even more importantly, describes the resources (data and tools) for Dutch that have been created and are now available to both academia and industry worldwide. The contributions cover many areas of human language technology (for Dutch): corpus collection (including IPR issues) and building (in particular one corpus aiming at a collection of 500M word tokens), lexicology, anaphora resolution, a semantic network, parsing technology, speech recognition, machine translation, text (summaries) generation, web mining, information extraction, and text-to-speech, to name the most important ones. The book also shows how a medium-sized language community (spanning two territories) can create a digital language infrastructure (resources, tools, etc.) as a basis for subsequent R&D. At the same time, it bundles contributions from almost all the HLT research groups in Flanders and the Netherlands, and hence offers a view of their recent research activities. The targeted readers are mainly researchers in human language technology, in particular those focusing on Dutch. This includes researchers active in larger networks such as CLARIN, META-NET, and FLaReNet, and those participating in conferences such as ACL, EACL, NAACL, COLING, RANLP, CICling, LREC, CLIN and DIR (both in the Low Countries), InterSpeech, ASRU, ICASSP, ISCA, EUSIPCO, CLEF, TREC, etc. In addition, some chapters are of interest to human language technology policy makers and even to science policy makers in general.


Recent Advances in Robust Speech Recognition Technology

Author: Javier Ramirez

Publisher: Bentham Science

Published: 2011

Total Pages: 223

ISBN-13: 1608051722

"This E-book is a collection of articles that describe advances in speech recognition technology. Robustness in speech recognition refers to the need to maintain high speech recognition accuracy even when the quality of the input speech is degraded, or whe"


Automatic Speech Recognition

Author: Dong Yu

Publisher: Springer

Published: 2014-11-11

Total Pages: 329

ISBN-13: 1447157796

This book provides a comprehensive overview of recent advances in the field of automatic speech recognition, with a focus on deep learning models, including deep neural networks and many of their variants. This is the first automatic speech recognition book dedicated to the deep learning approach. In addition to a rigorous mathematical treatment of the subject, the book presents insights into, and the theoretical foundations of, a series of highly successful deep learning models.