Speech Recognition Algorithms Using Weighted Finite-State Transducers

Speech Recognition Algorithms Using Weighted Finite-State Transducers

Author: Takaaki Hori

Publisher: Springer Nature

Published: 2022-05-31

Total Pages: 161

ISBN-13: 3031025628

DOWNLOAD EBOOK

This book introduces the theory, algorithms, and implementation techniques for efficient decoding in speech recognition mainly focusing on the Weighted Finite-State Transducer (WFST) approach. The decoding process for speech recognition is viewed as a search problem whose goal is to find a sequence of words that best matches an input speech signal. Since this process becomes computationally more expensive as the system vocabulary size increases, research has long been devoted to reducing the computational cost. Recently, the WFST approach has become an important state-of-the-art speech recognition technology, because it offers improved decoding speed with fewer recognition errors compared with conventional methods. However, it is not easy to understand all the algorithms used in this framework, and they are still in a black box for many people. In this book, we review the WFST approach and aim to provide comprehensive interpretations of WFST operations and decoding algorithms to help anyone who wants to understand, develop, and study WFST-based speech recognizers. We also mention recent advances in this framework and its applications to spoken language processing. Table of Contents: Introduction / Brief Overview of Speech Recognition / Introduction to Weighted Finite-State Transducers / Speech Recognition by Weighted Finite-State Transducers / Dynamic Decoders with On-the-fly WFST Operations / Summary and Perspective


Springer Handbook of Speech Processing

Springer Handbook of Speech Processing

Author: Jacob Benesty

Publisher: Springer Science & Business Media

Published: 2007-11-28

Total Pages: 1170

ISBN-13: 3540491252

DOWNLOAD EBOOK

This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.


Finite-state Language Processing

Finite-state Language Processing

Author: Emmanuel Roche

Publisher: MIT Press

Published: 1997

Total Pages: 494

ISBN-13: 9780262181822

DOWNLOAD EBOOK

Finite-state devices, such as finite-state automata, graphs, and finite-state transducers, have been present since the emergence of computer science and are extensively used in areas as various as program compilation, hardware modeling, and database management. Although finite-state devices have been known for some time in computational linguistics, more powerful formalisms such as context-free grammars or unification grammars have typically been preferred. Recent mathematical and algorithmic results in the field of finite-state technology have had a great impact on the representation of electronic dictionaries and on natural language processing, resulting in a new technology for language emerging out of both industrial and academic research. This book presents a discussion of fundamental finite-state algorithms, and constitutes an approach from the perspective of natural language processing.


Finite-state Methods and Natural Language Processing

Finite-state Methods and Natural Language Processing

Author: Jakub Piskorski

Publisher: IOS Press

Published: 2009

Total Pages: 248

ISBN-13: 158603975X

DOWNLOAD EBOOK

Contains papers that cover a range of Natural Language Processing (NLP) applications, including machine learning and translation, logic, computational phonology, morphology and semantics, data mining, information extraction and disambiguation, as well as programming, optimization and compression of finite-state networks.


Computational Processing of the Portuguese Language

Computational Processing of the Portuguese Language

Author: Renata Vieira

Publisher: Springer

Published: 2006-05-24

Total Pages: 284

ISBN-13: 3540340467

DOWNLOAD EBOOK

This book constitutes the thoroughly refereed proceedings of the 7th International Workshop on Computational Processing of the Portuguese Language, PROPOR 2006. The 20 revised full papers and 17 revised short papers presented here are organized in topical sections on automatic summarization, resources, translation, named entity recognition, tools and frameworks, systems and models, information extraction, speech processing, lexicon, morpho-syntactic studies, and Web, corpus and evaluation.


Bayesian Speech and Language Processing

Bayesian Speech and Language Processing

Author: Shinji Watanabe

Publisher: Cambridge University Press

Published: 2015-07-15

Total Pages: 447

ISBN-13: 1107055571

DOWNLOAD EBOOK

A practical and comprehensive guide on how to apply Bayesian machine learning techniques to solve speech and language processing problems.


Formal Languages and Applications

Formal Languages and Applications

Author: Carlos Martin-Vide

Publisher: Springer

Published: 2013-03-09

Total Pages: 612

ISBN-13: 3540398864

DOWNLOAD EBOOK

Formal Languages and Applications provides a comprehensive study-aid and self-tutorial for graduates students and researchers. The main results and techniques are presented in an readily accessible manner and accompanied by many references and directions for further research. This carefully edited monograph is intended to be the gateway to formal language theory and its applications, so it is very useful as a review and reference source of information in formal language theory.


Distant Speech Recognition

Distant Speech Recognition

Author: Matthias Woelfel

Publisher: John Wiley & Sons

Published: 2009-04-20

Total Pages: 600

ISBN-13: 0470714077

DOWNLOAD EBOOK

A complete overview of distant automatic speech recognition The performance of conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon as the microphone is moved away from the mouth of the speaker. This is due to a broad variety of effects such as background noise, overlapping speech from other speakers, and reverberation. While traditional ASR systems underperform for speech captured with far-field sensors, there are a number of novel techniques within the recognition system as well as techniques developed in other areas of signal processing that can mitigate the deleterious effects of noise and reverberation, as well as separating speech from overlapping speakers. Distant Speech Recognitionpresents a contemporary and comprehensive description of both theoretic abstraction and practical issues inherent in the distant ASR problem. Key Features: Covers the entire topic of distant ASR and offers practical solutions to overcome the problems related to it Provides documentation and sample scripts to enable readers to construct state-of-the-art distant speech recognition systems Gives relevant background information in acoustics and filter techniques, Explains the extraction and enhancement of classification relevant speech features Describes maximum likelihood as well as discriminative parameter estimation, and maximum likelihood normalization techniques Discusses the use of multi-microphone configurations for speaker tracking and channel combination Presents several applications of the methods and technologies described in this book Accompanying website with open source software and tools to construct state-of-the-art distant speech recognition systems This reference will be an invaluable resource for researchers, developers, engineers and other professionals, as well as advanced students in speech technology, signal processing, acoustics, statistics and artificial intelligence fields.


Audio Source Separation and Speech Enhancement

Audio Source Separation and Speech Enhancement

Author: Emmanuel Vincent

Publisher: John Wiley & Sons

Published: 2018-07-24

Total Pages: 506

ISBN-13: 1119279887

DOWNLOAD EBOOK

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software. Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting. Key features: Consolidated perspective on audio source separation and speech enhancement. Both historical perspective and latest advances in the field, e.g. deep neural networks. Diverse disciplines: array processing, machine learning, and statistical signal processing. Covers the most important techniques for both single-channel and multichannel processing. This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.


Future Information Technology, Application, and Service

Future Information Technology, Application, and Service

Author: James (Jong Hyuk) Park

Publisher: Springer Science & Business Media

Published: 2012-06-05

Total Pages: 762

ISBN-13: 9400745168

DOWNLOAD EBOOK

This book is proceedings of the 7th FTRA International Conference on Future Information Technology (FutureTech 2012). The topics of FutureTech 2012 cover the current hot topics satisfying the world-wide ever-changing needs. The FutureTech 2012 is intended to foster the dissemination of state-of-the-art research in all future IT areas, including their models, services, and novel applications associated with their utilization. The FutureTech 2012 will provide an opportunity for academic and industry professionals to discuss the latest issues and progress in this area. In addition, the conference will publish high quality papers which are closely related to the various theories, modeling, and practical applications in many types of future technology. The main scope of FutureTech 2012 is as follows. Hybrid Information Technology Cloud and Cluster Computing Ubiquitous Networks and Wireless Communications Multimedia Convergence Intelligent and Pervasive Applications Security and Trust Computing IT Management and Service Bioinformatics and Bio-Inspired Computing Database and Data Mining Knowledge System and Intelligent Agent Human-centric Computing and Social Networks The FutureTech is a major forum for scientists, engineers, and practitioners throughout the world to present the latest research, results, ideas, developments and applications in all areas of future technologies.