Finite-state Language Processing

Finite-state Language Processing

Author: Emmanuel Roche

Publisher: MIT Press

Published: 1997

Total Pages: 494

ISBN-13: 9780262181822

DOWNLOAD EBOOK

Finite-state devices, such as finite-state automata, graphs, and finite-state transducers, have been present since the emergence of computer science and are extensively used in areas as various as program compilation, hardware modeling, and database management. Although finite-state devices have been known for some time in computational linguistics, more powerful formalisms such as context-free grammars or unification grammars have typically been preferred. Recent mathematical and algorithmic results in the field of finite-state technology have had a great impact on the representation of electronic dictionaries and on natural language processing, resulting in a new technology for language emerging out of both industrial and academic research. This book presents a discussion of fundamental finite-state algorithms, and constitutes an approach from the perspective of natural language processing.


Finite-State Methods and Natural Language Processing

Finite-State Methods and Natural Language Processing

Author: Anssi Yli-Jyrä

Publisher: Springer Science & Business Media

Published: 2010-08-05

Total Pages: 156

ISBN-13: 364214683X

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 8th International Workshop on the Finite-State-Methods and Natural Language Processing, FSMNLP 2009. The workshop was held at the University of Pretoria, South Africa on July 2009. In total 21 papers were submitted and of those papers 13 were accepted as regular papers and a further 6 as extended abstracts. The papers are devoted to computational morphology, natural language processing, finite-state methods, automata, and related formal language theory.


Finite-State Techniques

Finite-State Techniques

Author: Stoyan Mihov

Publisher: Cambridge University Press

Published: 2019-08-01

Total Pages: 316

ISBN-13: 1108621139

DOWNLOAD EBOOK

Finite-state methods are the most efficient mechanisms for analysing textual and symbolic data, providing elegant solutions for an immense number of practical problems in computational linguistics and computer science. This book for graduate students and researchers gives a complete coverage of the field, starting from a conceptual introduction and building to advanced topics and applications. The central finite-state technologies are introduced with mathematical rigour, ranging from simple finite-state automata to transducers and bimachines as 'input-output' devices. Special attention is given to the rich possibilities of simplifying, transforming and combining finite-state devices. All algorithms presented are accompanied by full correctness proofs and executable source code in a new programming language, C(M), which focuses on transparency of steps and simplicity of code. Thus, by enabling readers to obtain a deep formal understanding of the subject and to put finite-state methods to real use, this book closes the gap between theory and practice.


Finite-state Methods and Natural Language Processing

Finite-state Methods and Natural Language Processing

Author: Jakub Piskorski

Publisher: IOS Press

Published: 2009

Total Pages: 248

ISBN-13: 158603975X

DOWNLOAD EBOOK

Contains papers that cover a range of Natural Language Processing (NLP) applications, including machine learning and translation, logic, computational phonology, morphology and semantics, data mining, information extraction and disambiguation, as well as programming, optimization and compression of finite-state networks.


Speech Recognition Algorithms Using Weighted Finite-State Transducers

Speech Recognition Algorithms Using Weighted Finite-State Transducers

Author: Takaaki Hori

Publisher: Springer Nature

Published: 2022-05-31

Total Pages: 161

ISBN-13: 3031025628

DOWNLOAD EBOOK

This book introduces the theory, algorithms, and implementation techniques for efficient decoding in speech recognition mainly focusing on the Weighted Finite-State Transducer (WFST) approach. The decoding process for speech recognition is viewed as a search problem whose goal is to find a sequence of words that best matches an input speech signal. Since this process becomes computationally more expensive as the system vocabulary size increases, research has long been devoted to reducing the computational cost. Recently, the WFST approach has become an important state-of-the-art speech recognition technology, because it offers improved decoding speed with fewer recognition errors compared with conventional methods. However, it is not easy to understand all the algorithms used in this framework, and they are still in a black box for many people. In this book, we review the WFST approach and aim to provide comprehensive interpretations of WFST operations and decoding algorithms to help anyone who wants to understand, develop, and study WFST-based speech recognizers. We also mention recent advances in this framework and its applications to spoken language processing. Table of Contents: Introduction / Brief Overview of Speech Recognition / Introduction to Weighted Finite-State Transducers / Speech Recognition by Weighted Finite-State Transducers / Dynamic Decoders with On-the-fly WFST Operations / Summary and Perspective


Applications of Finite-State Language Processing

Applications of Finite-State Language Processing

Author: Svetla Koeva

Publisher: Cambridge Scholars Publishing

Published: 2010-10-12

Total Pages: 225

ISBN-13: 1443826030

DOWNLOAD EBOOK

NooJ is both a corpus processing tool and a linguistic development environment: it allows linguists to formalize several levels of linguistic phenomena: orthography and spelling, lexicons for simple words, multiword units and frozen expressions, inflectional, derivational and productive morphology, local, structural syntax and transformational syntax. For each of these levels, NooJ provides linguists with one or more formal tools specifically designed to facilitate the description of each phenomenon, as well as parsing tools designed to be as computationally efficient as possible. This approach distinguishes NooJ from most computational linguistic tools, which provide a single formalism that should describe everything. As a corpus processing tool, NooJ allows users to apply sophisticated linguistic queries to large corpora in order to build indices and concordances, annotate texts automatically, perform statistical analyses, etc. NooJ is freely available and linguistic modules can already be downloaded for Acadian, Arabic, Armenian, Bulgarian, Catalan, Chinese, Croatian, French, English, German, Hebrew, Greek, Hungarian, Italian, Polish, Portuguese, Spanish and Turkish. The present volume contains papers from the 2008 International NooJ conference which was held 8–10 June 2008 in Budapest. While the focus of the Budapest conference was on making NooJ compatible with other applications, the papers vary with respect to whether they regard Natural Language Processing (NLP) as a research goal or as a tool. However, they all present a slightly different problem either in the field of NLP, or in one that can be solved using NLP, or present a new development in the tool itself. The range of problems dealt with in the volume is quite varied, which will hopefully enable the readers to find contributions that are relevant to their field of interest.


Introducing Speech and Language Processing

Introducing Speech and Language Processing

Author: John S. Coleman

Publisher: Cambridge University Press

Published: 2005-03-03

Total Pages: 324

ISBN-13: 9780521530699

DOWNLOAD EBOOK

This major new textbook provides a clearly-written, concise and accessible introduction to speech and language processing. Assuming knowledge of only the very basics of linguistics and written specifically for students with no technical background, it is the perfect starting point for anyone beginning to study the discipline. Student s are shown from an elementary level how to use two programming languages, C and Prolog, and the accompanying CD-ROM contains all the software needed. Setting an invaluable foundation for further study, this is set to become the leading introduction to the field.


An Introduction to Natural Language Processing Through Prolog

An Introduction to Natural Language Processing Through Prolog

Author: Clive Matthews

Publisher: Routledge

Published: 2016-07-01

Total Pages: 319

ISBN-13: 1317898346

DOWNLOAD EBOOK

Research into Natural Language Processing - the use of computers to process language - has developed over the last couple of decades into one of the most vigorous and interesting areas of current work on language and communication. This book introduces the subject through the discussion and development of various computer programs which illustrate some of the basic concepts and techniques in the field. The programming language used is Prolog, which is especially well-suited for Natural Language Processing and those with little or no background in computing. Following the general introduction, the first section of the book presents Prolog, and the following chapters illustrate how various Natural Language Processing programs may be written using this programming language. Since it is assumed that the reader has no previous experience in programming, great care is taken to provide a simple yet comprehensive introduction to Prolog. Due to the 'user friendly' nature of Prolog, simple yet effective programs may be written from an early stage. The reader is gradually introduced to various techniques for syntactic processing, ranging from Finite State Network recognisors to Chart parsers. An integral element of the book is the comprehensive set of exercises included in each chapter as a means of cementing the reader's understanding of each topic. Suggested answers are also provided. An Introduction to Natural Language Processing Through Prolog is an excellent introduction to the subject for students of linguistics and computer science, and will be especially useful for those with no background in the subject.