Machine Learning in Translation Corpora Processing

Machine Learning in Translation Corpora Processing

Author: Krzysztof Wolk

Publisher: CRC Press

Published: 2019-02-25

Total Pages: 205

ISBN-13: 0429588836

DOWNLOAD EBOOK

This book reviews ways to improve statistical machine speech translation between Polish and English. Research has been conducted mostly on dictionary-based, rule-based, and syntax-based, machine translation techniques. Most popular methodologies and tools are not well-suited for the Polish language and therefore require adaptation, and language resources are lacking in parallel and monolingual data. The main objective of this volume to develop an automatic and robust Polish-to-English translation system to meet specific translation requirements and to develop bilingual textual resources by mining comparable corpora.


Machine Learning in Translation Corpora Processing

Machine Learning in Translation Corpora Processing

Author: Krzysztof Wolk

Publisher: CRC Press

Published: 2019-02-25

Total Pages: 264

ISBN-13: 0429590776

DOWNLOAD EBOOK

This book reviews ways to improve statistical machine speech translation between Polish and English. Research has been conducted mostly on dictionary-based, rule-based, and syntax-based, machine translation techniques. Most popular methodologies and tools are not well-suited for the Polish language and therefore require adaptation, and language resources are lacking in parallel and monolingual data. The main objective of this volume to develop an automatic and robust Polish-to-English translation system to meet specific translation requirements and to develop bilingual textual resources by mining comparable corpora.


Learning Machine Translation

Learning Machine Translation

Author: Cyril Goutte

Publisher: MIT Press

Published: 2009

Total Pages: 329

ISBN-13: 0262072971

DOWNLOAD EBOOK

How Machine Learning can improve machine translation: enabling technologies and new statistical techniques.


Neural Machine Translation

Neural Machine Translation

Author: Philipp Koehn

Publisher: Cambridge University Press

Published: 2020-06-18

Total Pages: 409

ISBN-13: 1108497322

DOWNLOAD EBOOK

Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.


Parallel Corpora for Contrastive and Translation Studies

Parallel Corpora for Contrastive and Translation Studies

Author: Irene Doval

Publisher: John Benjamins Publishing Company

Published: 2019-03-20

Total Pages: 313

ISBN-13: 9027262845

DOWNLOAD EBOOK

This volume assesses the state of the art of parallel corpus research as a whole, reporting on advances in both recent developments of parallel corpora – with some particular references to comparable corpora as well– and in ways of exploiting them for a variety of purposes. The first part of the book is devoted to new roles that parallel corpora can and should assume in translation studies and in contrastive linguistics, to the usefulness and usability of parallel corpora, and to advances in parallel corpus alignment, annotation and retrieval. There follows an up-to-date presentation of a number of parallel corpus projects currently being carried out in Europe, some of them multimodal, with certain chapters illustrating case studies developed on the basis of the corpora at hand. In most of these chapters, attention is paid to specific technical issues of corpus building. The third part of the book reflects on specific applications and on the creation of bilingual resources from parallel corpora. This volume will be welcomed by scholars, postgraduate and PhD students in the fields of contrastive linguistics, translation studies, lexicography, language teaching and learning, machine translation, and natural language processing.


Computational Linguistics and Intelligent Text Processing

Computational Linguistics and Intelligent Text Processing

Author: Alexander Gelbukh

Publisher: Springer Science & Business Media

Published: 2010-03-18

Total Pages: 778

ISBN-13: 3642121152

DOWNLOAD EBOOK

This book constitutes the proceedings of the 11th International Conference on Computational Linguistics and Intelligent Text Processing, held in Iaşi, Romania, in March 2010. The 60 paper included in the volume were carefully reviewed and selected from numerous submissions. The book also includes 3 invited papers. The topics covered are: lexical resources, syntax and parsing, word sense disambiguation and named entity recognition, semantics and dialog, humor and emotions, machine translation and multilingualism, information extraction, information retrieval, text categorization and classification, plagiarism detection, text summarization, and speech generation.


Language Corpora Annotation and Processing

Language Corpora Annotation and Processing

Author: Niladri Sekhar Dash

Publisher: Springer Nature

Published: 2021

Total Pages:

ISBN-13: 9811629609

DOWNLOAD EBOOK

This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.


Parallel Text Processing

Parallel Text Processing

Author: Jean Véronis

Publisher: Springer Science & Business Media

Published: 2000-09-30

Total Pages: 442

ISBN-13: 9780792365464

DOWNLOAD EBOOK

With the rising importance of multilingualism in language industries, brought about by global markets and world-wide information exchange, parallel corpora, i.e. corpora of texts accompanied by their translation, have become key resources in the development of natural language processing tools. The applications based upon parallel corpora are numerous and growing in number: multilingual lexicography and terminology, machine and human translation, cross-language information retrieval, language learning, etc. The book's chapters have been commissioned from major figures in the field of parallel corpus building and exploitation, with the aim of showing the state of the art in parallel text alignment and use ten to fifteen years after the first parallel-text alignment techniques were developed. Within the book, the following broad themes are addressed: (i) techniques for the alignment of parallel texts at various levels such as sentence, clause, and word; (ii) the use of parallel texts in fields as diverse as translation, lexicography, and information retrieval; (iii) available corpus resources and the evaluation of alignment methods. The book will be of interest to researchers and advanced students of computational linguistics, terminology, lexicography and translation, both in academia and industry.


Cohesion, Coherence and Temporal Reference from an Experimental Corpus Pragmatics Perspective

Cohesion, Coherence and Temporal Reference from an Experimental Corpus Pragmatics Perspective

Author: Cristina Grisot

Publisher: Springer

Published: 2018-10-06

Total Pages: 340

ISBN-13: 3319967525

DOWNLOAD EBOOK

This open access book provides new methodological and theoretical insights into temporal reference and its linguistic expression, from a cross-linguistic experimental corpus pragmatics approach. Verbal tenses, in general, and more specifically the categories of tense, grammatical and lexical aspect are treated as cohesion ties contributing to the temporal coherence of a discourse, as well as to the cognitive temporal coherence of the mental representations built in the language comprehension process. As such, it investigates the phenomenon of temporal reference at the interface between corpus linguistics, theoretical linguistics and pragmatics, experimental pragmatics, psycholinguistics, natural language processing and machine translation.