This volume provides a selection of the papers which were presented at the thirteenth conference on Computational Linguistics in the Netherlands (held in Groningen in November 2002). The subjects covered in this book represent a cross-section of current research topics in computational linguistics ranging from theoretical to applied research and development. The target audience consists of students and scholars of computational linguistics as well as speech and language processing, both in academia and industry.
From the contents: Ideas on multi-layer dialogue management for multi-party, multi-conversation, multi-modal communication. - The Alpino dependency treebank. - Corpus-based acquisition of collocational prepositional phrases. - Conservative vs set-driven learning functions for the classes k-valued. - Memory-based phoneme-to-grapheme conversion. - Tagging the Dutch PAROLE corpus. - A named entity recognition system for Dutch.
This book constitutes the proceedings of the 11th International Conference on Computational Linguistics and Intelligent Text Processing, held in Iaşi, Romania, in March 2010. The 60 papers included in the volume were carefully reviewed and selected from numerous submissions. The book also includes 3 invited papers. The topics covered are: lexical resources, syntax and parsing, word sense disambiguation and named entity recognition, semantics and dialog, humor and emotions, machine translation and multilingualism, information extraction, information retrieval, text categorization and classification, plagiarism detection, text summarization, and speech generation.
This comprehensive reference work provides an overview of the concepts, methodologies, and applications in computational linguistics and natural language processing (NLP). It features contributions by the top researchers in the field, reflecting the work that is driving the discipline forward. It includes an introduction to the major theoretical issues in these fields, as well as the central engineering applications that the work has produced. It presents the major developments in an accessible way, explaining the close connection between scientific understanding of the computational properties of natural language and the creation of effective language technologies. It serves as an invaluable state-of-the-art reference source for computational linguists and software engineers developing NLP applications in industrial research and development labs of software companies.
In many areas of linguistics, text corpora, speech corpora, or multimodal corpora are now used as an empirical basis. Building on methods of the 19th century, the advent of electronic corpora since the 1940s has brought new standards for linguistic annotation and preprocessing as well as for qualitative and quantitative studies. The handbook offers a comprehensive overview of the history, methods, and applications of corpus linguistics. The individual overview and specialist articles are written by experts in the respective fields. Emphasis is placed on clear and comprehensive presentation, good cross-referencing between the articles, and pointers to further reading.
This book constitutes the refereed proceedings of the 6th International Conference on Text, Speech and Dialogue, TSD 2003, held in České Budějovice, Czech Republic, in September 2003. The 60 revised full papers presented together with 2 invited contributions were carefully reviewed and selected from 121 submissions. The papers present a wealth of state-of-the-art research and development results in the field of natural language processing with an emphasis on text, speech, and spoken language ranging from theoretical and methodological issues to applications in various fields, such as web information retrieval, the semantic web, algorithmic learning, and dialogue systems.
This book constitutes the refereed proceedings of the 9th International Conference on Advances in Natural Language Processing, PolTAL 2014, held in Warsaw, Poland, in September 2014. The 27 revised full papers and 20 revised short papers presented were carefully reviewed and selected from 83 submissions. The papers are organized in topical sections on morphology, named entity recognition, term extraction; lexical semantics; sentence level syntax, semantics, and machine translation; discourse, coreference resolution, automatic summarization, and question answering; text classification, information extraction and information retrieval; and speech processing, language modelling, and spell- and grammar-checking.
This volume aims to overcome subdisciplinary boundaries in the study of linguistic variation - be it language-internal or cross-linguistic. Even though dialectologists, register analysts, typologists, and quantitative linguists all deal with linguistic variation, there is astonishingly little interaction across these fields. But the fourteen contributions in this volume show that these subdisciplines actually share many interests and methodological concerns. The chapters specifically converge in the following ways: First, they all seek to explore linguistic variation, within or across languages. Second, they are based on usage data, that is, on corpora of (more or less) authentic text or speech of different languages or language varieties. Third, all chapters are concerned with the joint analysis (also sometimes known as “aggregation” or “data synthesis”) of multiple phenomena, features, or measurements of some sort. And lastly, the contributors all marshal quantitative analysis techniques to analyse the data. In short, the volume explores the text-feature-aggregation pipeline in variation studies, demonstrating that there is much mutual inspiration to be had by thinking outside the disciplinary box.