This volume provides a selection of the papers which were presented at the thirteenth conference on Computational Linguistics in the Netherlands (held in Groningen in November 2002). The subjects covered in this book represent a cross-section of current research topics in computational linguistics ranging from theoretical to applied research and development. The target audience consists of students and scholars of computational linguistics as well as speech and language processing, both in academia and industry.
From the contents: Ideas on multi-layer dialogue management for multi-party, multi-conversation, multi-modal communication. - The Alpino dependency treebank. - Corpus-based acquisition of collocational prepositional phrases. - Conservative vs set-driven learning functions for the classes k-valued. - Memory-based phoneme-to-grapheme conversion. - Tagging the Dutch PAROLE corpus. - A named entity recognition system for Dutch.
This volume provides a selection of the papers which were presented at the eleventh conference on Computational Linguistics in the Netherlands (Tilburg, 2000). It gives an accurate and up-to-date picture of the lively scene of computational linguistics in the Netherlands and Flanders. The volume covers the whole range from theoretical to applied research and development, and is hence of interest to both academia and industry. The target audience consists of students and scholars of computational linguistics, and speech and language processing (Linguistics, Computer Science, Electrical Engineering).
This book constitutes the refereed proceedings of the 9th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2008, held in Haifa, Israel, in February 2008. The 52 revised full papers presented together with 4 invited papers were carefully reviewed and selected from numerous submissions. The papers cover all current issues in computational linguistics research and present intelligent text processing applications. The papers are organized in topical sections on language resources, morphology and syntax, semantics and discourse, word sense disambiguation and named entity recognition, anaphora and co-reference, machine translation and parallel corpora, natural language generation, speech recognition, information retrieval and question answering, text classification, text summarization, as well as spell checking and authoring aid.
In many areas of linguistics, text corpora, speech corpora, or multimodal corpora are now used as an empirical basis. Building on methods from the 19th century, new standards for linguistic annotation and preprocessing, as well as for qualitative and quantitative studies, have developed since electronic corpora emerged in the 1940s. The handbook offers a comprehensive overview of the history, methods, and applications of corpus linguistics. The individual survey and specialist articles are written by experts in their respective fields. Emphasis is placed on clear and comprehensive presentation, good cross-referencing between the articles, and pointers to further reading.
This book focuses mainly on logical approaches to computational linguistics, but also discusses integrations with other approaches, presenting both classic and newly emerging theories and applications. Decades of theoretical research and practical applications have demonstrated that computational linguistics is a distinctively interdisciplinary area. There is convincing evidence that computational approaches to linguistics can benefit from research on the nature of human language, including from the perspective of its evolution. This book addresses various topics in computational theories of human language, covering grammar, syntax, and semantics. The common thread running through the research presented is the role of computer science, mathematical logic and other areas of mathematics in computational linguistics and natural language processing (NLP). Promoting intelligent approaches to artificial intelligence (AI) and NLP, the book is intended for researchers and graduate students in the field.
This book uses new ideas and language for understanding how self-organization and complexity trend toward increased efficiency. Different measures for efficiency from multiple disciplines are used to probe the ones that provide the most insight. One major goal is to seek a common framework to trace the increase of efficiency as a measure of the level of organization and evolutionary stage of a complex system. The chapters come from a satellite meeting hosted at the Conference on Complex Systems, in Cancun, 2017. The contributions will be peer-reviewed and contributors from outside the conference will be invited to submit chapters to ensure full coverage of the topics. This text will appeal to students and researchers working on complex systems and efficiency.
This open access book presents a comprehensive collection of the European Language Equality (ELE) project’s results, its strategic agenda and roadmap with key recommendations to the European Union on how to achieve digital language equality in Europe by 2030. The fabric of the EU linguistic landscape comprises 24 official languages and over 60 regional and minority languages. However, language barriers still hamper communication and the free flow of information. Multilingualism is a key cultural cornerstone of Europe, signifying what it means to be and to feel European. Various studies and resolutions have found a striking imbalance in the support of Europe’s languages through technologies, issuing a call to action. Following an introduction, the book is divided into two parts. The first part describes the state of the art of language technology and language-centric AI and the definition and metrics developed to measure digital language equality. It also presents the status quo in 2022/2023, i.e., the current level of technology support for over 30 European languages. The second part describes plans and recommendations on how to bring about digital language equality in Europe by 2030. It includes chapters on the setup and results of the community consultation process, four technical deep dives, an overview of existing strategic documents and an abridged version of the strategic agenda and roadmap. The recommendations have been prepared jointly with the European community in the fields of language technology, natural language processing, and language-centric AI, as well as with representatives of relevant initiatives and associations, language communities and regional and minority language groups. Ensuring appropriate technology support for all European languages will not only create jobs, growth and opportunities in the digital single market. Overcoming language barriers in the digital environment is also essential for an inclusive society and for providing unity in diversity for many years to come.
The book provides an overview of more than a decade of joint R&D efforts in the Low Countries on HLT for Dutch. It not only presents the state of the art of HLT for Dutch in the areas covered, but, even more importantly, describes the resources (data and tools) for Dutch that have been created and are now available to both academia and industry worldwide. The contributions cover many areas of human language technology (for Dutch): corpus collection (including IPR issues) and building (in particular one corpus aiming at a collection of 500M word tokens), lexicology, anaphora resolution, a semantic network, parsing technology, speech recognition, machine translation, text (summaries) generation, web mining, information extraction, and text-to-speech, to name the most important ones. The book also shows how a medium-sized language community (spanning two territories) can create a digital language infrastructure (resources, tools, etc.) as a basis for subsequent R&D. At the same time, it bundles contributions of almost all the HLT research groups in Flanders and the Netherlands, and hence offers a view of their recent research activities. Targeted readers are mainly researchers in human language technology, in particular those focusing on Dutch. These include researchers active in larger networks such as CLARIN, META-NET and FLaReNet, and those participating in conferences such as ACL, EACL, NAACL, COLING, RANLP, CICLing, LREC, CLIN and DIR (both in the Low Countries), InterSpeech, ASRU, ICASSP, ISCA, EUSIPCO, CLEF, TREC, etc. In addition, some chapters are of interest to human language technology policy makers and even to science policy makers in general.