Industrial Parsing of Software Manuals

Industrial Parsing of Software Manuals

Author: Sutcliffe

Publisher: BRILL

Published: 2023-11-27

Total Pages: 287

ISBN-13: 9004653619

DOWNLOAD EBOOK

The task of language engineering is to develop the technology for building computer systems which can perform useful linguistic tasks such as machine assisted translation, text retrieval, message classification and document summarisation. Such systems often require the use of a parser which can extract specific types of grammatical data from pre-defined classes of input text. There are many parsers already available for use in language engineering systems. However, many different linguistic formalisms and parsing algorithms are employed. Grammatical coverage varies, as does the nature of the syntactic information extracted. Direct comparison between systems is difficult because each is likely to have been evaluated using different test criteria. In this volume, eight different parsers are applied to the same task, that of analysing a set of sentences derived from software instruction manuals. Each parser is presented in a separate chapter. Evaluation of performance is carried out using a standard set of criteria with the results being presented in a set of tables which have the same format for each system. Three additional chapters provide further analysis of the results as well as discussing possible approaches to the standardisation of parse tree data. Five parse trees are provided for each system in an appendix, allowing further direct comparison between systems by the reader. The book will be of interest to students, researchers and practitioners in the areas of computational linguistics, computer science, information retrieval, language engineering, linguistics and machine assisted translation.


Recent Advances in Natural Language Processing

Recent Advances in Natural Language Processing

Author: Nicolas Nicolov

Publisher: John Benjamins Publishing

Published: 2000-09-15

Total Pages: 436

ISBN-13: 9027283974

DOWNLOAD EBOOK

This volume brings together revised versions of a selection of papers presented at the Second International Conference on “Recent Advances in Natural Language Processing” (RANLP’97) held in Tzigov Chark, Bulgaria, September 1997. The aim of the conference was to give researchers the opportunity to present new results in Natural Language Processing (NLP) based both on traditional and modern theories and approaches. The conference received substantial interest — 167 submissions from more than 20 countries. The best papers from the proceedings were selected for this volume, in the hope that they reflect the most significant and promising trends (and successful results) in NLP. The contributions have been grouped according to the following topics: tagging, lexical issues and parsing, word sense disambiguation and anaphora resolution, semantics, generation, machine translation, and categorisation and applications. The volume contains an extensive index.


Recent Advances in Natural Language Processing II

Recent Advances in Natural Language Processing II

Author: Nicolas Nicolov

Publisher: John Benjamins Publishing

Published: 2000

Total Pages: 435

ISBN-13: 902723695X

DOWNLOAD EBOOK

This volume brings together revised versions of a selection of papers presented at the Second International Conference on “Recent Advances in Natural Language Processing” (RANLP'97) held in Tzigov Chark, Bulgaria, September 1997. The aim of the conference was to give researchers the opportunity to present new results in Natural Language Processing (NLP) based both on traditional and modern theories and approaches. The conference received substantial interest — 167 submissions from more than 20 countries. The best papers from the proceedings were selected for this volume, in the hope that they reflect the most significant and promising trends (and successful results) in NLP. The contributions have been grouped according to the following topics: tagging, lexical issues and parsing, word sense disambiguation and anaphora resolution, semantics, generation, machine translation, and categorisation and applications. The volume contains an extensive index.


Corpus Linguistics. Volume 1

Corpus Linguistics. Volume 1

Author: Anke Lüdeling

Publisher: Walter de Gruyter

Published: 2008-12-10

Total Pages: 797

ISBN-13: 3110211424

DOWNLOAD EBOOK

This volume provides an up-to-date survey of the field of corpus linguistics, a field whose methodology has revolutionized much of the empirical work done in most fields of linguistic study over the past decade. Corpus linguistics investigates human language by starting out from large collections of texts - spoken, written, or recorded. These language corpora, which are now regularly available in electronic form, are the basis for quantitative and qualitative research on almost any question of linguistic interest. Many techniques that are in use in corpus linguistics today are rooted in the tradition of the late 18th and 19th century, when linguistics began to make use of mathematical and empirical methods. Modern corpus linguistics has used and developed these methods in close connection with computer science and computational linguistics. The handbook sketches the history of corpus linguistics, shows its potential, discusses its problems, and describes various methods of collecting, annotating, and searching corpora as well as processing corpus data. It also reports case studies that illustrate the wide range of linguistic research questions addressed in corpus linguistics. The over 60 articles included in the handbook are divided into five sections: (1) the origins and history of corpus linguistics and surveys of its relationship to central fields of linguistics (2) corpus compilation (3) corpus types (4) preprocessing of corpora (5) the use and exploitation of corpora. The final section gives an overview of the results of corpus studies obtained in phonetics, phonology, morphology, syntax, semantics, sociolinguistics, historical linguistics, stylometry, dialectology, and discourse analysis. It also reports on recent advances made in human and machine translation, contrastive studies, computer-assisted language learning, and automatic summarization. The contributors to the volume are internationally known experts in their respective fields. The handbook is intended for a wide audience ranging from teachers, university students, and scholars to anyone interested in the use of computers in linguistic analyses and applications.


Parallel corpora, parallel worlds

Parallel corpora, parallel worlds

Author:

Publisher: BRILL

Published: 2016-09-12

Total Pages: 227

ISBN-13: 9004334297

DOWNLOAD EBOOK

From the contents: Stig JOHANSSON: Towards a multilingual corpus for contrastive analysis and translation studies. - Anna SAGVALL HEIN: The PLUG project: parallel corpora in Linkoping, Uppsala, Goteborg: aims and achievements. - Raphael SALKIE: How can linguists profit from parallel corpora? - Trond TROSTERUD: Parallel corpora as tools for investigating and developing minority languages."


Corpus Linguistics. Volume 2

Corpus Linguistics. Volume 2

Author: Anke Lüdeling

Publisher: Walter de Gruyter

Published: 2009-03-26

Total Pages: 606

ISBN-13: 3110213885

DOWNLOAD EBOOK

In vielen Bereichen der Linguistik werden Textkorpora, Sprachkorpora oder multimodale Korpora heute als empirische Basis verwendet. Aufbauend auf Methoden des 19. Jahrhunderts haben sich dabei mit dem Aufkommen von elektronischen Korpora seit den 1940ern neue Standards für linguistische Annotation und Vorverarbeitung sowie für qualitative und quantitative Untersuchungen entwickelt. Das Handbuch bietet einen umfassenden Überblick über Geschichte, Methoden und Anwendungen der Korpuslinguistik. Die einzelnen Überblicks- und Spezialartikel sind von Experten und Expertinnen der jeweiligen Gebiete geschrieben. Dabei wird auf klare und umfassende Darstellung, eine gute Vernetzung zwischen den Artikel und weiterführende Hinweise Wert gelegt.


Treebanks

Treebanks

Author: A. Abeillé

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 411

ISBN-13: 9401002010

DOWNLOAD EBOOK

This book provides a state of the art on work being done with parsed corpora. It gathers 21 papers on building and using parsed corpora raising many relevant questions, and deals with a variety of languages and a variety of corpora. It is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.


Computational Linguistics and Intelligent Text Processing

Computational Linguistics and Intelligent Text Processing

Author: Alexander Gelbukh

Publisher: Springer Science & Business Media

Published: 2006-02-03

Total Pages: 605

ISBN-13: 3540322051

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 7th International Conference on Computational Linguistics and Intelligent Text Processing, held in February 2006. The 43 revised full papers and 16 revised short papers presented together with three invited papers were carefully reviewed and selected from 176 submissions. The papers are structured into two parts and organized in topical sections on computational linguistics research.