Corpus-based Perspectives in Linguistics

Author: Yuji Kawaguchi

Publisher: John Benjamins Publishing

Published: 2007

Total Pages: 464

ISBN-13: 9789027233189

UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the second UBLI workshop on Corpus Linguistics – Research Domain, held on September 14, 2006. The first part, consisting of eleven presentations from this workshop, covers a wide range of subjects within corpus-based research, such as dictionaries, linguistic atlases, dialects, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part comprises ten additional contributions on both written and spoken corpora by the members and research assistants of UBLI.


Computational and Corpus Approaches to Chinese Language Learning

Author: Xiaofei Lu

Publisher: Springer

Published: 2019-02-06

Total Pages: 268

ISBN-10: 9811335702

This book presents a collection of original research articles that showcase the state of the art of research in corpus and computational linguistic approaches to Chinese language teaching, learning and assessment. It offers a comprehensive set of corpus resources and natural language processing tools that are useful for teaching, learning and assessing Chinese as a second or foreign language; methods for implementing such resources and techniques in Chinese pedagogy and assessment; as well as research findings on the effectiveness of using such resources and techniques in various aspects of Chinese pedagogy and assessment.


Corpus-based Research in Applied Linguistics

Author: Viviana Cortes

Publisher: John Benjamins Publishing Company

Published: 2015-01-14

Total Pages: 240

ISBN-10: 902726905X

This volume comprises nine contributions written by up-and-coming corpus-based researchers with varied areas of expertise, all of whom were disciples of Douglas Biber at some point in the past two decades. These papers cover a wide variety of linguistic analyses and describe the principles of the Flagstaff school: a careful procedure for collecting language corpora, with special consideration for corpus size, representativeness, sampling and systematic analysis; the use of computer programming skills that make it possible to pose corpus-based research questions never asked before; and a strong emphasis on combining quantitative methods, based on sound and innovative statistical procedures, with comprehensive qualitative functional analyses of the language. This volume has been edited in honor of Douglas Biber, a pioneer of the American school of corpus-based research.


Computational Methods for Corpus Annotation and Analysis

Author: Xiaofei Lu

Publisher: Springer

Published: 2014-07-08

Total Pages: 192

ISBN-10: 9401786453

In the past few decades the use of increasingly large text corpora has grown rapidly in language and linguistics research. This growth was enabled by remarkable strides in natural language processing (NLP) technology, which allows computers to automatically and efficiently process, annotate and analyze large amounts of spoken and written text in linguistically and/or pragmatically meaningful ways. It has become more desirable than ever for language and linguistics researchers who use corpora to gain an adequate understanding of the relevant NLP technology so that they can take full advantage of its capabilities. This volume provides such researchers with an accessible introduction to the state-of-the-art NLP technology that facilitates automatic annotation and analysis of large text corpora at both shallow and deep linguistic levels. The book covers a wide range of computational tools for lexical, syntactic, semantic, pragmatic and discourse analysis, together with detailed instructions on how to obtain, install and use each tool on different operating systems and platforms. It illustrates how NLP technology has been applied in recent corpus-based language studies and suggests effective ways to better integrate such technology into future corpus linguistics research, making it a valuable reference for corpus annotation and analysis.


Advances in Corpus-based Contrastive Linguistics

Author: Karin Aijmer

Publisher: John Benjamins Publishing

Published: 2013-03-13

Total Pages: 307

ISBN-10: 9027272328

Contrastive studies have experienced a dramatic revival in recent decades. By combining the methodological advantages of computer corpus linguistics with the possibility of contrasting texts in two or more languages, the structure and use of languages can be explored with greater accuracy, detail and empirical strength than before. The approach has also proved to have fruitful practical applications in a number of areas such as language teaching, lexicography, translation studies and computer-aided translation. This volume contains twelve studies comparing linguistic phenomena in English and seven other languages. The topics range from comparisons of specific lexical categories and word combinations to syntactic constructions and discourse phenomena such as cohesion and thematic structure. The studies highlight similarities and differences in the use, semantics and functions of the compared items, as well as the emergence of new meanings and language change. The emphasis varies from purely linguistic studies to those focusing on practical applications.


Corpus linguistics

Author: Anatol Stefanowitsch

Publisher: Language Science Press

Published: 2020

Total Pages: 510

ISBN-10: 3961102244

Corpora are used widely in linguistics, but not always wisely. This book attempts to frame corpus linguistics systematically as a variant of the observational method. The first part introduces the reader to the general methodological discussions surrounding corpus data as well as the practice of doing corpus linguistics, including issues such as the scientific research cycle, research design, extraction of corpus data and statistical evaluation. The second part consists of a number of case studies from the main areas of corpus linguistics (lexical associations, morphology, grammar, text and metaphor), surveying the range of issues studied in corpus linguistics while at the same time showing how they fit into the methodology outlined in the first part.


Methods in Latin Computational Linguistics

Author: Barbara McGillivray

Publisher: BRILL

Published: 2013-11-29

Total Pages: 246

ISBN-10: 9004260129

In Methods in Latin Computational Linguistics, Barbara McGillivray presents some of the most significant methodological foundations of the emerging field of Latin Computational Linguistics. The reader will find an overview of the computational resources and tools available for Latin and three corpus case studies covering morpho-syntactic and lexical-semantic aspects of Latin verb valency, as well as quantitative diachronic explorations of the argument realization of Latin prefixed verbs. The computational models and the multivariate data analysis techniques employed are explained in detailed but accessible language. Barbara McGillivray convincingly shows the challenges and opportunities of combining computational methods and historical language data, and contributes to driving the technological change that is affecting Historical Linguistics and the Humanities.


Corpus Linguistics and Statistics with R

Author: Guillaume Desagulier

Publisher: Springer

Published: 2017-11-17

Total Pages: 359

ISBN-10: 3319645722

This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions for implementing the techniques in the field. The statistical methodology and R-based coding in this book teach readers the basic and then more advanced skills needed to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also appeal to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequency data, and clustering methods. Case studies illustrate each chapter, with accompanying data sets, R code, and exercises for readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.