Corpus-based Perspectives in Linguistics

Author: Yuji Kawaguchi

Publisher: John Benjamins Publishing

Published: 2007

Total Pages: 464

ISBN-13: 9789027233189

UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the second UBLI workshop on Corpus Linguistics – Research Domain, held on September 14, 2006. The first part, consisting of eleven presentations from this workshop, covers a wide range of subjects within corpus-based research, such as dictionaries, linguistic atlases, dialects, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part comprises ten additional contributions on both written and spoken corpora by the members and research assistants of UBLI.


Corpus-based and Computational Approaches to Discourse Anaphora

Author: Simon Botley

Publisher: John Benjamins Publishing

Published: 2000

Total Pages: 270

ISBN-13: 9781556193972

Discourse anaphora is a challenging linguistic phenomenon that has given rise to research in fields as diverse as linguistics, computational linguistics and cognitive science. Because of the diversity of approaches these fields bring to the anaphora problem, the editors of this volume argue that there needs to be a synthesis, or at least a principled attempt to draw the differing strands of anaphora research together. The selected papers in this volume all contribute to the aim of synthesis and were selected to represent the growing importance of corpus-based and computational approaches to anaphora description, and to developing natural language systems for resolving anaphora in natural language.


Corpus-based Research in Applied Linguistics

Author: Viviana Cortes

Publisher: John Benjamins Publishing Company

Published: 2015-01-14

Total Pages: 240

ISBN-10: 902726905X

This volume comprises nine contributions that were written by up-and-coming corpus-based researchers with varied areas of expertise, who were all disciples of Douglas Biber sometime in the past two decades. These papers cover a wide variety of linguistic analyses and describe the principles of the Flagstaff school: a careful procedure for language corpora collection with special consideration for corpus size, representativeness, sampling and systematic analysis; the use of computer programming abilities that allow the posing of corpus-based research questions never asked before; and a strong emphasis on the combination of quantitative methods based on sound and innovative statistical procedures complemented with comprehensive qualitative functional analyses of the language. This volume has been edited in honor of Douglas Biber, a pioneer of the American school of corpus-based research.


Natural Language Processing Using Very Large Corpora

Author: S. Armstrong

Publisher: Springer Science & Business Media

Published: 2013-04-17

Total Pages: 314

ISBN-10: 9401723907

This book is intended for researchers who want to keep abreast of current developments in corpus-based natural language processing. It is not meant as an introduction to this field; for readers who need one, several entry-level texts are available, including those of Church and Mercer (1993), Charniak (1993), and Jelinek (1997). This book captures the essence of a series of highly successful workshops held in the last few years. The response in 1993 to the initial Workshop on Very Large Corpora (Columbus, Ohio) was so enthusiastic that we were encouraged to make it an annual event. The following year, we staged the Second Workshop on Very Large Corpora in Kyoto. As a way of managing these annual workshops, we then decided to register a special interest group called SIGDAT with the Association for Computational Linguistics. The demand for international forums on corpus-based NLP has been expanding so rapidly that in 1995 SIGDAT was led to organize not only the Third Workshop on Very Large Corpora (Cambridge, Mass.) but also a complementary workshop entitled From Texts to Tags (Dublin). Obviously, the success of these workshops was in some measure a reflection of the growing popularity of corpus-based methods in the NLP community. But first and foremost, it was due to the fact that the workshops attracted so many high-quality papers.


Corpus Linguistics at Work

Author: Elena Tognini-Bonelli

Publisher: John Benjamins Publishing

Published: 2001-04-11

Total Pages: 238

ISBN-10: 9027285446

The book offers a combined discussion of the main theoretical, methodological and application issues related to corpus work. Thus, starting from the definition of what is a corpus and why reading a corpus calls for a different methodology from reading a text, the underlying assumptions behind corpus work are discussed. The two main approaches to corpus work are discussed as the “corpus-based” and the “corpus-driven” approach and the theoretical positions underlying them explored in detail. The book adopts and exemplifies the parameters of the corpus-driven approach and posits a new unit of linguistic description defined systematically in the light of corpus evidence. The applications where the corpus-driven approach is exemplified are language teaching and contrastive linguistics. Alternating between practical examples and theoretical evaluation, the reader is led step-by-step to a detailed understanding of the issues involved in corpus work and, at the same time, tempted to explore for himself some of the major applications where a corpus-driven methodology can reveal unprecedented insights into linguistic patterning.


Corpus Linguistics and Statistics with R

Author: Guillaume Desagulier

Publisher: Springer

Published: 2017-11-17

Total Pages: 359

ISBN-10: 3319645722

This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequency data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.


Advances in Corpus-Based Contrastive Linguistics

Author: Karin Aijmer

Publisher: John Benjamins Publishing

Published: 2013

Total Pages: 306

ISBN-10: 9027203598

Contrastive studies have experienced a dramatic revival in the last decades. By combining the methodological advantages of computer corpus linguistics and the possibility of contrasting texts in two or more languages, the structure and use of languages can be explored with greater accuracy, detail and empirical strength than before. The approach has also proved to have fruitful practical applications in a number of areas such as language teaching, lexicography, translation studies and computer-aided translation. This volume contains twelve studies comparing linguistic phenomena in English and seven other languages. The topics range from comparisons of specific lexical categories and word combinations to syntactic constructions and discourse phenomena such as cohesion and thematic structure. The studies highlight similarities and differences in the use, semantics and functions of the compared items, as well as the emergence of new meanings and language change. The emphasis varies from purely linguistic studies to those focusing on practical applications.


Corpus Linguistics: An Introduction

Author: Niladri Sekhar Dash

Publisher: Pearson Education India

Published: 2008

Total Pages: 208

ISBN-10: 8131752623

Corpus Linguistics: An Introduction will appeal to a wide spectrum of scholars, researchers, and particularly to students of linguistics. It offers guidelines for the creation and usage of corpora in the form of empirical language databases with direct functional and theoretical interpretation of a natural language. Drawn from original research and written in an accessible language and style, this book will create avenues for further advancements in mainstream and applied linguistics and language technology.