Web As Corpus

Web As Corpus

Author: Maristella Gatto

Publisher: A&C Black

Published: 2014-02-13

Total Pages: 250

ISBN-13: 1472571533

DOWNLOAD EBOOK

Is the internet a suitable linguistic corpus? How can we use it in corpus techniques? What are the special properties that we need to be aware of? This book answers those questions. The Web is an exponentially increasing source of language and corpus linguistics data. From gigantic static information resources to user-generated Web 2.0 content, the breadth and depth of information available is breathtaking – and bewildering. This book explores the theory and practice of the “web as corpus”. It looks at the most common tools and methods used and features a plethora of examples based on the author's own teaching experience. This book also bridges the gap between studies in computational linguistics, which emphasize technical aspects, and studies in corpus linguistics, which focus on the implications for language theory and use.


Linguistic Informatics – State of the Art and the Future

Linguistic Informatics – State of the Art and the Future

Author: Yuji Kawaguchi

Publisher: John Benjamins Publishing

Published: 2005-04-14

Total Pages: 373

ISBN-13: 9027294429

DOWNLOAD EBOOK

It is widely believed that linguistic theories and information technology have considerably influenced foreign language education. However, the collaboration of these three domains has not brought about new scientific results. It it thus, our attempt to realize an integration of theoretical and applied linguistics on the basis of computer sciences, and establish a new synthetic field called "Linguistic Informatics." The present volume constitutes the Proceedings of the First International Conference on Linguistic Informatics held at Tokyo University of Foreign Studies (TUFS) in December 2003. The volume is comprised of five chapters. 1. Computer-Assisted Linguistics: Potential for collaboration between linguistics and informatics. 2. Corpus Linguistics : Status report on corpus-based linguistic research. 3. Applied Linguistics : Relationship between second language acquisition and linguistic theory. 4. Discourse Analysis and Language Teaching : Current status of natural dialogue-based discourse analysis. 5. TUFS Language Modules : Development of multilingual e-learning materials covering 17 different languages.


Corpus-based Perspectives in Linguistics

Corpus-based Perspectives in Linguistics

Author: Yuji Kawaguchi

Publisher: John Benjamins Publishing

Published: 2007

Total Pages: 464

ISBN-13: 9789027233189

DOWNLOAD EBOOK

UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the UBLI second workshop on Corpus Linguistics – Research Domain, which was held on September 14, 2006. The first part consisting of eleven presentations to this workshop shows a wide range of subjects within the area of corpus-based research, such as dictionary, linguistic atlas, dialect, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part of this volume comprises ten additional contributions to both written and spoken corpora by the members and research assistants of UBLI.


Spoken Language Understanding

Spoken Language Understanding

Author: Gokhan Tur

Publisher: John Wiley & Sons

Published: 2011-05-03

Total Pages: 443

ISBN-13: 1119993946

DOWNLOAD EBOOK

Spoken language understanding (SLU) is an emerging field in between speech and language processing, investigating human/ machine and human/ human communication by leveraging technologies from signal processing, pattern recognition, machine learning and artificial intelligence. SLU systems are designed to extract the meaning from speech utterances and its applications are vast, from voice search in mobile devices to meeting summarization, attracting interest from both commercial and academic sectors. Both human/machine and human/human communications can benefit from the application of SLU, using differing tasks and approaches to better understand and utilize such communications. This book covers the state-of-the-art approaches for the most popular SLU tasks with chapters written by well-known researchers in the respective fields. Key features include: Presents a fully integrated view of the two distinct disciplines of speech processing and language processing for SLU tasks. Defines what is possible today for SLU as an enabling technology for enterprise (e.g., customer care centers or company meetings), and consumer (e.g., entertainment, mobile, car, robot, or smart environments) applications and outlines the key research areas. Provides a unique source of distilled information on methods for computer modeling of semantic information in human/machine and human/human conversations. This book can be successfully used for graduate courses in electronics engineering, computer science or computational linguistics. Moreover, technologists interested in processing spoken communications will find it a useful source of collated information of the topic drawn from the two distinct disciplines of speech processing and language processing under the new area of SLU.


Corpus Linguistics

Corpus Linguistics

Author: Geoffrey Sampson

Publisher: A&C Black

Published: 2005-10-01

Total Pages: 541

ISBN-13: 1441139370

DOWNLOAD EBOOK

Corpus Linguistics seeks to provide a comprehensive sampling of real-life usage in a given language, and to use these empirical data to test language hypotheses. Modern corpus linguistics began fifty years ago, but the subject has seen explosive growth since the early 1990s. These days corpora are being used to advance virtually every aspect of language study, from computer processing techniques such as machine translation, to literary stylistics, social aspects of language use, and improved language-teaching methods. Because corpus linguistics has grown fast from small beginnings, newcomers to the field often find it hard to get their bearings. Important papers can be difficult to track down. This volume reprints forty-two articles on corpus linguistics by an international selection of authors, which comprehensively illustrate the directions in which the subject is developing. It includes articles that are already recognized as classics, and others which deserve to become so, supplemented with editorial introductions relating the individual contributions to the field as a whole. This collection of readings will be useful to students of corpus linguistics at both undergraduate and postgraduate level, as well as academics researching this fascinating area of linguistics.