Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig

Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig

Author: Dawn Knight

Publisher: Springer Nature

Published: 2021-07-05

Total Pages: 178

ISBN-13: 3030724840

DOWNLOAD EBOOK

This bilingual book provides a detailed overview of the project to construct a National Corpus of Contemporary Welsh (CorCenCC), addressing the conceptual and methodological challenges faced when developing language corpora for minoritised languages. A conceptual framework is presented for the user-driven design that underpinned the CorCenCC project, along with a detailed blueprint that can function as a scaffold for other researchers embarking on projects of this nature. This book will be of value to those working in language teaching, learning and assessment, language policy and planning, translation, corpus linguistics and language technology, and to anyone with an interest in Welsh and other minoritised languages. Mae'r llyfr dwyieithog hwn yn rhoi trosolwg manwl o'r prosiect i greu Corpws Cenedlaethol Cymraeg Cyfoes (CorCenCC), ac yn mynd i'r afael â'r heriau cysyniadol a methodolegol a wynebir wrth ddatblygu corpora iaith ar gyfer ieithoedd lleiafrifoledig. Cyflwynir fframwaith cysyniadol ar gyfer y cynllun wedi'i yrru gan ddefnyddwyr sy'n greiddiol i brosiect CorCenCC, ynghyd â glasbrint manwl a all weithredu fel sgaffald i ymchwilwyr eraill sy'n dechrau ar brosiectau o'r fath. Bydd y llyfr hwn o werth i'r rhai sy'n gweithio ym meysydd addysgu, dysgu ac asesu ieithoedd, polisi iaith a chynllunio ieithyddol, cyfieithu, ieithyddiaeth gorpws a thechnoleg iaith, ac unrhyw un â diddordeb yn y Gymraeg ac ieithoedd lleiafrifoledig eraill.


The Routledge Handbook of Corpus Linguistics

The Routledge Handbook of Corpus Linguistics

Author: Anne O'Keeffe

Publisher: Taylor & Francis

Published: 2022-02-08

Total Pages: 755

ISBN-13: 0429634137

DOWNLOAD EBOOK

The Routledge Handbook of Corpus Linguistics 2e provides an updated overview of a dynamic and rapidly growing area with a widely applied methodology. Over a decade on from the first edition of the Handbook, this collection of 47 chapters from experts in key areas offers a comprehensive introduction to both the development and use of corpora as well as their ever-evolving applications to other areas, such as digital humanities, sociolinguistics, stylistics, translation studies, materials design, language teaching and teacher development, media discourse, discourse analysis, forensic linguistics, second language acquisition and testing. The new edition updates all core chapters and includes new chapters on corpus linguistics and statistics, digital humanities, translation, phonetics and phonology, second language acquisition, social media and theoretical perspectives. Chapters provide annotated further reading lists and step-by-step guides as well as detailed overviews across a wide range of themes. The Handbook also includes a wealth of case studies that draw on some of the many new corpora and corpus tools that have emerged in the last decade. Organised across four themes, moving from the basic start-up topics such as corpus building and design to analysis, application and reflection, this second edition remains a crucial point of reference for advanced undergraduates, postgraduates and scholars in applied linguistics.


Developing Linguistic Corpora

Developing Linguistic Corpora

Author: Martin Wynne

Publisher: Oxbow Books Limited

Published: 2005

Total Pages: 100

ISBN-13:

DOWNLOAD EBOOK

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.


Corpus and Context

Corpus and Context

Author: Svenja Adolphs

Publisher: John Benjamins Publishing

Published: 2008

Total Pages: 176

ISBN-13: 9789027223043

DOWNLOAD EBOOK

Corpus and Context explores the relationship between corpus linguistics and pragmatics by discussing possible frameworks for analysing utterance function on the basis of spoken corpora. The book articulates the challenges and opportunities associated with a change of focus in corpus research, from lexical to functional units, from concordance lines to extended stretches of discourse, and from the purely textual to multi-modal analysis of spoken corpus data. Drawing on a number of spoken corpora including the five million word Cambridge and Nottingham Corpus of Discourse in English (CANCODE, funded by CUP (c)), a specific speech act function is being explored using different approaches and different levels of analysis. This involves a close analysis of contextual variables in relation to lexico-grammatical and discoursal patterns that emerge from the corpus data, as well as a wider discussion of the role of context in spoken corpus research.


Building and Using the Siarad Corpus

Building and Using the Siarad Corpus

Author: Margaret Deuchar

Publisher:

Published: 2018

Total Pages: 0

ISBN-13: 9789027200112

DOWNLOAD EBOOK

Introduction -- Building the corpus. Data collection and profile of the speakers in our corpus -- Transcription of the data -- Code-switching vs. borrowing: New implications arising from our data -- Using the corpus. The grammar of code-switching -- Code-switching and independent variables -- Change in Welsh grammar -- Additional research using Siarad -- Conclusion and future directions


Corpus Pragmatics

Corpus Pragmatics

Author: Karin Aijmer

Publisher: Cambridge University Press

Published: 2015

Total Pages: 481

ISBN-13: 1107015049

DOWNLOAD EBOOK

The first handbook to survey and expand the burgeoning field of corpus pragmatics, the intersection of pragmatics and corpus linguistics.


Comparative Stylistics of Welsh and English

Comparative Stylistics of Welsh and English

Author: Steve Morris

Publisher: University of Wales Press

Published: 2018-07-15

Total Pages: 308

ISBN-13: 1786832577

DOWNLOAD EBOOK

An analysis of Welsh stylistics in a corpus of 20th and 21st century texts. A study of the structure of Welsh compared with English via a translation corpus. A study of methods in translation.


The Language of Business Meetings

The Language of Business Meetings

Author: Michael Handford

Publisher: Cambridge University Press

Published: 2010-08-19

Total Pages: 289

ISBN-13: 052111666X

DOWNLOAD EBOOK

This book presents a corpus-based study of the language used in business meetings.


Multimodality and Active Listenership

Multimodality and Active Listenership

Author: Dawn Knight

Publisher: Bloomsbury Publishing

Published: 2011-10-13

Total Pages: 260

ISBN-13: 1441107479

DOWNLOAD EBOOK

Current corpora are invaluable resources for generating accurate and objective analyses of patterns of language use. However, spoken corpora are effectively mono-modal, presenting data in the same physical medium – text. The reality of a discourse situation is lost in its representation as text. Using multimodal data sets when conducting corpus-based pragmatic analyses is one solution. This book looks at multimodal corpora in some depth, using backchanneling as the conversational feature to be analysed. It provides a bottom-up investigation of the issues and challenges faced at every stage of multimodal corpus construction and analysis, as well as providing an in-depth linguistic analysis of a cross section of multimodal corpus data. The collaborative and co-operative nature of backchannels is highlighted in this book and an adapted pragmatic-functional linguistic coding matrix for the characterisation of backchanneling phenomena is presented. Dawn Knight also looks at possible directions in the construction and use of multimodal corpus linguistics.


Building a National Corpus

Building a National Corpus

Author: Dawn Knight

Publisher: Springer Nature

Published: 2021-10-08

Total Pages: 192

ISBN-13: 3030818586

DOWNLOAD EBOOK

This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches discussed in the building of CorCenCC can be applied to a lesser or greater extent to other language contexts. This book provides a working model, and an account of how to build a corpus dataset from which step by step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.