QUALICO has been held for the first time as an international conference to demonstrate the state of the art in quantitative linguistics. This domain of language study and research is gaining considerable interest due to recent advances in linguistic modelling, particularly in computational linguistics, cognitive science, and developments in mathematics like modern systems theory. Progress in hardware and software technology, together with ease of access to data and numerical processing, has provided new means of empirical data acquisition and the application of mathematical models of adequate complexity. This volume contains the papers read at QUALICO 91, and provides a representative overview of the state of the art in quantitative linguistic research.
Quantitative Linguistics is a rapidly developing discipline covering more and more areas of linguistic and textological research. The book represents an overview of the state of the art in Quantitative Linguistics, its scope and reach. Some of the topics: linguistic laws, frequency analyses, synergetic models of language, networks, part-of-speech systems, authorship attribution, polyfunctionality and polysemy, and opinion target identification.
Over the past two decades, statistical and other quantitative concepts, models and methods have been increasingly gaining importance and interest in all areas of linguistics and text analysis, as well as in a number of neighboring disciplines and areas of application. The term "quantitative linguistics" comprises all scientific and technical approaches which use such terms and methods in the analysis of or work with language(s), texts and other related subjects. The 71 articles in this handbook, written by internationally-recognized experts, offer a broad, up-to-date overview of the scientific-theoretical principles, the history, the diversity of the subject areas studied, the methods and models used, the results obtained thus far and their applications. The articles are divided up into thirteen chapters: the first chapter includes contributions on the basic principles and the history of the field, nine additional chapters are dedicated to individual descriptions of the levels of linguistic research (from phonology to pragmatics) as well as typological, diachronic and geolinguistic questions. The next two chapters include a description of important models, hypotheses and principles; selected areas of application; and references to neighboring disciplines. The last portion of the handbook is an informative contribution, with information about publication forums, bibliographies, major projects, Internet links, etc. This handbook is useful not only for researchers, teachers and students of all branches of linguistics and the philologies, but also for scientists in neighboring fields, whose theoretical and empirical research touches on linguistic questions (for instance, psychology and sociology), or for those who want to make use of the proven methods or results from quantitative linguistics in their own research.
Quantitative Methods in Linguistics offers a practical introduction to statistics and quantitative analysis with data sets drawn from the field and coverage of phonetics, psycholinguistics, sociolinguistics, historical linguistics, and syntax, as well as probability distribution and quantitative methods. Provides balanced treatment of the practical aspects of handling quantitative linguistic data Includes sample datasets contributed by researchers working in a variety of sub-disciplines of linguistics Uses R, the statistical software package most commonly used by linguists, to discover patterns in quantitative data and to test linguistic hypotheses Includes student-friendly end-of-chapter assignments and is accompanied by online resources at available in the 'Downloads' section, below
Quantitative Linguistics is a rapidly developing discipline covering more and more areas of linguistic and textological research. The book represents an overview of the state of the art in Quantitative Linguistics, its scope and reach. Some of the topics: linguistic laws, frequency analyses, synergetic models of language, networks, part-of-speech systems, authorship attribution, polyfunctionality and polysemy, and opinion target identification.
Designed to serve as a textbook for courses in statistical analysis in linguistics, this book orients the reader to various quantitative methods and explains their implications for the field. The methods include chi-square, Fisher test, binomial test, ANOVA, correlation, regression, and cluster analysis. The advantages and limitations of each method are detailed and each method is illustrated with exemplary articles presenting linguistic data.
This is the first book which brings together the fields of theoretical and empirical studies in syntax on the one hand and the methodology of quantitative linguistics on the other hand. The author provides the theoretical background for this enterprise on the basis of the philosophy of science and of linguistic considerations including a discussion of Chomsky’s attitude against the application of statistical methods to syntactic phenomena. He gives a short introduction into the aims and methods of the quantitative approach to linguistics in general and to syntax in particular. The following chapters inform the reader about the measurement of syntactic properties, possibilities to acquire empirical data from syntactically annotated text corpora and the most common mathematical models and methods for the analysis of syntactic and syntagmatic material. Then, a number of prominent approaches and hypotheses about interrelations between properties of syntactic constructions are presented and evaluated on material from various languages and text kinds. Finally, the theory of synergetic linguistics and its application to syntax is introduced including the integration of such famous hypotheses as Yngve’s depth hypothesis and Hawkins’s "Early immediate constituent" principle. The book concludes with a number of perspectives with respect to follow-up studies and extensions to the presented models with interfaces to neighbouring disciplines.
Presents a comprehensive introduction to analysing quantitative linguistic data. Starting with an definition of quantitative data, and how it differs from qualitative data, Seb Rasinger examines what the student linguist is trying to find out through analysing data, and how quantitative techniques can help arrive at meaningful and accurate conclusions. This expanded, 2nd edition now also includes a discussion of Analysis of Variance (ANOVA) and MANOVA, and provides a brief introduction to statistical meta-analysis. A companion website allows readers to download crib sheets and Excel templates for the main statistical tools. The book introduces: -using statistics -variables -reliability of data -describing data -analysing data -testing hypotheses -dealing with problematic data. Each chapter includes graphs and figures explaining theory through worked examples, chapter summaries, and exercises to aid student understanding. An appendix containing a summary of statistical formulae, excel commands and statistical tables is included and is an invaluable resource. Presenting a down-to-earth and readable introduction to quantitative research, this book is a useful how-to guide for students encountering quantitative data for the first time, or for postgraduates embarking on linguistic research projects.
Quantitative methods in linguistics, which the protean American structuralist linguist Morris Swadesh introduced in the 1950s, have become increasingly popular and have opened the world of languages to interdisciplinary approaches. The papers collected here are the work not only of descriptive and historical linguists, but also statisticians, physicists and computer scientists. They demonstrate the application of quantitative methods to the elucidation of linguistic prehistory on an unprecedented world-wide scale, providing cutting-edge insights into issues of the linguistic correlates of subsistence strategies, rates of birth and extinction of languages, lexical borrowability, the identification of language family homelands, the assessment of genealogical relationships, and the development of new phylogenetic methods appropriate for linguistic data. Originally published in Diachronica 27:2 (2010).
The linguistic community tend to regard statistical methods, or more generally quantitative techniques, with a certain amount of fear and suspicion. There is a feeling that statistics falls in the province of science and mathematics and such methods may destroy the magic of the literary text. This book seeks to make quantitative methods and statistical techniques less forbidding and show how they can contribute to linguistic analysis and research. It present some mathematical and statistical properties of natural languages and introduces some of the quantitative methods which are of the most value in working empirically with texts and corpora. The various issues are illustrated with helpful examples from the most basic descriptive techniques to decision-taking techniques and to more sophisticated multivariate statistical language models.