Humanities Data Analysis

Humanities Data Analysis

Author: Folgert Karsdorp

Publisher: Princeton University Press

Published: 2021-01-12

Total Pages: 352

ISBN-13: 0691172366

DOWNLOAD EBOOK

A practical guide to data-intensive humanities research using the Python programming language The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python Applicable to many humanities disciplines, including history, literature, and sociology Offers real-world case studies using publicly available data sets Provides exercises at the end of each chapter for students to test acquired skills Emphasizes visual storytelling via data visualizations


Data Analytics in Digital Humanities

Data Analytics in Digital Humanities

Author: Shalin Hai-Jew

Publisher: Springer

Published: 2017-05-03

Total Pages: 304

ISBN-13: 3319544993

DOWNLOAD EBOOK

This book covers computationally innovative methods and technologies including data collection and elicitation, data processing, data analysis, data visualizations, and data presentation. It explores how digital humanists have harnessed the hypersociality and social technologies, benefited from the open-source sharing not only of data but of code, and made technological capabilities a critical part of humanities work. Chapters are written by researchers from around the world, bringing perspectives from diverse fields and subject areas. The respective authors describe their work, their research, and their learning. Topics include semantic web for cultural heritage valorization, machine learning for parody detection by classification, psychological text analysis, crowdsourcing imagery coding in natural disasters, and creating inheritable digital codebooks.Designed for researchers and academics, this book is suitable for those interested in methodologies and analytics that can be applied in literature, history, philosophy, linguistics, and related disciplines. Professionals such as librarians, archivists, and historians will also find the content informative and instructive.


Humanities Data Analysis

Humanities Data Analysis

Author: Folgert Karsdorp

Publisher: Princeton University Press

Published: 2021-01-12

Total Pages: 360

ISBN-13: 0691200335

DOWNLOAD EBOOK

A practical guide to data-intensive humanities research using the Python programming language The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python Applicable to many humanities disciplines, including history, literature, and sociology Offers real-world case studies using publicly available data sets Provides exercises at the end of each chapter for students to test acquired skills Emphasizes visual storytelling via data visualizations


Quantitative Methods in the Humanities

Quantitative Methods in the Humanities

Author: Claire Lemercier

Publisher:

Published: 2019

Total Pages: 188

ISBN-13: 9780813942698

DOWNLOAD EBOOK

This timely and lucid guide is intended for students and scholars working on all historical periods and topics in the humanities and social sciences--especially for those who do not think of themselves as experts in quantification, "big data," or "digital humanities." The authors reveal quantification to be a powerful and versatile tool, applicable to a myriad of materials from the past. Their book, accessible to complete beginners, offers detailed advice and practical tips on how to build a dataset from historical sources and how to categorize it according to specific research questions. Drawing on examples from works in social, political, economic, and cultural history, the book guides readers through a wide range of methods, including sampling, cross-tabulations, statistical tests, regression, factor analysis, network analysis, sequence analysis, event history analysis, geographical information systems, text analysis, and visualization. The requirements, advantages, and pitfalls of these techniques are presented in layperson's terms, avoiding mathematical terminology. Conceived primarily for historians, the book will prove invaluable to other humanists, as well as to social scientists looking for a nontechnical introduction to quantitative methods. Covering the most recent techniques, in addition to others not often enough discussed, the book will also have much to offer to the most seasoned practitioners of quantification.


Visualization and Interpretation

Visualization and Interpretation

Author: Johanna Drucker

Publisher: MIT Press

Published: 2020-11-10

Total Pages: 205

ISBN-13: 0262044730

DOWNLOAD EBOOK

An analysis of visual epistemology in the digital humanities, with attention to the need for interpretive digital tools within humanities contexts. In the several decades since humanists have taken up computational tools, they have borrowed many techniques from other fields, including visualization methods to create charts, graphs, diagrams, maps, and other graphic displays of information. But are these visualizations actually adequate for the interpretive approach that distinguishes much of the work in the humanities? Information visualization, as practiced today, lacks the interpretive frameworks required for humanities-oriented methodologies. In this book, Johanna Drucker continues her interrogation of visual epistemology in the digital humanities, reorienting the creation of digital tools within humanities contexts. Drucker examines various theoretical understandings of visual images and their relation to knowledge and how the specifics of the graphical are to be engaged directly as a primary means of knowledge production for digital humanities. She draws on work from aesthetics, critical theory, and formal study of graphical systems, addressing them within the specific framework of computational and digital activity as they apply to digital humanities. Finally, she presents a series of standard problems in visualization for the humanities (including time/temporality, space/spatial relations, and data analysis), posing the investigation in terms of innovative graphical systems informed by probabilistic critical hermeneutics. She concludes with a final brief sketch of discovery tools as an additional interface into which modeling can be worked.


Big Data in Computational Social Science and Humanities

Big Data in Computational Social Science and Humanities

Author: Shu-Heng Chen

Publisher: Springer

Published: 2018-11-21

Total Pages: 391

ISBN-13: 3319954652

DOWNLOAD EBOOK

This edited volume focuses on big data implications for computational social science and humanities from management to usage. The first part of the book covers geographic data, text corpus data, and social media data, and exemplifies their concrete applications in a wide range of fields including anthropology, economics, finance, geography, history, linguistics, political science, psychology, public health, and mass communications. The second part of the book provides a panoramic view of the development of big data in the fields of computational social sciences and humanities. The following questions are addressed: why is there a need for novel data governance for this new type of data?, why is big data important for social scientists?, and how will it revolutionize the way social scientists conduct research? With the advent of the information age and technologies such as Web 2.0, ubiquitous computing, wearable devices, and the Internet of Things, digital society has fundamentally changed what we now know as "data", the very use of this data, and what we now call "knowledge". Big data has become the standard in social sciences, and has made these sciences more computational. Big Data in Computational Social Science and Humanities will appeal to graduate students and researchers working in the many subfields of the social sciences and humanities.


Text Analysis with R

Text Analysis with R

Author: Matthew L. Jockers

Publisher: Springer Nature

Published: 2020-03-30

Total Pages: 283

ISBN-13: 3030396436

DOWNLOAD EBOOK

Now in its second edition, Text Analysis with R provides a practical introduction to computational text analysis using the open source programming language R. R is an extremely popular programming language, used throughout the sciences; due to its accessibility, R is now used increasingly in other research areas. In this volume, readers immediately begin working with text, and each chapter examines a new technique or process, allowing readers to obtain a broad exposure to core R procedures and a fundamental understanding of the possibilities of computational text analysis at both the micro and the macro scale. Each chapter builds on its predecessor as readers move from small scale “microanalysis” of single texts to large scale “macroanalysis” of text corpora, and each concludes with a set of practice exercises that reinforce and expand upon the chapter lessons. The book’s focus is on making the technical palatable and making the technical useful and immediately gratifying. Text Analysis with R is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological toolkit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that readers simply cannot gather using traditional qualitative methods of close reading and human synthesis. This new edition features two new chapters: one that introduces dplyr and tidyr in the context of parsing and analyzing dramatic texts to extract speaker and receiver data, and one on sentiment analysis using the syuzhet package. It is also filled with updated material in every chapter to integrate new developments in the field, current practices in R style, and the use of more efficient algorithms.


The Shape of Data in Digital Humanities

The Shape of Data in Digital Humanities

Author: Julia Flanders

Publisher: Routledge

Published: 2018-11-02

Total Pages: 373

ISBN-13: 1317016149

DOWNLOAD EBOOK

Data and its technologies now play a large and growing role in humanities research and teaching. This book addresses the needs of humanities scholars who seek deeper expertise in the area of data modeling and representation. The authors, all experts in digital humanities, offer a clear explanation of key technical principles, a grounded discussion of case studies, and an exploration of important theoretical concerns. The book opens with an orientation, giving the reader a history of data modeling in the humanities and a grounding in the technical concepts necessary to understand and engage with the second part of the book. The second part of the book is a wide-ranging exploration of topics central for a deeper understanding of data modeling in digital humanities. Chapters cover data modeling standards and the role they play in shaping digital humanities practice, traditional forms of modeling in the humanities and how they have been transformed by digital approaches, ontologies which seek to anchor meaning in digital humanities resources, and how data models inhabit the other analytical tools used in digital humanities research. It concludes with a glossary chapter that explains specific terms and concepts for data modeling in the digital humanities context. This book is a unique and invaluable resource for teaching and practising data modeling in a digital humanities context.


Digital Humanities and Film Studies

Digital Humanities and Film Studies

Author: Adelheid Heftberger

Publisher: Springer

Published: 2019-01-02

Total Pages: 316

ISBN-13: 303002864X

DOWNLOAD EBOOK

This book highlights the quantitative methods of data mining and information visualization and explores their use in relation to the films and writings of the Russian director, Dziga Vertov. The theoretical basis of the work harkens back to the time when a group of Russian artists and scholars, known as the “formalists,” developed new concepts of how art could be studied and measured. This book brings those ideas to the digital age. One of the central questions the book intends to address is, “How can hypothetical notions in film studies be supported or falsified using empirical data and statistical tools?” The first stage involves manual and computer-assisted annotation of the films, leading to the production of empirical data which is then used for statistical analysis but more importantly for the development of visualizations. Studies of this type furthermore shed light on the field of visual presentation of time-based processes; an area which has its origin in the Russian formalist sphere of the 1920s and which has recently gained new relevance due to technological advances and new possibilities for computer-assisted analysis of large and complex data sets. In order to reach a profound understanding of Vertov and his films, the manual or computer-assisted data analysis must be combined with film-historical knowledge and a study of primary sources. In addition, the status of the surviving film materials and the precise analysis of these materials combined with knowledge of historical film technology provide insight into archival policy and political culture in the Soviet Union in the 1920s and 30s.