Humanities Data Analysis

Humanities Data Analysis

Author: Folgert Karsdorp

Publisher: Princeton University Press

Published: 2021-01-12

Total Pages: 352

ISBN-13: 0691172366

DOWNLOAD EBOOK

A practical guide to data-intensive humanities research using the Python programming language The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python Applicable to many humanities disciplines, including history, literature, and sociology Offers real-world case studies using publicly available data sets Provides exercises at the end of each chapter for students to test acquired skills Emphasizes visual storytelling via data visualizations


Humanities Data Analysis

Humanities Data Analysis

Author: Folgert Karsdorp

Publisher: Princeton University Press

Published: 2021-01-12

Total Pages: 360

ISBN-13: 0691200335

DOWNLOAD EBOOK

A practical guide to data-intensive humanities research using the Python programming language The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python Applicable to many humanities disciplines, including history, literature, and sociology Offers real-world case studies using publicly available data sets Provides exercises at the end of each chapter for students to test acquired skills Emphasizes visual storytelling via data visualizations


Data Analytics in Digital Humanities

Data Analytics in Digital Humanities

Author: Shalin Hai-Jew

Publisher: Springer

Published: 2017-05-03

Total Pages: 304

ISBN-13: 3319544993

DOWNLOAD EBOOK

This book covers computationally innovative methods and technologies including data collection and elicitation, data processing, data analysis, data visualizations, and data presentation. It explores how digital humanists have harnessed the hypersociality and social technologies, benefited from the open-source sharing not only of data but of code, and made technological capabilities a critical part of humanities work. Chapters are written by researchers from around the world, bringing perspectives from diverse fields and subject areas. The respective authors describe their work, their research, and their learning. Topics include semantic web for cultural heritage valorization, machine learning for parody detection by classification, psychological text analysis, crowdsourcing imagery coding in natural disasters, and creating inheritable digital codebooks.Designed for researchers and academics, this book is suitable for those interested in methodologies and analytics that can be applied in literature, history, philosophy, linguistics, and related disciplines. Professionals such as librarians, archivists, and historians will also find the content informative and instructive.


Quantitative Methods in the Humanities

Quantitative Methods in the Humanities

Author: Claire Lemercier

Publisher:

Published: 2019

Total Pages: 188

ISBN-13: 9780813942698

DOWNLOAD EBOOK

This timely and lucid guide is intended for students and scholars working on all historical periods and topics in the humanities and social sciences--especially for those who do not think of themselves as experts in quantification, "big data," or "digital humanities." The authors reveal quantification to be a powerful and versatile tool, applicable to a myriad of materials from the past. Their book, accessible to complete beginners, offers detailed advice and practical tips on how to build a dataset from historical sources and how to categorize it according to specific research questions. Drawing on examples from works in social, political, economic, and cultural history, the book guides readers through a wide range of methods, including sampling, cross-tabulations, statistical tests, regression, factor analysis, network analysis, sequence analysis, event history analysis, geographical information systems, text analysis, and visualization. The requirements, advantages, and pitfalls of these techniques are presented in layperson's terms, avoiding mathematical terminology. Conceived primarily for historians, the book will prove invaluable to other humanists, as well as to social scientists looking for a nontechnical introduction to quantitative methods. Covering the most recent techniques, in addition to others not often enough discussed, the book will also have much to offer to the most seasoned practitioners of quantification.


Computational History and Data-Driven Humanities

Computational History and Data-Driven Humanities

Author: Bojan Bozic

Publisher: Springer

Published: 2016-11-07

Total Pages: 133

ISBN-13: 3319462245

DOWNLOAD EBOOK

This book constitutes the refereed post-proceedings of the Second IFIP WG 12.7 International Workshop on Computational History and Data-Driven Humanities, held in Dublin, Ireland, in May 2016. The 7 full papers presented together with 2 invited talks and 4 lightning talks were carefully reviewed and selected from 14 submissions. The papers focus on the challenge and opportunities of data-driven humanities and cover topics at the interface between computer science, social science, humanities, and mathematics.


Humanities Data in R

Humanities Data in R

Author: Taylor Arnold

Publisher: Springer

Published: 2015-09-23

Total Pages: 218

ISBN-13: 3319207024

DOWNLOAD EBOOK

​This pioneering book teaches readers to use R within four core analytical areas applicable to the Humanities: networks, text, geospatial data, and images. This book is also designed to be a bridge: between quantitative and qualitative methods, individual and collaborative work, and the humanities and social sciences. Humanities Data with R does not presuppose background programming experience. Early chapters take readers from R set-up to exploratory data analysis (continuous and categorical data, multivariate analysis, and advanced graphics with emphasis on aesthetics and facility). Following this, networks, geospatial data, image data, natural language processing and text analysis each have a dedicated chapter. Each chapter is grounded in examples to move readers beyond the intimidation of adding new tools to their research. Everything is hands-on: networks are explained using U.S. Supreme Court opinions, and low-level NLP methods are applied to short stories by Sir Arthur Conan Doyle. After working through these examples with the provided data, code and book website, readers are prepared to apply new methods to their own work. The open source R programming language, with its myriad packages and popularity within the sciences and social sciences, is particularly well-suited to working with humanities data. R packages are also highlighted in an appendix. This book uses an expanded conception of the forms data may take and the information it represents. The methodology will have wide application in classrooms and self-study for the humanities, but also for use in linguistics, anthropology, and political science. Outside the classroom, this intersection of humanities and computing is particularly relevant for research and new modes of dissemination across archives, museums and libraries. ​


The Shape of Data in Digital Humanities

The Shape of Data in Digital Humanities

Author: Julia Flanders

Publisher: Routledge

Published: 2018-11-02

Total Pages: 382

ISBN-13: 1317016149

DOWNLOAD EBOOK

Data and its technologies now play a large and growing role in humanities research and teaching. This book addresses the needs of humanities scholars who seek deeper expertise in the area of data modeling and representation. The authors, all experts in digital humanities, offer a clear explanation of key technical principles, a grounded discussion of case studies, and an exploration of important theoretical concerns. The book opens with an orientation, giving the reader a history of data modeling in the humanities and a grounding in the technical concepts necessary to understand and engage with the second part of the book. The second part of the book is a wide-ranging exploration of topics central for a deeper understanding of data modeling in digital humanities. Chapters cover data modeling standards and the role they play in shaping digital humanities practice, traditional forms of modeling in the humanities and how they have been transformed by digital approaches, ontologies which seek to anchor meaning in digital humanities resources, and how data models inhabit the other analytical tools used in digital humanities research. It concludes with a glossary chapter that explains specific terms and concepts for data modeling in the digital humanities context. This book is a unique and invaluable resource for teaching and practising data modeling in a digital humanities context.


Guide to Intelligent Data Analysis

Guide to Intelligent Data Analysis

Author: Michael R. Berthold

Publisher: Springer Science & Business Media

Published: 2010-06-23

Total Pages: 399

ISBN-13: 184882260X

DOWNLOAD EBOOK

Each passing year bears witness to the development of ever more powerful computers, increasingly fast and cheap storage media, and even higher bandwidth data connections. This makes it easy to believe that we can now – at least in principle – solve any problem we are faced with so long as we only have enough data. Yet this is not the case. Although large databases allow us to retrieve many different single pieces of information and to compute simple aggregations, general patterns and regularities often go undetected. Furthermore, it is exactly these patterns, regularities and trends that are often most valuable. To avoid the danger of “drowning in information, but starving for knowledge” the branch of research known as data analysis has emerged, and a considerable number of methods and software tools have been developed. However, it is not these tools alone but the intelligent application of human intuition in combination with computational power, of sound background knowledge with computer-aided modeling, and of critical reflection with convenient automatic model construction, that results in successful intelligent data analysis projects. Guide to Intelligent Data Analysis provides a hands-on instructional approach to many basic data analysis techniques, and explains how these are used to solve data analysis problems. Topics and features: guides the reader through the process of data analysis, following the interdependent steps of project understanding, data understanding, data preparation, modeling, and deployment and monitoring; equips the reader with the necessary information in order to obtain hands-on experience of the topics under discussion; provides a review of the basics of classical statistics that support and justify many data analysis methods, and a glossary of statistical terms; includes numerous examples using R and KNIME, together with appendices introducing the open source software; integrates illustrations and case-study-style examples to support pedagogical exposition. This practical and systematic textbook/reference for graduate and advanced undergraduate students is also essential reading for all professionals who face data analysis problems. Moreover, it is a book to be used following one’s exploration of it. Dr. Michael R. Berthold is Nycomed-Professor of Bioinformatics and Information Mining at the University of Konstanz, Germany. Dr. Christian Borgelt is Principal Researcher at the Intelligent Data Analysis and Graphical Models Research Unit of the European Centre for Soft Computing, Spain. Dr. Frank Höppner is Professor of Information Systems at Ostfalia University of Applied Sciences, Germany. Dr. Frank Klawonn is a Professor in the Department of Computer Science and Head of the Data Analysis and Pattern Recognition Laboratory at Ostfalia University of Applied Sciences, Germany. He is also Head of the Bioinformatics and Statistics group at the Helmholtz Centre for Infection Research, Braunschweig, Germany.


Visualization and Interpretation

Visualization and Interpretation

Author: Johanna Drucker

Publisher: MIT Press

Published: 2020-11-10

Total Pages: 205

ISBN-13: 0262044730

DOWNLOAD EBOOK

An analysis of visual epistemology in the digital humanities, with attention to the need for interpretive digital tools within humanities contexts. In the several decades since humanists have taken up computational tools, they have borrowed many techniques from other fields, including visualization methods to create charts, graphs, diagrams, maps, and other graphic displays of information. But are these visualizations actually adequate for the interpretive approach that distinguishes much of the work in the humanities? Information visualization, as practiced today, lacks the interpretive frameworks required for humanities-oriented methodologies. In this book, Johanna Drucker continues her interrogation of visual epistemology in the digital humanities, reorienting the creation of digital tools within humanities contexts. Drucker examines various theoretical understandings of visual images and their relation to knowledge and how the specifics of the graphical are to be engaged directly as a primary means of knowledge production for digital humanities. She draws on work from aesthetics, critical theory, and formal study of graphical systems, addressing them within the specific framework of computational and digital activity as they apply to digital humanities. Finally, she presents a series of standard problems in visualization for the humanities (including time/temporality, space/spatial relations, and data analysis), posing the investigation in terms of innovative graphical systems informed by probabilistic critical hermeneutics. She concludes with a final brief sketch of discovery tools as an additional interface into which modeling can be worked.