Data Science Ethics

Data Science Ethics

Author: David Martens

Publisher: Oxford University Press

Published: 2022-03-24

Total Pages: 273

ISBN-13: 0192847260

DOWNLOAD EBOOK

Data science ethics is all about what is right and wrong when conducting data science. Data science has so far been primarily used for positive outcomes for businesses and society. However, just as with any technology, data science has also come with some negative consequences: an increase of privacy invasion, data-driven discrimination against sensitive groups, and decision making by complex models without explanations. While data scientists and business managers are not inherently unethical, they are not trained to weigh the ethical considerations that come from their work - Data Science Ethics addresses this increasingly significant gap and highlights different concepts and techniques that aid understanding, ranging from k-anonymity and differential privacy to homomorphic encryption and zero-knowledge proofs to address privacy concerns, techniques to remove discrimination against sensitive groups, and various explainable AI techniques. Real-life cautionary tales further illustrate the importance and potential impact of data science ethics, including tales of racist bots, search censoring, government backdoors, and face recognition. The book is punctuated with structured exercises that provide hypothetical scenarios and ethical dilemmas for reflection that teach readers how to balance the ethical concerns and the utility of data.


97 Things About Ethics Everyone in Data Science Should Know

97 Things About Ethics Everyone in Data Science Should Know

Author: Bill Franks

Publisher: O'Reilly Media

Published: 2020-08-06

Total Pages: 347

ISBN-13: 149207263X

DOWNLOAD EBOOK

Most of the high-profile cases of real or perceived unethical activity in data science aren’t matters of bad intent. Rather, they occur because the ethics simply aren’t thought through well enough. Being ethical takes constant diligence, and in many situations identifying the right choice can be difficult. In this in-depth book, contributors from top companies in technology, finance, and other industries share experiences and lessons learned from collecting, managing, and analyzing data ethically. Data science professionals, managers, and tech leaders will gain a better understanding of ethics through powerful, real-world best practices. Articles include: Ethics Is Not a Binary Concept—Tim Wilson How to Approach Ethical Transparency—Rado Kotorov Unbiased ≠ Fair—Doug Hague Rules and Rationality—Christof Wolf Brenner The Truth About AI Bias—Cassie Kozyrkov Cautionary Ethics Tales—Sherrill Hayes Fairness in the Age of Algorithms—Anna Jacobson The Ethical Data Storyteller—Brent Dykes Introducing Ethicize™, the Fully AI-Driven Cloud-Based Ethics Solution!—Brian O’Neill Be Careful with "Decisions of the Heart"—Hugh Watson Understanding Passive Versus Proactive Ethics—Bill Schmarzo


Handbook of Ethics in Quantitative Methodology

Handbook of Ethics in Quantitative Methodology

Author: A. T. Panter

Publisher: Routledge

Published: 2011-03-01

Total Pages: 508

ISBN-13: 1136888721

DOWNLOAD EBOOK

This comprehensive Handbook is the first to provide a practical, interdisciplinary review of ethical issues as they relate to quantitative methodology including how to present evidence for reliability and validity, what comprises an adequate tested population, and what constitutes scientific knowledge for eliminating biases. The book uses an ethical framework that emphasizes the human cost of quantitative decision making to help researchers understand the specific implications of their choices. The order of the Handbook chapters parallels the chronology of the research process: determining the research design and data collection; data analysis; and communicating findings. Each chapter: Explores the ethics of a particular topic Identifies prevailing methodological issues Reviews strategies and approaches for handling such issues and their ethical implications Provides one or more case examples Outlines plausible approaches to the issue including best-practice solutions. Part 1 presents ethical frameworks that cross-cut design, analysis, and modeling in the behavioral sciences. Part 2 focuses on ideas for disseminating ethical training in statistics courses. Part 3 considers the ethical aspects of selecting measurement instruments and sample size planning and explores issues related to high stakes testing, the defensibility of experimental vs. quasi-experimental research designs, and ethics in program evaluation. Decision points that shape a researchers’ approach to data analysis are examined in Part 4 – when and why analysts need to account for how the sample was selected, how to evaluate tradeoffs of hypothesis-testing vs. estimation, and how to handle missing data. Ethical issues that arise when using techniques such as factor analysis or multilevel modeling and when making causal inferences are also explored. The book concludes with ethical aspects of reporting meta-analyses, of cross-disciplinary statistical reform, and of the publication process. This Handbook appeals to researchers and practitioners in psychology, human development, family studies, health, education, sociology, social work, political science, and business/marketing. This book is also a valuable supplement for quantitative methods courses required of all graduate students in these fields.


Registries for Evaluating Patient Outcomes

Registries for Evaluating Patient Outcomes

Author: Agency for Healthcare Research and Quality/AHRQ

Publisher: Government Printing Office

Published: 2014-04-01

Total Pages: 385

ISBN-13: 1587634333

DOWNLOAD EBOOK

This User’s Guide is intended to support the design, implementation, analysis, interpretation, and quality evaluation of registries created to increase understanding of patient outcomes. For the purposes of this guide, a patient registry is an organized system that uses observational study methods to collect uniform data (clinical and other) to evaluate specified outcomes for a population defined by a particular disease, condition, or exposure, and that serves one or more predetermined scientific, clinical, or policy purposes. A registry database is a file (or files) derived from the registry. Although registries can serve many purposes, this guide focuses on registries created for one or more of the following purposes: to describe the natural history of disease, to determine clinical effectiveness or cost-effectiveness of health care products and services, to measure or monitor safety and harm, and/or to measure quality of care. Registries are classified according to how their populations are defined. For example, product registries include patients who have been exposed to biopharmaceutical products or medical devices. Health services registries consist of patients who have had a common procedure, clinical encounter, or hospitalization. Disease or condition registries are defined by patients having the same diagnosis, such as cystic fibrosis or heart failure. The User’s Guide was created by researchers affiliated with AHRQ’s Effective Health Care Program, particularly those who participated in AHRQ’s DEcIDE (Developing Evidence to Inform Decisions About Effectiveness) program. Chapters were subject to multiple internal and external independent reviews.


The Handbook of Social Research Ethics

The Handbook of Social Research Ethics

Author: Donna M. Mertens

Publisher: SAGE

Published: 2009

Total Pages: 689

ISBN-13: 1412949181

DOWNLOAD EBOOK

Brings together international scholars across the social and behavioural sciences and education to address those ethical issues that arise in the theory and practice of research within the technologically advancing and culturally complex world in which we live.


Ethics and Data Science

Ethics and Data Science

Author: Mike Loukides

Publisher: "O'Reilly Media, Inc."

Published: 2018-07-25

Total Pages: 37

ISBN-13: 1492078212

DOWNLOAD EBOOK

As the impact of data science continues to grow on society there is an increased need to discuss how data is appropriately used and how to address misuse. Yet, ethical principles for working with data have been available for decades. The real issue today is how to put those principles into action. With this report, authors Mike Loukides, Hilary Mason, and DJ Patil examine practical ways for making ethical data standards part of your work every day. To help you consider all of possible ramifications of your work on data projects, this report includes: A sample checklist that you can adapt for your own procedures Five framing guidelines (the Five C’s) for building data products: consent, clarity, consistency, control, and consequences Suggestions for building ethics into your data-driven culture Now is the time to invest in a deliberate practice of data ethics, for better products, better teams, and better outcomes. Get a copy of this report and learn what it takes to do good data science today.


The Big Data Agenda

The Big Data Agenda

Author: Annika Richterich

Publisher: University of Westminster Press

Published: 2018-04-13

Total Pages: 156

ISBN-13: 1911534734

DOWNLOAD EBOOK

This book highlights that the capacity for gathering, analysing, and utilising vast amounts of digital (user) data raises significant ethical issues. Annika Richterich provides a systematic contemporary overview of the field of critical data studies that reflects on practices of digital data collection and analysis. The book assesses in detail one big data research area: biomedical studies, focused on epidemiological surveillance. Specific case studies explore how big data have been used in academic work. The Big Data Agenda concludes that the use of big data in research urgently needs to be considered from the vantage point of ethics and social justice. Drawing upon discourse ethics and critical data studies, Richterich argues that entanglements between big data research and technology/ internet corporations have emerged. In consequence, more opportunities for discussing and negotiating emerging research practices and their implications for societal values are needed.


Ethics in Statistics

Ethics in Statistics

Author: Hassan Doosti

Publisher: Ethics International Press

Published: 2024-03-29

Total Pages: 598

ISBN-13: 1871891663

DOWNLOAD EBOOK

Data plays a vital role in different parts of our lives. In the world of big data, and policy determined by a variety of statistical artifacts, discussions around the ethics of data gathering, manipulation and presentation are increasingly important. Ethics in Statistics aims to make a significant contribution to that debate. The processes of gathering data through sampling, summarising of the findings, and extending results to a population, need to be checked via an ethical prospective, as well as a statistical one. Statistical learning without ethics can be harmful for mankind. This edited collection brings together contributors in the field of data science, data analytics and statistics, to share their thoughts about the role of ethics in different aspects of statistical learning.


The Cult of Statistical Significance

The Cult of Statistical Significance

Author: Stephen Thomas Ziliak

Publisher: University of Michigan Press

Published: 2008-02-19

Total Pages: 349

ISBN-13: 0472050079

DOWNLOAD EBOOK

How the most important statistical method used in many of the sciences doesn't pass the test for basic common sense


Modern Data Science with R

Modern Data Science with R

Author: Benjamin S. Baumer

Publisher: CRC Press

Published: 2021-03-31

Total Pages: 830

ISBN-13: 0429575394

DOWNLOAD EBOOK

From a review of the first edition: "Modern Data Science with R... is rich with examples and is guided by a strong narrative voice. What’s more, it presents an organizing framework that makes a convincing argument that data science is a course distinct from applied statistics" (The American Statistician). Modern Data Science with R is a comprehensive data science textbook for undergraduates that incorporates statistical and computational thinking to solve real-world data problems. Rather than focus exclusively on case studies or programming syntax, this book illustrates how statistical programming in the state-of-the-art R/RStudio computing environment can be leveraged to extract meaningful information from a variety of data in the service of addressing compelling questions. The second edition is updated to reflect the growing influence of the tidyverse set of packages. All code in the book has been revised and styled to be more readable and easier to understand. New functionality from packages like sf, purrr, tidymodels, and tidytext is now integrated into the text. All chapters have been revised, and several have been split, re-organized, or re-imagined to meet the shifting landscape of best practice.