Synthetic Datasets for Statistical Disclosure Control

Synthetic Datasets for Statistical Disclosure Control

Author: Jörg Drechsler

Publisher: Springer Science & Business Media

Published: 2011-06-24

Total Pages: 148

ISBN-13: 146140326X

DOWNLOAD EBOOK

The aim of this book is to give the reader a detailed introduction to the different approaches to generating multiply imputed synthetic datasets. It describes all approaches that have been developed so far, provides a brief history of synthetic datasets, and gives useful hints on how to deal with real data problems like nonresponse, skip patterns, or logical constraints. Each chapter is dedicated to one approach, first describing the general concept followed by a detailed application to a real dataset providing useful guidelines on how to implement the theory in practice. The discussed multiple imputation approaches include imputation for nonresponse, generating fully synthetic datasets, generating partially synthetic datasets, generating synthetic datasets when the original data is subject to nonresponse, and a two-stage imputation approach that helps to better address the omnipresent trade-off between analytical validity and the risk of disclosure. The book concludes with a glimpse into the future of synthetic datasets, discussing the potential benefits and possible obstacles of the approach and ways to address the concerns of data users and their understandable discomfort with using data that doesn’t consist only of the originally collected values. The book is intended for researchers and practitioners alike. It helps the researcher to find the state of the art in synthetic data summarized in one book with full reference to all relevant papers on the topic. But it is also useful for the practitioner at the statistical agency who is considering the synthetic data approach for data dissemination in the future and wants to get familiar with the topic.


Data Disclosure

Data Disclosure

Author: Moritz Hennemann

Publisher: Walter de Gruyter GmbH & Co KG

Published: 2023-04-26

Total Pages: 228

ISBN-13: 3111010600

DOWNLOAD EBOOK

Data has become a key factor for the competitiveness of private and state actors alike. Personal data in particular fuels manifold corresponding data ecosystems - in many cases based on the disclosure decision of an individual. This volume presents the proceedings of the bidt "Vectors of Data Disclosure" conference held in Munich 2022. The contributions give comparative insights into the data disclosure process - combining perspectives of law, cultural studies, and business information systems. The authors thereby tackle the question in which way regulation and cultural settings shape (or do not shape) respective decisions in different parts of the world. The volume also includes interim results of the corresponding bidt research project - including in-depth reports covering the regulatory and cultural dimensions of data disclosure in eight different countries / regions worldwide, a business information systems model of the disclosure decision process, and empirical studies. The volume thereby lays the ground for interdisciplinary informed policy decisions and gives guidance to stakeholders.


Overview of the Privacy Act of 1974

Overview of the Privacy Act of 1974

Author: United States. Department of Justice. Privacy and Civil Liberties Office

Publisher:

Published: 2010

Total Pages: 276

ISBN-13:

DOWNLOAD EBOOK

The "Overview of the Privacy Act of 1974," prepared by the Department of Justice's Office of Privacy and Civil Liberties (OPCL), is a discussion of the Privacy Act's disclosure prohibition, its access and amendment provisions, and its agency recordkeeping requirements. Tracking the provisions of the Act itself, the Overview provides reference to, and legal analysis of, court decisions interpreting the Act's provisions.


Elements of Statistical Disclosure Control

Elements of Statistical Disclosure Control

Author: Leon Willenborg

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 273

ISBN-13: 1461301211

DOWNLOAD EBOOK

Statistical disclosure control is the discipline that deals with producing statistical data that are safe enough to be released to external researchers. This book concentrates on the methodology of the area. It deals with both microdata (individual data) and tabular (aggregated) data. The book attempts to develop the theory from what can be called the paradigm of statistical confidentiality: to modify unsafe data in such a way that safe (enough) data emerge, with minimum information loss. This book discusses what safe data, are, how information loss can be measured, and how to modify the data in a (near) optimal way. Once it has been decided how to measure safety and information loss, the production of safe data from unsafe data is often a matter of solving an optimization problem. Several such problems are discussed in the book, and most of them turn out to be hard problems that can be solved only approximately. The authors present new results that have not been published before. The book is not a description of an area that is closed, but, on the contrary, one that still has many spots awaiting to be more fully explored. Some of these are indicated in the book. The book will be useful for official, social and medical statisticians and others who are involved in releasing personal or business data for statistical use. Operations researchers may be interested in the optimization problems involved, particularly for the challenges they present. Leon Willenborg has worked at the Department of Statistical Methods at Statistics Netherlands since 1983, first as a researcher and since 1989 as a senior researcher. Since 1989 his main field of research and consultancy has been statistical disclosure control. From 1996-1998 he was the project coordinator of the EU co-funded SDC project.


Sharing Clinical Trial Data

Sharing Clinical Trial Data

Author: Institute of Medicine

Publisher: National Academies Press

Published: 2015-04-20

Total Pages: 236

ISBN-13: 0309316324

DOWNLOAD EBOOK

Data sharing can accelerate new discoveries by avoiding duplicative trials, stimulating new ideas for research, and enabling the maximal scientific knowledge and benefits to be gained from the efforts of clinical trial participants and investigators. At the same time, sharing clinical trial data presents risks, burdens, and challenges. These include the need to protect the privacy and honor the consent of clinical trial participants; safeguard the legitimate economic interests of sponsors; and guard against invalid secondary analyses, which could undermine trust in clinical trials or otherwise harm public health. Sharing Clinical Trial Data presents activities and strategies for the responsible sharing of clinical trial data. With the goal of increasing scientific knowledge to lead to better therapies for patients, this book identifies guiding principles and makes recommendations to maximize the benefits and minimize risks. This report offers guidance on the types of clinical trial data available at different points in the process, the points in the process at which each type of data should be shared, methods for sharing data, what groups should have access to data, and future knowledge and infrastructure needs. Responsible sharing of clinical trial data will allow other investigators to replicate published findings and carry out additional analyses, strengthen the evidence base for regulatory and clinical decisions, and increase the scientific knowledge gained from investments by the funders of clinical trials. The recommendations of Sharing Clinical Trial Data will be useful both now and well into the future as improved sharing of data leads to a stronger evidence base for treatment. This book will be of interest to stakeholders across the spectrum of research-from funders, to researchers, to journals, to physicians, and ultimately, to patients.


Statistical Disclosure Control for Microdata

Statistical Disclosure Control for Microdata

Author: Matthias Templ

Publisher: Springer

Published: 2017-05-05

Total Pages: 299

ISBN-13: 3319502727

DOWNLOAD EBOOK

This book on statistical disclosure control presents the theory, applications and software implementation of the traditional approach to (micro)data anonymization, including data perturbation methods, disclosure risk, data utility, information loss and methods for simulating synthetic data. Introducing readers to the R packages sdcMicro and simPop, the book also features numerous examples and exercises with solutions, as well as case studies with real-world data, accompanied by the underlying R code to allow readers to reproduce all results. The demand for and volume of data from surveys, registers or other sources containing sensible information on persons or enterprises have increased significantly over the last several years. At the same time, privacy protection principles and regulations have imposed restrictions on the access and use of individual data. Proper and secure microdata dissemination calls for the application of statistical disclosure control methods to the da ta before release. This book is intended for practitioners at statistical agencies and other national and international organizations that deal with confidential data. It will also be interesting for researchers working in statistical disclosure control and the health sciences.


Statistical Disclosure Control in Practice

Statistical Disclosure Control in Practice

Author: Leon Willenborg

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 164

ISBN-13: 146124028X

DOWNLOAD EBOOK

The aim of this book is to discuss various aspects associated with disseminating personal or business data collected in censuses or surveys or copied from administrative sources. The problem is to present the data in such a form that they are useful for statistical research and to provide sufficient protection for the individuals or businesses to whom the data refer. The major part of this book is concerned with how to define the disclosure problem and how to deal with it in practical circumstances.


Registries for Evaluating Patient Outcomes

Registries for Evaluating Patient Outcomes

Author: Agency for Healthcare Research and Quality/AHRQ

Publisher: Government Printing Office

Published: 2014-04-01

Total Pages: 385

ISBN-13: 1587634333

DOWNLOAD EBOOK

This User’s Guide is intended to support the design, implementation, analysis, interpretation, and quality evaluation of registries created to increase understanding of patient outcomes. For the purposes of this guide, a patient registry is an organized system that uses observational study methods to collect uniform data (clinical and other) to evaluate specified outcomes for a population defined by a particular disease, condition, or exposure, and that serves one or more predetermined scientific, clinical, or policy purposes. A registry database is a file (or files) derived from the registry. Although registries can serve many purposes, this guide focuses on registries created for one or more of the following purposes: to describe the natural history of disease, to determine clinical effectiveness or cost-effectiveness of health care products and services, to measure or monitor safety and harm, and/or to measure quality of care. Registries are classified according to how their populations are defined. For example, product registries include patients who have been exposed to biopharmaceutical products or medical devices. Health services registries consist of patients who have had a common procedure, clinical encounter, or hospitalization. Disease or condition registries are defined by patients having the same diagnosis, such as cystic fibrosis or heart failure. The User’s Guide was created by researchers affiliated with AHRQ’s Effective Health Care Program, particularly those who participated in AHRQ’s DEcIDE (Developing Evidence to Inform Decisions About Effectiveness) program. Chapters were subject to multiple internal and external independent reviews.


Environmental Regulation and Compulsory Public Disclosure

Environmental Regulation and Compulsory Public Disclosure

Author: Shakeb Afsah

Publisher: Routledge

Published: 2013

Total Pages: 151

ISBN-13: 0415657652

DOWNLOAD EBOOK

This book is a remarkable case study of an environmental policy initiative for a national environmental regulatory system in the information age. In 1995 the Indonesian Ministry of Environment took the bold step to launch an environmental disclosure initiative called the Program for Pollution Control, Evaluation and Rating (PROPER). Under PROPER, environmental performance of companies is mapped into a five-color grading scale - Gold for excellent, Green for very good, Blue for good, Red for non-compliance, and Black for causing environmental damage. These ratings are then publicly disclosed through a formal press conference and posted on the internet. Not only did this simple rating scheme create a major media buzz and enhanced environmental awareness of the general public, but it also unleashed a wide range of performance incentives that showed how markets with environmental information could function in a developing country setting. The authors provide a multidisciplinary analysis of how the PROPER program harnessed the power of public disclosure to abate the problem of industrial pollution. They describe how the program has successfully improved the average environmental compliance rate from close to thrity per cent in 1995 to as high as seventy per cent in 2011. This improvement was driven primarily by information disclosure, which avoided expensive and unpredictable legal enforcement through the court system of Indonesia. The combination of institutional history and detailed economic and analyses sheds light on the role of policy entrepreneurs who laid the foundation for disclosure and transparency, despite the constraints of the Suharto regime. The PROPER program is now internationally recognized and continues to serve as a model for many developing countries.