The War on Statistical Significance

The War on Statistical Significance

Author: DONALD B. MACNAUGHTON

Publisher:

Published: 2021-03-30

Total Pages: 260

ISBN-13:

DOWNLOAD EBOOK

From the preface The "threshold p-value"-the arbiter of statistical significance-has been a widely used gateway to believability and acceptance for publication in scientific research since 1925. However, a growing number of statisticians and other researchers say we should "move beyond" these ideas, suggesting we should greatly reduce our emphasis on them in scientific research. These authors are waging a well-intentioned, polite, and vigorous intellectual war on the ideas of a threshold p-value and statistical significance. This is a "good" war, because it forces important issues into the open, where they can be best understood and assessed. This book grew from a sense that the threshold-p-value gateway to publication of scientific research results is highly useful but is also widely misunderstood. The book presents, from first principles, a modern view of the role of the gateway, as used by some scientific journals. The ideas are explained in terms of the recent disagreement about them between the editorial in a Special Issue on Statistical Inference of the American Statistician and a subsequent editorial in the New England Journal of Medicine. The ideas are developed with almost no reference to mathematics. (A computer can do all the standard math if the user properly understands the key ideas.) The explanations are reinforced with practical examples. The discussion shows how the concept of a threshold-p-value gateway helps researchers and journal editors maximize the overall scientific, social, and commercial benefit of scientific research. The gateway does this by optimally balancing the rates of costly "false-positive" and "false-negative" errors in a scientific journal. The book also discusses the important related ideas of a relationship between variables, a scientific hypothesis test, and the "replication crisis" in some branches of scientific research. The body of the book, which covers the key ideas, is roughly 30% of the text. The remainder consists of 23 appendices that expand the ideas in useful directions. The material is aimed at scientific researchers, journal editors, science teachers, and science students in the biological, social, and physical sciences. It will also be of interest to statisticians, data scientists, philosophers of science, and lay readers seeking an integrated modern view of the high-level operation of the study of relationships between variables in scientific research. About the author Donald B. Macnaughton has been a statistical consultant for more than 40 years. He has managed the statistical aspects of research in the fields of experimental psychology, zoology, drug dependence, nursing, education, business, geography, physical education, and inmate rehabilitation, among others. His consulting work supports and informs his main interest, which is to read, understand, and write about the vital role of the field of statistics in scientific research.


Spatio-Temporal Statistics with R

Spatio-Temporal Statistics with R

Author: Christopher K. Wikle

Publisher: CRC Press

Published: 2019-02-18

Total Pages: 380

ISBN-13: 0429649789

DOWNLOAD EBOOK

The world is becoming increasingly complex, with larger quantities of data available to be analyzed. It so happens that much of these "big data" that are available are spatio-temporal in nature, meaning that they can be indexed by their spatial locations and time stamps. Spatio-Temporal Statistics with R provides an accessible introduction to statistical analysis of spatio-temporal data, with hands-on applications of the statistical methods using R Labs found at the end of each chapter. The book: Gives a step-by-step approach to analyzing spatio-temporal data, starting with visualization, then statistical modelling, with an emphasis on hierarchical statistical models and basis function expansions, and finishing with model evaluation Provides a gradual entry to the methodological aspects of spatio-temporal statistics Provides broad coverage of using R as well as "R Tips" throughout. Features detailed examples and applications in end-of-chapter Labs Features "Technical Notes" throughout to provide additional technical detail where relevant Supplemented by a website featuring the associated R package, data, reviews, errata, a discussion forum, and more The book fills a void in the literature and available software, providing a bridge for students and researchers alike who wish to learn the basics of spatio-temporal statistics. It is written in an informal style and functions as a down-to-earth introduction to the subject. Any reader familiar with calculus-based probability and statistics, and who is comfortable with basic matrix-algebra representations of statistical models, would find this book easy to follow. The goal is to give as many people as possible the tools and confidence to analyze spatio-temporal data.


Anthology of Statistics in Sports

Anthology of Statistics in Sports

Author: Jim Albert

Publisher: SIAM

Published: 2005-01-31

Total Pages: 298

ISBN-13: 0898715873

DOWNLOAD EBOOK

Sport and statistics collide in this collection of articles (from American Statistical Association publications) on using statistics to analyze sport. Most of the articles will be accessible to readers with a general knowledge of statistics. New material from the editors and other notable contributors introduces each section of the book.


Strength in Numbers: The Rising of Academic Statistics Departments in the U. S.

Strength in Numbers: The Rising of Academic Statistics Departments in the U. S.

Author: Alan Agresti

Publisher: Springer Science & Business Media

Published: 2012-11-02

Total Pages: 558

ISBN-13: 1461436494

DOWNLOAD EBOOK

Statistical science as organized in formal academic departments is relatively new. With a few exceptions, most Statistics and Biostatistics departments have been created within the past 60 years. This book consists of a set of memoirs, one for each department in the U.S. created by the mid-1960s. The memoirs describe key aspects of the department’s history -- its founding, its growth, key people in its development, success stories (such as major research accomplishments) and the occasional failure story, PhD graduates who have had a significant impact, its impact on statistical education, and a summary of where the department stands today and its vision for the future. Read here all about how departments such as at Berkeley, Chicago, Harvard, and Stanford started and how they got to where they are today. The book should also be of interests to scholars in the field of disciplinary history.


MM Optimization Algorithms

MM Optimization Algorithms

Author: Kenneth Lange

Publisher: SIAM

Published: 2016-07-11

Total Pages: 229

ISBN-13: 1611974399

DOWNLOAD EBOOK

MM Optimization Algorithms?offers an overview of the MM principle, a device for deriving optimization algorithms satisfying the ascent or descent property. These algorithms can separate the variables of a problem, avoid large matrix inversions, linearize a problem, restore symmetry, deal with equality and inequality constraints gracefully, and turn a nondifferentiable problem into a smooth problem.? The author presents the first extended treatment of MM algorithms, which are ideal for high-dimensional optimization problems in data mining, imaging, and genomics; derives numerous algorithms from a broad diversity of application areas, with a particular emphasis on statistics, biology, and data mining; and summarizes a large amount of literature that has not reached book form before.?


Statistics for Making Decisions

Statistics for Making Decisions

Author: Nicholas T. Longford

Publisher: CRC Press

Published: 2021-03-30

Total Pages: 273

ISBN-13: 1000347605

DOWNLOAD EBOOK

Making decisions is a ubiquitous mental activity in our private and professional or public lives. It entails choosing one course of action from an available shortlist of options. Statistics for Making Decisions places decision making at the centre of statistical inference, proposing its theory as a new paradigm for statistical practice. The analysis in this paradigm is earnest about prior information and the consequences of the various kinds of errors that may be committed. Its conclusion is a course of action tailored to the perspective of the specific client or sponsor of the analysis. The author’s intention is a wholesale replacement of hypothesis testing, indicting it with the argument that it has no means of incorporating the consequences of errors which self-evidently matter to the client. The volume appeals to the analyst who deals with the simplest statistical problems of comparing two samples (which one has a greater mean or variance), or deciding whether a parameter is positive or negative. It combines highlighting the deficiencies of hypothesis testing with promoting a principled solution based on the idea of a currency for error, of which we want to spend as little as possible. This is implemented by selecting the option for which the expected loss is smallest (the Bayes rule). The price to pay is the need for a more detailed description of the options, and eliciting and quantifying the consequences (ramifications) of the errors. This is what our clients do informally and often inexpertly after receiving outputs of the analysis in an established format, such as the verdict of a hypothesis test or an estimate and its standard error. As a scientific discipline and profession, statistics has a potential to do this much better and deliver to the client a more complete and more relevant product. Nicholas T. Longford is a senior statistician at Imperial College, London, specialising in statistical methods for neonatal medicine. His interests include causal analysis of observational studies, decision theory, and the contest of modelling and design in data analysis. His longer-term appointments in the past include Educational Testing Service, Princeton, NJ, USA, de Montfort University, Leicester, England, and directorship of SNTL, a statistics research and consulting company. He is the author of over 100 journal articles and six other monographs on a variety of topics in applied statistics.


All of Statistics

All of Statistics

Author: Larry Wasserman

Publisher: Springer Science & Business Media

Published: 2013-12-11

Total Pages: 446

ISBN-13: 0387217363

DOWNLOAD EBOOK

Taken literally, the title "All of Statistics" is an exaggeration. But in spirit, the title is apt, as the book does cover a much broader range of topics than a typical introductory book on mathematical statistics. This book is for people who want to learn probability and statistics quickly. It is suitable for graduate or advanced undergraduate students in computer science, mathematics, statistics, and related disciplines. The book includes modern topics like non-parametric curve estimation, bootstrapping, and classification, topics that are usually relegated to follow-up courses. The reader is presumed to know calculus and a little linear algebra. No previous knowledge of probability and statistics is required. Statistics, data mining, and machine learning are all concerned with collecting and analysing data.


Modern Data Science with R

Modern Data Science with R

Author: Benjamin S. Baumer

Publisher: CRC Press

Published: 2021-03-31

Total Pages: 830

ISBN-13: 0429575394

DOWNLOAD EBOOK

From a review of the first edition: "Modern Data Science with R... is rich with examples and is guided by a strong narrative voice. What’s more, it presents an organizing framework that makes a convincing argument that data science is a course distinct from applied statistics" (The American Statistician). Modern Data Science with R is a comprehensive data science textbook for undergraduates that incorporates statistical and computational thinking to solve real-world data problems. Rather than focus exclusively on case studies or programming syntax, this book illustrates how statistical programming in the state-of-the-art R/RStudio computing environment can be leveraged to extract meaningful information from a variety of data in the service of addressing compelling questions. The second edition is updated to reflect the growing influence of the tidyverse set of packages. All code in the book has been revised and styled to be more readable and easier to understand. New functionality from packages like sf, purrr, tidymodels, and tidytext is now integrated into the text. All chapters have been revised, and several have been split, re-organized, or re-imagined to meet the shifting landscape of best practice.


Medical Statistics

Medical Statistics

Author: Ramakrishna HK

Publisher: Springer

Published: 2016-11-08

Total Pages: 188

ISBN-13: 9811019231

DOWNLOAD EBOOK

This book deals with statistics in medicine in a simple way. The text is supported by abundant examples from medical data. This book aims to explain and simplify the process of data presentation. Further aspects addressed include how to design and conduct clinical trials, and how to write journal articles.


Exploratory Data Analysis with MATLAB

Exploratory Data Analysis with MATLAB

Author: Wendy L. Martinez

Publisher: CRC Press

Published: 2017-08-07

Total Pages: 589

ISBN-13: 1315349841

DOWNLOAD EBOOK

Praise for the Second Edition: "The authors present an intuitive and easy-to-read book. ... accompanied by many examples, proposed exercises, good references, and comprehensive appendices that initiate the reader unfamiliar with MATLAB." —Adolfo Alvarez Pinto, International Statistical Review "Practitioners of EDA who use MATLAB will want a copy of this book. ... The authors have done a great service by bringing together so many EDA routines, but their main accomplishment in this dynamic text is providing the understanding and tools to do EDA. —David A Huckaby, MAA Reviews Exploratory Data Analysis (EDA) is an important part of the data analysis process. The methods presented in this text are ones that should be in the toolkit of every data scientist. As computational sophistication has increased and data sets have grown in size and complexity, EDA has become an even more important process for visualizing and summarizing data before making assumptions to generate hypotheses and models. Exploratory Data Analysis with MATLAB, Third Edition presents EDA methods from a computational perspective and uses numerous examples and applications to show how the methods are used in practice. The authors use MATLAB code, pseudo-code, and algorithm descriptions to illustrate the concepts. The MATLAB code for examples, data sets, and the EDA Toolbox are available for download on the book’s website. New to the Third Edition Random projections and estimating local intrinsic dimensionality Deep learning autoencoders and stochastic neighbor embedding Minimum spanning tree and additional cluster validity indices Kernel density estimation Plots for visualizing data distributions, such as beanplots and violin plots A chapter on visualizing categorical data