A Practitioner’s Guide to Resampling for Data Analysis, Data Mining, and Modeling

A Practitioner’s Guide to Resampling for Data Analysis, Data Mining, and Modeling

Author: Phillip Good

Publisher: Chapman and Hall/CRC

Published: 2011-08-25

Total Pages: 0

ISBN-13: 9781439855508

DOWNLOAD EBOOK

Distribution-free resampling methods—permutation tests, decision trees, and the bootstrap—are used today in virtually every research area. A Practitioner’s Guide to Resampling for Data Analysis, Data Mining, and Modeling explains how to use the bootstrap to estimate the precision of sample-based estimates and to determine sample size, data permutations to test hypotheses, and the readily-interpreted decision tree to replace arcane regression methods. Highlights Each chapter contains dozens of thought provoking questions, along with applicable R and Stata code Methods are illustrated with examples from agriculture, audits, bird migration, clinical trials, epidemiology, image processing, immunology, medicine, microarrays and gene selection Lists of commercially available software for the bootstrap, decision trees, and permutation tests are incorporated in the text Access to APL, MATLAB, and SC code for many of the routines is provided on the author’s website The text covers estimation, two-sample and k-sample univariate, and multivariate comparisons of means and variances, sample size determination, categorical data, multiple hypotheses, and model building Statistics practitioners will find the methods described in the text easy to learn and to apply in a broad range of subject areas from A for Accounting, Agriculture, Anthropology, Aquatic science, Archaeology, Astronomy, and Atmospheric science to V for Virology and Vocational Guidance, and Z for Zoology. Practitioners and research workers and in the biomedical, engineering and social sciences, as well as advanced students in biology, business, dentistry, medicine, psychology, public health, sociology, and statistics will find an easily-grasped guide to estimation, testing hypotheses and model building.


Combinatorial Inference in Geometric Data Analysis

Combinatorial Inference in Geometric Data Analysis

Author: Brigitte Le Roux

Publisher: CRC Press

Published: 2019-03-20

Total Pages: 256

ISBN-13: 1498781624

DOWNLOAD EBOOK

Geometric Data Analysis designates the approach of Multivariate Statistics that conceptualizes the set of observations as a Euclidean cloud of points. Combinatorial Inference in Geometric Data Analysis gives an overview of multidimensional statistical inference methods applicable to clouds of points that make no assumption on the process of generating data or distributions, and that are not based on random modelling but on permutation procedures recasting in a combinatorial framework. It focuses particularly on the comparison of a group of observations to a reference population (combinatorial test) or to a reference value of a location parameter (geometric test), and on problems of homogeneity, that is the comparison of several groups for two basic designs. These methods involve the use of combinatorial procedures to build a reference set in which we place the data. The chosen test statistics lead to original extensions, such as the geometric interpretation of the observed level, and the construction of a compatibility region. Features: Defines precisely the object under study in the context of multidimensional procedures, that is clouds of points Presents combinatorial tests and related computations with R and Coheris SPAD software Includes four original case studies to illustrate application of the tests Includes necessary mathematical background to ensure it is self–contained This book is suitable for researchers and students of multivariate statistics, as well as applied researchers of various scientific disciplines. It could be used for a specialized course taught at either master or PhD level.


Robust Multivariate Analysis

Robust Multivariate Analysis

Author: David J. Olive

Publisher: Springer

Published: 2017-11-28

Total Pages: 508

ISBN-13: 3319682539

DOWNLOAD EBOOK

This text presents methods that are robust to the assumption of a multivariate normal distribution or methods that are robust to certain types of outliers. Instead of using exact theory based on the multivariate normal distribution, the simpler and more applicable large sample theory is given. The text develops among the first practical robust regression and robust multivariate location and dispersion estimators backed by theory. The robust techniques are illustrated for methods such as principal component analysis, canonical correlation analysis, and factor analysis. A simple way to bootstrap confidence regions is also provided. Much of the research on robust multivariate analysis in this book is being published for the first time. The text is suitable for a first course in Multivariate Statistical Analysis or a first course in Robust Statistics. This graduate text is also useful for people who are familiar with the traditional multivariate topics, but want to know more about handling data sets with outliers. Many R programs and R data sets are available on the author’s website.


Medical Biostatistics

Medical Biostatistics

Author: Abhaya Indrayan

Publisher: CRC Press

Published: 2012-08-07

Total Pages: 1027

ISBN-13: 146651390X

DOWNLOAD EBOOK

Encyclopedic in breadth, yet practical and concise, Medical Biostatistics, Third Edition focuses on the statistical aspects of medicine with a medical perspective, showing the utility of biostatistics as a tool to manage many medical uncertainties. The author concludes "Just as results of medical tests, statistical results can be false negative or


Statistical Roundtables

Statistical Roundtables

Author: Christine M. Anderson-Cook

Publisher: Quality Press

Published: 2016-04-22

Total Pages: 552

ISBN-13: 087389930X

DOWNLOAD EBOOK

Quality Progress, the flagship journal of ASQ, has been publishing the column “Statistics Roundtable” since 1999. With over 130 contributions from leading authors in applied statistics, the column has been highly successful and widely read. This book collects 90 of the most interesting and useful articles on some key topics. The editors have constructed this book to be a resource for statisticians and practitioners alike – with short, accessible, practical advice in important core areas of statistics from world-renowned experts. This book is intended to be an informative read, with bite-sized columns, as well as a starting point for deeper exploration of key statistical areas. The book contains nine chapters with collections of articles on the following topics: Statistical engineering Data quality and measurement Data collection Key statistical tools Quality control Reliability Multiple response and meta-analysis Applications Communication and training Chapter introductions provide a quick overview of the material contained in the columns of that chapter, as well as complementary articles for that topic that appear elsewhere in the book. Also included at the end of the each chapter introduction is a short list of key references that can provide additional details or examples for material in the topic area.


Becoming a Behavioral Science Researcher

Becoming a Behavioral Science Researcher

Author: Rex B. Kline

Publisher: Guilford Publications

Published: 2019-11-12

Total Pages: 377

ISBN-13: 1462541283

DOWNLOAD EBOOK

Students and beginning researchers often discover that their introductory statistics and methods courses have not fully equipped them to plan and execute their own behavioral research studies. This indispensable book bridges the gap between coursework and conducting independent research. With clarity and wit, the author helps the reader build needed skills to formulate a precise, meaningful research question; understand the pros and cons of widely used research designs and analysis options; correctly interpret the outcomes of statistical tests; make informed measurement choices for a particular study; manage the practical aspects of data screening and preparation; and craft effective journal articles, oral presentations, and posters. Including annotated examples and recommended readings, most chapters feature theoretical and computer-based exercises; an answer appendix at the back of the book allows readers to check their work.


The A-Z of Error-Free Research

The A-Z of Error-Free Research

Author: Phillip I. Good

Publisher: CRC Press

Published: 2012-08-01

Total Pages: 273

ISBN-13: 1439897379

DOWNLOAD EBOOK

A Practical Guide with Step-by-Step Explanations, Numerous Worked Examples, and R Code The A–Z of Error-Free Research describes the design, analysis, modeling, and reporting of experiments, clinical trials, and surveys. The book shows you when to use statistics, the best ways to cope with variation, and how to design an experiment, determine optimal sample size, and collect useable data. It also helps you choose the best statistical procedures for your application and takes you step by step through model development and reporting results for publication. Transition from Student to Researcher Helping you become a confident researcher, the book begins with an overview of when—and when not—to use statistics. It guides you through the planning and data collection phases and presents various data analysis techniques, including methods for sample size determination. The author then covers techniques for developing models that provide a basis for future research. He also discusses reporting techniques to ensure your research efforts get the proper credit. The book concludes with case-control and cohort studies.


Concise Encyclopedia of Biostatistics for Medical Professionals

Concise Encyclopedia of Biostatistics for Medical Professionals

Author: Abhaya Indrayan

Publisher: CRC Press

Published: 2016-11-25

Total Pages: 1589

ISBN-13: 1315355574

DOWNLOAD EBOOK

Concise Encyclopedia of Biostatistics for Medical Professionals focuses on conceptual knowledge and practical advice rather than mathematical details, enhancing its usefulness as a reference for medical professionals. The book defines and describes nearly 1000 commonly and not so commonly used biostatistical terms and methods arranged in alphabetical order. These range from simple terms, such as mean and median to advanced terms such as multilevel models and generalized estimating equations. Synonyms or alternative phrases for each topic covered are listed with a reference to the topic.


Data Mining for Business Analytics

Data Mining for Business Analytics

Author: Galit Shmueli

Publisher: John Wiley & Sons

Published: 2019-10-14

Total Pages: 608

ISBN-13: 111954985X

DOWNLOAD EBOOK

Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python presents an applied approach to data mining concepts and methods, using Python software for illustration Readers will learn how to implement a variety of popular data mining algorithms in Python (a free and open-source software) to tackle business problems and opportunities. This is the sixth version of this successful text, and the first using Python. It covers both statistical and machine learning algorithms for prediction, classification, visualization, dimension reduction, recommender systems, clustering, text mining and network analysis. It also includes: A new co-author, Peter Gedeck, who brings both experience teaching business analytics courses using Python, and expertise in the application of machine learning methods to the drug-discovery process A new section on ethical issues in data mining Updates and new material based on feedback from instructors teaching MBA, undergraduate, diploma and executive courses, and from their students More than a dozen case studies demonstrating applications for the data mining techniques described End-of-chapter exercises that help readers gauge and expand their comprehension and competency of the material presented A companion website with more than two dozen data sets, and instructor materials including exercise solutions, PowerPoint slides, and case solutions Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python is an ideal textbook for graduate and upper-undergraduate level courses in data mining, predictive analytics, and business analytics. This new edition is also an excellent reference for analysts, researchers, and practitioners working with quantitative methods in the fields of business, finance, marketing, computer science, and information technology. “This book has by far the most comprehensive review of business analytics methods that I have ever seen, covering everything from classical approaches such as linear and logistic regression, through to modern methods like neural networks, bagging and boosting, and even much more business specific procedures such as social network analysis and text mining. If not the bible, it is at the least a definitive manual on the subject.” —Gareth M. James, University of Southern California and co-author (with Witten, Hastie and Tibshirani) of the best-selling book An Introduction to Statistical Learning, with Applications in R


Data Mining and Predictive Analytics

Data Mining and Predictive Analytics

Author: Daniel T. Larose

Publisher: John Wiley & Sons

Published: 2015-02-19

Total Pages: 827

ISBN-13: 1118868676

DOWNLOAD EBOOK

Learn methods of data analysis and their application to real-world data sets This updated second edition serves as an introduction to data mining methods and models, including association rules, clustering, neural networks, logistic regression, and multivariate analysis. The authors apply a unified “white box” approach to data mining methods and models. This approach is designed to walk readers through the operations and nuances of the various methods, using small data sets, so readers can gain an insight into the inner workings of the method under review. Chapters provide readers with hands-on analysis problems, representing an opportunity for readers to apply their newly-acquired data mining expertise to solving real problems using large, real-world data sets. Data Mining and Predictive Analytics: Offers comprehensive coverage of association rules, clustering, neural networks, logistic regression, multivariate analysis, and R statistical programming language Features over 750 chapter exercises, allowing readers to assess their understanding of the new material Provides a detailed case study that brings together the lessons learned in the book Includes access to the companion website, www.dataminingconsultant, with exclusive password-protected instructor content Data Mining and Predictive Analytics will appeal to computer science and statistic students, as well as students in MBA programs, and chief executives.