This book combines theoretical underpinnings of statistics with practical analysis of Earth sciences data using MATLAB. Supplementary resources are available online.
This textbook on computational statistics presents tools and concepts of univariate and multivariate statistical data analysis with a strong focus on applications and implementations in the statistical software R. It covers mathematical, statistical as well as programming problems in computational statistics and contains a wide variety of practical examples. In addition to the numerous R sniplets presented in the text, all computer programs (quantlets) and data sets to the book are available on GitHub and referred to in the book. This enables the reader to fully reproduce as well as modify and adjust all examples to their needs. The book is intended for advanced undergraduate and first-year graduate students as well as for data analysts new to the job who would like a tour of the various statistical tools in a data analysis workshop. The experienced reader with a good knowledge of statistics and programming might skip some sections on univariate models and enjoy the various ma thematical roots of multivariate techniques. The Quantlet platform quantlet.de, quantlet.com, quantlet.org is an integrated QuantNet environment consisting of different types of statistics-related documents and program codes. Its goal is to promote reproducibility and offer a platform for sharing validated knowledge native to the social web. QuantNet and the corresponding Data-Driven Documents-based visualization allows readers to reproduce the tables, pictures and calculations inside this Springer book.
From the reviews: "All in all, Graham Borradaile has written and interesting and idiosyncratic book on statistics for geoscientists that will be welcome among students, researchers, and practitioners dealing with orientation data. That should include engineering geologists who work with things like rock fracture orientation measurements or clast alignment in paleoseismic trenches. It won’t replace the collection of statistics and geostatistics texts in my library, but it will have a place among them and will likely be one of several references to which I turn when working with orientation data.... The text is easy to follow and illustrations are generally clear and easy to read..."(William C. Haneberg, Haneberg Geoscience)
The Handbook of Computational Statistics: Concepts and Methodology is divided into four parts. It begins with an overview over the field of Computational Statistics. The second part presents several topics in the supporting field of statistical computing. Emphasis is placed on the need of fast and accurate numerical algorithms and it discusses some of the basic methodologies for transformation, data base handling and graphics treatment. The third part focuses on statistical methodology. Special attention is given to smoothing, iterative procedures, simulation and visualization of multivariate data. Finally a set of selected applications like Bioinformatics, Medical Imaging, Finance and Network Intrusion Detection highlight the usefulness of computational statistics.
An introductory text for the next generation of geospatial analysts and data scientists, Spatial Analysis: Statistics, Visualization, and Computational Methods focuses on the fundamentals of spatial analysis using traditional, contemporary, and computational methods. Outlining both non-spatial and spatial statistical concepts, the authors present p
This textbook teaches the essential background and skills for understanding and quantifying uncertainties in a computational simulation, and for predicting the behavior of a system under those uncertainties. It addresses a critical knowledge gap in the widespread adoption of simulation in high-consequence decision-making throughout the engineering and physical sciences. Constructing sophisticated techniques for prediction from basic building blocks, the book first reviews the fundamentals that underpin later topics of the book including probability, sampling, and Bayesian statistics. Part II focuses on applying Local Sensitivity Analysis to apportion uncertainty in the model outputs to sources of uncertainty in its inputs. Part III demonstrates techniques for quantifying the impact of parametric uncertainties on a problem, specifically how input uncertainties affect outputs. The final section covers techniques for applying uncertainty quantification to make predictions under uncertainty, including treatment of epistemic uncertainties. It presents the theory and practice of predicting the behavior of a system based on the aggregation of data from simulation, theory, and experiment. The text focuses on simulations based on the solution of systems of partial differential equations and includes in-depth coverage of Monte Carlo methods, basic design of computer experiments, as well as regularized statistical techniques. Code references, in python, appear throughout the text and online as executable code, enabling readers to perform the analysis under discussion. Worked examples from realistic, model problems help readers understand the mechanics of applying the methods. Each chapter ends with several assignable problems. Uncertainty Quantification and Predictive Computational Science fills the growing need for a classroom text for senior undergraduate and early-career graduate students in the engineering and physical sciences and supports independent study by researchers and professionals who must include uncertainty quantification and predictive science in the simulations they develop and/or perform.
Gathering the right kind and the right amount of information is crucial for any decision-making process. This book presents a unified framework for assessing the value of potential data gathering schemes by integrating spatial modelling and decision analysis, with a focus on the Earth sciences. The authors discuss the value of imperfect versus perfect information, and the value of total versus partial information, where only subsets of the data are acquired. Concepts are illustrated using a suite of quantitative tools from decision analysis, such as decision trees and influence diagrams, as well as models for continuous and discrete dependent spatial variables, including Bayesian networks, Markov random fields, Gaussian processes, and multiple-point geostatistics. Unique in scope, this book is of interest to students, researchers and industry professionals in the Earth and environmental sciences, who use applied statistics and decision analysis techniques, and particularly to those working in petroleum, mining, and environmental geoscience.
Special Features: · Offers a comprehensive treatment of statistics in geology.· Topics progress from background information to analysis of geological sequences, then maps, and finally multivariate observations.· The book places special emphasis on probability and statistics, including nonparametric statistics, constant-sum data, eigenvalue calculations, analysis of directional data, mapping and geostatistics, fractals, and multivariate analysis.· The text now includes numerous geological data sets that illustrate how specific computational procedures can be applied to problems in the Earth sciences. All data sets are available on the book's companion Web site.· Each chapter now ends with a set of exercises of greater or lesser complexity that the student can address using methods discussed in the chapter.· Provides expanded coverage of elementary probability theory.· The discussion of nonparametric methods has been expanded to address closure effects.· Coverage of eigenvalues and eigenvectors has been revised.· Includes a new section on singular value decomposition and the relationship between R- and Q-mode factor methods in the chapter on multivariate analysis.· The section on contour mapping has been revised to reflect modern practices.· Includes revised coverage of the many varieties of kriging and provides of series of simple demonstrations that illustrate how geostatistical methodologies work.· Includes a discussion of fractals, a promising area of future research.· The section on regression has been expanded to include several variants that have special significance in the Earth sciences.
One of the pathways by which the scientific community confirms the validity of a new scientific discovery is by repeating the research that produced it. When a scientific effort fails to independently confirm the computations or results of a previous study, some fear that it may be a symptom of a lack of rigor in science, while others argue that such an observed inconsistency can be an important precursor to new discovery. Concerns about reproducibility and replicability have been expressed in both scientific and popular media. As these concerns came to light, Congress requested that the National Academies of Sciences, Engineering, and Medicine conduct a study to assess the extent of issues related to reproducibility and replicability and to offer recommendations for improving rigor and transparency in scientific research. Reproducibility and Replicability in Science defines reproducibility and replicability and examines the factors that may lead to non-reproducibility and non-replicability in research. Unlike the typical expectation of reproducibility between two computations, expectations about replicability are more nuanced, and in some cases a lack of replicability can aid the process of scientific discovery. This report provides recommendations to researchers, academic institutions, journals, and funders on steps they can take to improve reproducibility and replicability in science.