This undergraduate textbook presents an inquiry-based learning course in stochastic models and computing designed to serve as a first course in probability. Its modular structure complements a traditional lecture format, introducing new topics chapter by chapter with accompanying projects for group collaboration. The text addresses probability axioms leading to Bayes’ theorem, discrete and continuous random variables, Markov chains, and Brownian motion, as well as applications including randomized algorithms, randomized surveys, Benford’s law, and Monte Carlo methods. Adopting a unique application-driven approach to better study probability in action, the book emphasizes data, simulation, and games to strengthen reader insight and intuition while proving theorems. Additionally, the text incorporates codes and exercises in the Julia programming language to further promote a hands-on focus in modelling. Students should have prior knowledge of single variable calculus. Giray Ökten received his PhD from Claremont Graduate University. He has held academic positions at University of Alaska Fairbanks, Ball State University, and Florida State University. He received a Fulbright U.S. Scholar award in 2015. He is the author of an open access textbook in numerical analysis, First Semester in Numerical Analysis with Julia, published by Florida State University Libraries, and a co-author of a children’s math book, The Mathematical Investigations of Dr. O and Arya, published by Tumblehome. His research interests include Monte Carlo methods and computational finance.
This book is a fresh approach to a calculus based, first course in probability and statistics, using R throughout to give a central role to data and simulation. The book introduces probability with Monte Carlo simulation as an essential tool. Simulation makes challenging probability questions quickly accessible and easily understandable. Mathematical approaches are included, using calculus when appropriate, but are always connected to experimental computations. Using R and simulation gives a nuanced understanding of statistical inference. The impact of departure from assumptions in statistical tests is emphasized, quantified using simulations, and demonstrated with real data. The book compares parametric and non-parametric methods through simulation, allowing for a thorough investigation of testing error and power. The text builds R skills from the outset, allowing modern methods of resampling and cross validation to be introduced along with traditional statistical techniques. Fifty-two data sets are included in the complementary R package fosdata. Most of these data sets are from recently published papers, so that you are working with current, real data, which is often large and messy. Two central chapters use powerful tidyverse tools (dplyr, ggplot2, tidyr, stringr) to wrangle data and produce meaningful visualizations. Preliminary versions of the book have been used for five semesters at Saint Louis University, and the majority of the more than 400 exercises have been classroom tested.
This accessible new edition explores the major topics in Monte Carlo simulation that have arisen over the past 30 years and presents a sound foundation for problem solving Simulation and the Monte Carlo Method, Third Edition reflects the latest developments in the field and presents a fully updated and comprehensive account of the state-of-the-art theory, methods and applications that have emerged in Monte Carlo simulation since the publication of the classic First Edition over more than a quarter of a century ago. While maintaining its accessible and intuitive approach, this revised edition features a wealth of up-to-date information that facilitates a deeper understanding of problem solving across a wide array of subject areas, such as engineering, statistics, computer science, mathematics, and the physical and life sciences. The book begins with a modernized introduction that addresses the basic concepts of probability, Markov processes, and convex optimization. Subsequent chapters discuss the dramatic changes that have occurred in the field of the Monte Carlo method, with coverage of many modern topics including: Markov Chain Monte Carlo, variance reduction techniques such as importance (re-)sampling, and the transform likelihood ratio method, the score function method for sensitivity analysis, the stochastic approximation method and the stochastic counter-part method for Monte Carlo optimization, the cross-entropy method for rare events estimation and combinatorial optimization, and application of Monte Carlo techniques for counting problems. An extensive range of exercises is provided at the end of each chapter, as well as a generous sampling of applied examples. The Third Edition features a new chapter on the highly versatile splitting method, with applications to rare-event estimation, counting, sampling, and optimization. A second new chapter introduces the stochastic enumeration method, which is a new fast sequential Monte Carlo method for tree search. In addition, the Third Edition features new material on: • Random number generation, including multiple-recursive generators and the Mersenne Twister • Simulation of Gaussian processes, Brownian motion, and diffusion processes • Multilevel Monte Carlo method • New enhancements of the cross-entropy (CE) method, including the “improved” CE method, which uses sampling from the zero-variance distribution to find the optimal importance sampling parameters • Over 100 algorithms in modern pseudo code with flow control • Over 25 new exercises Simulation and the Monte Carlo Method, Third Edition is an excellent text for upper-undergraduate and beginning graduate courses in stochastic simulation and Monte Carlo techniques. The book also serves as a valuable reference for professionals who would like to achieve a more formal understanding of the Monte Carlo method. Reuven Y. Rubinstein, DSc, was Professor Emeritus in the Faculty of Industrial Engineering and Management at Technion-Israel Institute of Technology. He served as a consultant at numerous large-scale organizations, such as IBM, Motorola, and NEC. The author of over 100 articles and six books, Dr. Rubinstein was also the inventor of the popular score-function method in simulation analysis and generic cross-entropy methods for combinatorial optimization and counting. Dirk P. Kroese, PhD, is a Professor of Mathematics and Statistics in the School of Mathematics and Physics of The University of Queensland, Australia. He has published over 100 articles and four books in a wide range of areas in applied probability and statistics, including Monte Carlo methods, cross-entropy, randomized algorithms, tele-traffic c theory, reliability, computational statistics, applied probability, and stochastic modeling.
"In formulating a stochastic model to describe a real phenomenon, it used to be that one compromised between choosing a model that is a realistic replica of the actual situation and choosing one whose mathematical analysis is tractable. That is, there did not seem to be any payoff in choosing a model that faithfully conformed to the phenomenon under study if it were not possible to mathematically analyze that model. Similar considerations have led to the concentration on asymptotic or steady-state results as opposed to the more useful ones on transient time. However, the relatively recent advent of fast and inexpensive computational power has opened up another approach--namely, to try to model the phenomenon as faithfully as possible and then to rely on a simulation study to analyze it"--
Statistics and hypothesis testing are routinely used in areas (such as linguistics) that are traditionally not mathematically intensive. In such fields, when faced with experimental data, many students and researchers tend to rely on commercial packages to carry out statistical data analysis, often without understanding the logic of the statistical tests they rely on. As a consequence, results are often misinterpreted, and users have difficulty in flexibly applying techniques relevant to their own research — they use whatever they happen to have learned. A simple solution is to teach the fundamental ideas of statistical hypothesis testing without using too much mathematics. This book provides a non-mathematical, simulation-based introduction to basic statistical concepts and encourages readers to try out the simulations themselves using the source code and data provided (the freely available programming language R is used throughout). Since the code presented in the text almost always requires the use of previously introduced programming constructs, diligent students also acquire basic programming abilities in R. The book is intended for advanced undergraduate and graduate students in any discipline, although the focus is on linguistics, psychology, and cognitive science. It is designed for self-instruction, but it can also be used as a textbook for a first course on statistics. Earlier versions of the book have been used in undergraduate and graduate courses in Europe and the US. ”Vasishth and Broe have written an attractive introduction to the foundations of statistics. It is concise, surprisingly comprehensive, self-contained and yet quite accessible. Highly recommended.” Harald Baayen, Professor of Linguistics, University of Alberta, Canada ”By using the text students not only learn to do the specific things outlined in the book, they also gain a skill set that empowers them to explore new areas that lie beyond the book’s coverage.” Colin Phillips, Professor of Linguistics, University of Maryland, USA
This book presents in a compact form the program carried out in introductory statistics courses and discusses some essential topics for research activity, such as Monte Carlo simulation techniques, methods of statistical inference, best fit and analysis of laboratory data. All themes are developed starting from fundamentals, highlighting their applicative aspects, up to the detailed description of several cases particularly relevant for technical and scientific research. The text is dedicated to university students in scientific fields and to all researchers who have to solve practical problems by applying data analysis and simulation procedures. The R software is adopted throughout the book, with a rich library of original programs accessible to the readers through a website.
WILEY-INTERSCIENCE PAPERBACK SERIES The Wiley-Interscience Paperback Series consists of selected books that have been made more accessible to consumers in an effort to increase global appeal and general circulation. With these new unabridged softcover volumes, Wiley hopes to extend the lives of these works by making them available to future generations of statisticians, mathematicians, and scientists. ". . .this is a very competently written and useful addition to the statistical literature; a book every statistician should look at and that many should study!" —Short Book Reviews, International Statistical Institute ". . .reading this book was an enjoyable learning experience. The suggestions and recommendations on the methods [make] this book an excellent reference for anyone interested in simulation. With its compact structure and good coverage of material, it [is] an excellent textbook for a simulation course." —Technometrics ". . .this work is an excellent comprehensive guide to simulation methods, written by a very competent author. It is especially recommended for those users of simulation methods who want more than a 'cook book'. " —Mathematics Abstracts This book is a comprehensive guide to simulation methods with explicit recommendations of methods and algorithms. It covers both the technical aspects of the subject, such as the generation of random numbers, non-uniform random variates and stochastic processes, and the use of simulation. Supported by the relevant mathematical theory, the text contains a great deal of unpublished research material, including coverage of the analysis of shift-register generators, sensitivity analysis of normal variate generators, analysis of simulation output, and more.
The first seven chapters use R for probability simulation and computation, including random number generation, numerical and Monte Carlo integration, and finding limiting distributions of Markov Chains with both discrete and continuous states. Applications include coverage probabilities of binomial confidence intervals, estimation of disease prevalence from screening tests, parallel redundancy for improved reliability of systems, and various kinds of genetic modeling. These initial chapters can be used for a non-Bayesian course in the simulation of applied probability models and Markov Chains. Chapters 8 through 10 give a brief introduction to Bayesian estimation and illustrate the use of Gibbs samplers to find posterior distributions and interval estimates, including some examples in which traditional methods do not give satisfactory results. WinBUGS software is introduced with a detailed explanation of its interface and examples of its use for Gibbs sampling for Bayesian estimation. No previous experience using R is required. An appendix introduces R, and complete R code is included for almost all computational examples and problems (along with comments and explanations). Noteworthy features of the book are its intuitive approach, presenting ideas with examples from biostatistics, reliability, and other fields; its large number of figures; and its extraordinarily large number of problems (about a third of the pages), ranging from simple drill to presentation of additional topics. Hints and answers are provided for many of the problems. These features make the book ideal for students of statistics at the senior undergraduate and at the beginning graduate levels.
In this fully revised second edition of Understanding Probability, the reader can learn about the world of probability in an informal way. The author demystifies the law of large numbers, betting systems, random walks, the bootstrap, rare events, the central limit theorem, the Bayesian approach and more. This second edition has wider coverage, more explanations and examples and exercises, and a new chapter introducing Markov chains, making it a great choice for a first probability course. But its easy-going style makes it just as valuable if you want to learn about the subject on your own, and high school algebra is really all the mathematical background you need.
This textbook may be downloaded as a free PDF on the project's website, and the paperback is sold royalty-free. OpenIntro develops free textbooks and course resources for introductory statistics that exceeds the quality standards of traditional textbooks and resources, and that maximizes accessibility options for the typical student. The approach taken in this textbooks differs from OpenIntro Statistics in its introduction to inference. The foundations for inference are provided using randomization and simulation methods. Once a solid foundation is formed, a transition is made to traditional approaches, where the normal and t distributions are used for hypothesis testing and the construction of confidence intervals.