A Computationally Efficient Bootstrap-equivalent Test for ANOVA in Skewed Populations with a Large Number of Factor Levels

A Computationally Efficient Bootstrap-equivalent Test for ANOVA in Skewed Populations with a Large Number of Factor Levels

Author: Richard Opoku-Nsiah

Publisher:

Published: 2017

Total Pages:

ISBN-13:

DOWNLOAD EBOOK

Advances in technology easily collect a large amount of data in scientific research such as agricultural screening and micro-array experiments. We are particularly interested in data from one-way and crossed two-way designs that have a large number of treatment combinations but small replications with heteroscedastic variances. In this framework, several test statistics have been proposed in the literature. Even though the form of these proposed test statistics may be different, they all use limiting normal or chi-square distribution to conduct their tests. Such approximation approaches the true distribution very slowly when the sample size ni is small while the number of levels of treatments a gets large. A strategy to obtain better accuracy in the classical large sample size setting is to use the bootstrap procedure with studentized statistic. Unfortunately, the available bootstrap method fails when the number of treatment level combinations is large while the number of replications is small. The Fisher and Hall (1990) asymptotic pivotal statistic under large sample size setting is no longer pivotal under small sample size setting with large number of treatment levels. In the first part of this dissertation, we start with describing suitable bootstrap statistics and procedures for hypothesis tests in one- and two-way ANOVA with a large number of levels and small sample sizes. We prove that the theoretical type I error-rate of Akritas and Papadatos (2004) and Wang and Akritas (2006) test statistics and their corresponding bootstrap versions have accuracy of order O(1/√a). We then modify their statistics to obtain asymptotically pivotal statistics in our current framework. We prove that the theoretical type I error-rate of the bootstrap version of the pivotal statistics is accurate up to order O(1/√a). In the second part of the dissertation, we propose a new test statistic in one-way ANOVA which is asymptotically pivotal in the current setting. We improve the accuracy of approximation of the distribution of the test statistic by deriving asymptotic expansion of the statistic under the current framework and define a new test rejection region through Cornish-Fisher expansion of quantiles. The type I error-rate of the new test has a faster convergence rate and is accurate up to order O(1/√a). Simulation studies show that our tests performs better in terms of type I error-rate but comparable power with that of Akritas and Papadatos (2004) in the large a small ni setting. The connection between our asymptotic expansions and bootstrap distribution in the large a small ni setting is discussed. Our proposed test based on asymptotic expansion and Cornish-Fisher expansion of quantiles have both the advantage of higher accuracy and computational efficiency due to no resampling is needed.


Statistical and Econometric Methods for Transportation Data Analysis

Statistical and Econometric Methods for Transportation Data Analysis

Author: Simon Washington

Publisher: CRC Press

Published: 2020-01-30

Total Pages: 496

ISBN-13: 0429520751

DOWNLOAD EBOOK

The book's website (with databases and other support materials) can be accessed here. Praise for the Second Edition: The second edition introduces an especially broad set of statistical methods ... As a lecturer in both transportation and marketing research, I find this book an excellent textbook for advanced undergraduate, Master’s and Ph.D. students, covering topics from simple descriptive statistics to complex Bayesian models. ... It is one of the few books that cover an extensive set of statistical methods needed for data analysis in transportation. The book offers a wealth of examples from the transportation field. —The American Statistician Statistical and Econometric Methods for Transportation Data Analysis, Third Edition offers an expansion over the first and second editions in response to the recent methodological advancements in the fields of econometrics and statistics and to provide an increasing range of examples and corresponding data sets. It describes and illustrates some of the statistical and econometric tools commonly used in transportation data analysis. It provides a wide breadth of examples and case studies, covering applications in various aspects of transportation planning, engineering, safety, and economics. Ample analytical rigor is provided in each chapter so that fundamental concepts and principles are clear and numerous references are provided for those seeking additional technical details and applications. New to the Third Edition Updated references and improved examples throughout. New sections on random parameters linear regression and ordered probability models including the hierarchical ordered probit model. A new section on random parameters models with heterogeneity in the means and variances of parameter estimates. Multiple new sections on correlated random parameters and correlated grouped random parameters in probit, logit and hazard-based models. A new section discussing the practical aspects of random parameters model estimation. A new chapter on Latent Class Models. A new chapter on Bivariate and Multivariate Dependent Variable Models. Statistical and Econometric Methods for Transportation Data Analysis, Third Edition can serve as a textbook for advanced undergraduate, Masters, and Ph.D. students in transportation-related disciplines including engineering, economics, urban and regional planning, and sociology. The book also serves as a technical reference for researchers and practitioners wishing to examine and understand a broad range of statistical and econometric tools required to study transportation problems.