Large Sample Covariance Matrices and High-Dimensional Data Analysis

Large Sample Covariance Matrices and High-Dimensional Data Analysis

Author: Jianfeng Yao

Publisher: Cambridge University Press

Published: 2015-03-26

Total Pages: 0

ISBN-13: 9781107065178

DOWNLOAD EBOOK

High-dimensional data appear in many fields, and their analysis has become increasingly important in modern statistics. However, it has long been observed that several well-known methods in multivariate analysis become inefficient, or even misleading, when the data dimension p is larger than, say, several tens. A seminal example is the well-known inefficiency of Hotelling's T2-test in such cases. This example shows that classical large sample limits may no longer hold for high-dimensional data; statisticians must seek new limiting theorems in these instances. Thus, the theory of random matrices (RMT) serves as a much-needed and welcome alternative framework. Based on the authors' own research, this book provides a first-hand introduction to new high-dimensional statistical methods derived from RMT. The book begins with a detailed introduction to useful tools from RMT, and then presents a series of high-dimensional problems with solutions provided by RMT methods.


High-Dimensional Covariance Estimation

High-Dimensional Covariance Estimation

Author: Mohsen Pourahmadi

Publisher: John Wiley & Sons

Published: 2013-06-24

Total Pages: 204

ISBN-13: 1118034295

DOWNLOAD EBOOK

Methods for estimating sparse and large covariance matrices Covariance and correlation matrices play fundamental roles in every aspect of the analysis of multivariate data collected from a variety of fields including business and economics, health care, engineering, and environmental and physical sciences. High-Dimensional Covariance Estimation provides accessible and comprehensive coverage of the classical and modern approaches for estimating covariance matrices as well as their applications to the rapidly developing areas lying at the intersection of statistics and machine learning. Recently, the classical sample covariance methodologies have been modified and improved upon to meet the needs of statisticians and researchers dealing with large correlated datasets. High-Dimensional Covariance Estimation focuses on the methodologies based on shrinkage, thresholding, and penalized likelihood with applications to Gaussian graphical models, prediction, and mean-variance portfolio management. The book relies heavily on regression-based ideas and interpretations to connect and unify many existing methods and algorithms for the task. High-Dimensional Covariance Estimation features chapters on: Data, Sparsity, and Regularization Regularizing the Eigenstructure Banding, Tapering, and Thresholding Covariance Matrices Sparse Gaussian Graphical Models Multivariate Regression The book is an ideal resource for researchers in statistics, mathematics, business and economics, computer sciences, and engineering, as well as a useful text or supplement for graduate-level courses in multivariate analysis, covariance estimation, statistical learning, and high-dimensional data analysis.


High-Dimensional Covariance Matrix Estimation

High-Dimensional Covariance Matrix Estimation

Author: Aygul Zagidullina

Publisher: Springer Nature

Published: 2021-10-29

Total Pages: 123

ISBN-13: 3030800652

DOWNLOAD EBOOK

This book presents covariance matrix estimation and related aspects of random matrix theory. It focuses on the sample covariance matrix estimator and provides a holistic description of its properties under two asymptotic regimes: the traditional one, and the high-dimensional regime that better fits the big data context. It draws attention to the deficiencies of standard statistical tools when used in the high-dimensional setting, and introduces the basic concepts and major results related to spectral statistics and random matrix theory under high-dimensional asymptotics in an understandable and reader-friendly way. The aim of this book is to inspire applied statisticians, econometricians, and machine learning practitioners who analyze high-dimensional data to apply the recent developments in their work.


Spectral Analysis of Large Dimensional Random Matrices

Spectral Analysis of Large Dimensional Random Matrices

Author: Zhidong Bai

Publisher: Springer Science & Business Media

Published: 2009-12-10

Total Pages: 560

ISBN-13: 1441906614

DOWNLOAD EBOOK

The aim of the book is to introduce basic concepts, main results, and widely applied mathematical tools in the spectral analysis of large dimensional random matrices. The core of the book focuses on results established under moment conditions on random variables using probabilistic methods, and is thus easily applicable to statistics and other areas of science. The book introduces fundamental results, most of them investigated by the authors, such as the semicircular law of Wigner matrices, the Marcenko-Pastur law, the limiting spectral distribution of the multivariate F matrix, limits of extreme eigenvalues, spectrum separation theorems, convergence rates of empirical distributions, central limit theorems of linear spectral statistics, and the partial solution of the famous circular law. While deriving the main results, the book simultaneously emphasizes the ideas and methodologies of the fundamental mathematical tools, among them being: truncation techniques, matrix identities, moment convergence theorems, and the Stieltjes transform. Its treatment is especially fitting to the needs of mathematics and statistics graduate students and beginning researchers, having a basic knowledge of matrix theory and an understanding of probability theory at the graduate level, who desire to learn the concepts and tools in solving problems in this area. It can also serve as a detailed handbook on results of large dimensional random matrices for practical users. This second edition includes two additional chapters, one on the authors' results on the limiting behavior of eigenvectors of sample covariance matrices, another on applications to wireless communications and finance. While attempting to bring this edition up-to-date on recent work, it also provides summaries of other areas which are typically considered part of the general field of random matrix theory.


On the Inference about the Spectral Distribution of High-Dimensional Covariance Matrix Based on High-Frequency Noisy Observations

On the Inference about the Spectral Distribution of High-Dimensional Covariance Matrix Based on High-Frequency Noisy Observations

Author: Ningning Xia

Publisher:

Published: 2017

Total Pages: 0

ISBN-13:

DOWNLOAD EBOOK

In practice, observations are often contaminated by noise, making the resulting sample covariance matrix a signal-plus-noise sample covariance matrix. Aiming to make inferences about the spectral distribution of the population covariance matrix under such a situation, we establish an asymptotic relationship that describes how the limiting spectral distribution of (signal) sample covariance matrices depends on that of signal-plus-noise-type sample covariance matrices. As an application, we consider inferences about the spectral distribution of integrated covolatility (ICV) matrices of high-dimensional diffusion processes based on high-frequency data with microstructure noise. The (slightly modified) pre-averaging estimator is a signal-plus-noise sample covariance matrix, and the aforementioned result, together with a (generalized) connection between the spectral distribution of signal sample covariance matrices and that of the population covariance matrix, enables us to propose a two-step procedure to consistently estimate the spectral distribution of ICV for a class of diffusion processes. An alternative approach is further proposed, which possesses several desirable properties: it is more robust, it eliminates the effects of microstructure noise, and the asymptotic relationship that enables consistent estimation of the spectral distribution of ICV is the standard Mar v{c}enko-Pastur equation. The performance of the two approaches is examined via simulation studies under both synchronous and asynchronous observation settings.


Big and Complex Data Analysis

Big and Complex Data Analysis

Author: S. Ejaz Ahmed

Publisher: Springer

Published: 2017-03-21

Total Pages: 390

ISBN-13: 3319415735

DOWNLOAD EBOOK

This volume conveys some of the surprises, puzzles and success stories in high-dimensional and complex data analysis and related fields. Its peer-reviewed contributions showcase recent advances in variable selection, estimation and prediction strategies for a host of useful models, as well as essential new developments in the field. The continued and rapid advancement of modern technology now allows scientists to collect data of increasingly unprecedented size and complexity. Examples include epigenomic data, genomic data, proteomic data, high-resolution image data, high-frequency financial data, functional and longitudinal data, and network data. Simultaneous variable selection and estimation is one of the key statistical problems involved in analyzing such big and complex data. The purpose of this book is to stimulate research and foster interaction between researchers in the area of high-dimensional data analysis. More concretely, its goals are to: 1) highlight and expand the breadth of existing methods in big data and high-dimensional data analysis and their potential for the advancement of both the mathematical and statistical sciences; 2) identify important directions for future research in the theory of regularization methods, in algorithmic development, and in methodologies for different application areas; and 3) facilitate collaboration between theoretical and subject-specific researchers.


Spectral Theory of Large Dimensional Random Matrices and Its Applications to Wireless Communications and Finance Statistics

Spectral Theory of Large Dimensional Random Matrices and Its Applications to Wireless Communications and Finance Statistics

Author: Zhidong Bai

Publisher: World Scientific Publishing Company Incorporated

Published: 2014

Total Pages: 220

ISBN-13: 9789814579056

DOWNLOAD EBOOK

The book contains three parts: Spectral theory of large dimensional random matrices; Applications to wireless communications; and Applications to finance. In the first part, we introduce some basic theorems of spectral analysis of large dimensional random matrices that are obtained under finite moment conditions, such as the limiting spectral distributions of Wigner matrix and that of large dimensional sample covariance matrix, limits of extreme eigenvalues, and the central limit theorems for linear spectral statistics. In the second part, we introduce some basic examples of applications of random matrix theory to wireless communications and in the third part, we present some examples of Applications to statistical finance.


High-Dimensional Probability

High-Dimensional Probability

Author: Roman Vershynin

Publisher: Cambridge University Press

Published: 2018-09-27

Total Pages: 299

ISBN-13: 1108415199

DOWNLOAD EBOOK

An integrated package of powerful probabilistic tools and key applications in modern mathematical data science.


Spectra of Large-Dimensional Sample Covariance Matrices

Spectra of Large-Dimensional Sample Covariance Matrices

Author: V.I. Serdobolskii

Publisher:

Published: 2018

Total Pages: 18

ISBN-13:

DOWNLOAD EBOOK

Limit spectral theory of sample covariance matrices of increasing dimension was recently used as a base for the development of improved non-degenerating methods of multivariate statistical analysis. We present results of a numerical investigation of fundamental relations of this theory (of the “canonical equations”) that thus prove the accuracy of asymptotic relations, find boundaries of the applicability, and the rate of decrease of the remainder terms. The distribution free property of quality functionals for regularized statistical procedures is confirmed experimentally. We show that theoretical upper estimates of the asymptotics remainder terms are 10-100 times overstated.