High-Dimensional Covariance Matrix Estimation

High-Dimensional Covariance Matrix Estimation

Author: Aygul Zagidullina

Publisher: Springer Nature

Published: 2021-10-29

Total Pages: 123

ISBN-13: 3030800652

DOWNLOAD EBOOK

This book presents covariance matrix estimation and related aspects of random matrix theory. It focuses on the sample covariance matrix estimator and provides a holistic description of its properties under two asymptotic regimes: the traditional one, and the high-dimensional regime that better fits the big data context. It draws attention to the deficiencies of standard statistical tools when used in the high-dimensional setting, and introduces the basic concepts and major results related to spectral statistics and random matrix theory under high-dimensional asymptotics in an understandable and reader-friendly way. The aim of this book is to inspire applied statisticians, econometricians, and machine learning practitioners who analyze high-dimensional data to apply the recent developments in their work.


High-Dimensional Covariance Estimation

High-Dimensional Covariance Estimation

Author: Mohsen Pourahmadi

Publisher: John Wiley & Sons

Published: 2013-06-24

Total Pages: 204

ISBN-13: 1118034295

DOWNLOAD EBOOK

Methods for estimating sparse and large covariance matrices Covariance and correlation matrices play fundamental roles in every aspect of the analysis of multivariate data collected from a variety of fields including business and economics, health care, engineering, and environmental and physical sciences. High-Dimensional Covariance Estimation provides accessible and comprehensive coverage of the classical and modern approaches for estimating covariance matrices as well as their applications to the rapidly developing areas lying at the intersection of statistics and machine learning. Recently, the classical sample covariance methodologies have been modified and improved upon to meet the needs of statisticians and researchers dealing with large correlated datasets. High-Dimensional Covariance Estimation focuses on the methodologies based on shrinkage, thresholding, and penalized likelihood with applications to Gaussian graphical models, prediction, and mean-variance portfolio management. The book relies heavily on regression-based ideas and interpretations to connect and unify many existing methods and algorithms for the task. High-Dimensional Covariance Estimation features chapters on: Data, Sparsity, and Regularization Regularizing the Eigenstructure Banding, Tapering, and Thresholding Covariance Matrices Sparse Gaussian Graphical Models Multivariate Regression The book is an ideal resource for researchers in statistics, mathematics, business and economics, computer sciences, and engineering, as well as a useful text or supplement for graduate-level courses in multivariate analysis, covariance estimation, statistical learning, and high-dimensional data analysis.


Data Mining for Bioinformatics

Data Mining for Bioinformatics

Author: Sumeet Dua

Publisher: CRC Press

Published: 2012-11-06

Total Pages: 351

ISBN-13: 0849328012

DOWNLOAD EBOOK

Covering theory, algorithms, and methodologies, as well as data mining technologies, Data Mining for Bioinformatics provides a comprehensive discussion of data-intensive computations used in data mining with applications in bioinformatics. It supplies a broad, yet in-depth, overview of the application domains of data mining for bioinformatics to help readers from both biology and computer science backgrounds gain an enhanced understanding of this cross-disciplinary field. The book offers authoritative coverage of data mining techniques, technologies, and frameworks used for storing, analyzing, and extracting knowledge from large databases in the bioinformatics domains, including genomics and proteomics. It begins by describing the evolution of bioinformatics and highlighting the challenges that can be addressed using data mining techniques. Introducing the various data mining techniques that can be employed in biological databases, the text is organized into four sections: Supplies a complete overview of the evolution of the field and its intersection with computational learning Describes the role of data mining in analyzing large biological databases—explaining the breath of the various feature selection and feature extraction techniques that data mining has to offer Focuses on concepts of unsupervised learning using clustering techniques and its application to large biological data Covers supervised learning using classification techniques most commonly used in bioinformatics—addressing the need for validation and benchmarking of inferences derived using either clustering or classification The book describes the various biological databases prominently referred to in bioinformatics and includes a detailed list of the applications of advanced clustering algorithms used in bioinformatics. Highlighting the challenges encountered during the application of classification on biological databases, it considers systems of both single and ensemble classifiers and shares effort-saving tips for model selection and performance estimation strategies.


High-Dimensional Probability

High-Dimensional Probability

Author: Roman Vershynin

Publisher: Cambridge University Press

Published: 2018-09-27

Total Pages: 299

ISBN-13: 1108415199

DOWNLOAD EBOOK

An integrated package of powerful probabilistic tools and key applications in modern mathematical data science.


Recent Advances in Computational Optimization

Recent Advances in Computational Optimization

Author: Stefka Fidanova

Publisher: Springer

Published: 2016-07-15

Total Pages: 306

ISBN-13: 3319401327

DOWNLOAD EBOOK

This volume is a comprehensive collection of extended contributions from the Workshop on Computational Optimization 2015. It presents recent advances in computational optimization. The volume includes important real life problems like parameter settings for controlling processes in bioreactor, control of ethanol production, minimal convex hill with application in routing algorithms, graph coloring, flow design in photonic data transport system, predicting indoor temperature, crisis control center monitoring, fuel consumption of helicopters, portfolio selection, GPS surveying and so on. It shows how to develop algorithms for them based on new metaheuristic methods like evolutionary computation, ant colony optimization, constrain programming and others. This research demonstrates how some real-world problems arising in engineering, economics, medicine and other domains can be formulated as optimization problems.


Recent Advances in Theory and Methods for the Analysis of High Dimensional and High Frequency Financial Data

Recent Advances in Theory and Methods for the Analysis of High Dimensional and High Frequency Financial Data

Author: Norman R. Swanson

Publisher: MDPI

Published: 2021-08-31

Total Pages: 196

ISBN-13: 303650852X

DOWNLOAD EBOOK

Recently, considerable attention has been placed on the development and application of tools useful for the analysis of the high-dimensional and/or high-frequency datasets that now dominate the landscape. The purpose of this Special Issue is to collect both methodological and empirical papers that develop and utilize state-of-the-art econometric techniques for the analysis of such data.


High-Dimensional Statistics

High-Dimensional Statistics

Author: Martin J. Wainwright

Publisher: Cambridge University Press

Published: 2019-02-21

Total Pages: 571

ISBN-13: 1108498027

DOWNLOAD EBOOK

A coherent introductory text from a groundbreaking researcher, focusing on clarity and motivation to build intuition and understanding.


Large Sample Covariance Matrices and High-Dimensional Data Analysis

Large Sample Covariance Matrices and High-Dimensional Data Analysis

Author: Jianfeng Yao

Publisher: Cambridge University Press

Published: 2015-03-26

Total Pages: 0

ISBN-13: 9781107065178

DOWNLOAD EBOOK

High-dimensional data appear in many fields, and their analysis has become increasingly important in modern statistics. However, it has long been observed that several well-known methods in multivariate analysis become inefficient, or even misleading, when the data dimension p is larger than, say, several tens. A seminal example is the well-known inefficiency of Hotelling's T2-test in such cases. This example shows that classical large sample limits may no longer hold for high-dimensional data; statisticians must seek new limiting theorems in these instances. Thus, the theory of random matrices (RMT) serves as a much-needed and welcome alternative framework. Based on the authors' own research, this book provides a first-hand introduction to new high-dimensional statistical methods derived from RMT. The book begins with a detailed introduction to useful tools from RMT, and then presents a series of high-dimensional problems with solutions provided by RMT methods.


Advances in Statistical Bioinformatics

Advances in Statistical Bioinformatics

Author: Kim-Anh Do

Publisher: Cambridge University Press

Published: 2013-06-10

Total Pages: 499

ISBN-13: 1107244919

DOWNLOAD EBOOK

Providing genome-informed personalized treatment is a goal of modern medicine. Identifying new translational targets in nucleic acid characterizations is an important step toward that goal. The information tsunami produced by such genome-scale investigations is stimulating parallel developments in statistical methodology and inference, analytical frameworks, and computational tools. Within the context of genomic medicine and with a strong focus on cancer research, this book describes the integration of high-throughput bioinformatics data from multiple platforms to inform our understanding of the functional consequences of genomic alterations. This includes rigorous and scalable methods for simultaneously handling diverse data types such as gene expression array, miRNA, copy number, methylation, and next-generation sequencing data. This material is written for statisticians who are interested in modeling and analyzing high-throughput data. Chapters by experts in the field offer a thorough introduction to the biological and technical principles behind multiplatform high-throughput experimentation.


Robust Correlation

Robust Correlation

Author: Georgy L. Shevlyakov

Publisher: John Wiley & Sons

Published: 2016-09-19

Total Pages: 353

ISBN-13: 1118493451

DOWNLOAD EBOOK

This bookpresents material on both the analysis of the classical concepts of correlation and on the development of their robust versions, as well as discussing the related concepts of correlation matrices, partial correlation, canonical correlation, rank correlations, with the corresponding robust and non-robust estimation procedures. Every chapter contains a set of examples with simulated and real-life data. Key features: Makes modern and robust correlation methods readily available and understandable to practitioners, specialists, and consultants working in various fields. Focuses on implementation of methodology and application of robust correlation with R. Introduces the main approaches in robust statistics, such as Huber’s minimax approach and Hampel’s approach based on influence functions. Explores various robust estimates of the correlation coefficient including the minimax variance and bias estimates as well as the most B- and V-robust estimates. Contains applications of robust correlation methods to exploratory data analysis, multivariate statistics, statistics of time series, and to real-life data. Includes an accompanying website featuring computer code and datasets Features exercises and examples throughout the text using both small and large data sets. Theoretical and applied statisticians, specialists in multivariate statistics, robust statistics, robust time series analysis, data analysis and signal processing will benefit from this book. Practitioners who use correlation based methods in their work as well as postgraduate students in statistics will also find this book useful.