Building Bridges Between Soft and Statistical Methodologies for Data Science

Building Bridges Between Soft and Statistical Methodologies for Data Science

Author: Luis A. García-Escudero

Publisher:

Published: 2023

Total Pages: 0

ISBN-13: 9783031155109

DOWNLOAD EBOOK

Nowadays, data analysis is becoming an appealing topic due to the emergence of new data types, dimensions, and sources. This motivates the development of probabilistic/statistical approaches and tools to cope with these data. Different communities of experts, namely statisticians, mathematicians, computer scientists, engineers, econometricians, and psychologists are more and more interested in facing this challenge. As a consequence, there is a clear need to build bridges between all these communities for Data Science. This book contains more than fifty selected recent contributions aiming to establish the above referred bridges. These contributions address very different and relevant aspects such as imprecise probabilities, information theory, random sets and random fuzzy sets, belief functions, possibility theory, dependence modelling and copulas, clustering, depth concepts, dimensionality reduction of complex data and robustness.


Building Bridges between Soft and Statistical Methodologies for Data Science

Building Bridges between Soft and Statistical Methodologies for Data Science

Author: Luis A. García-Escudero

Publisher: Springer Nature

Published: 2022-08-24

Total Pages: 421

ISBN-13: 3031155092

DOWNLOAD EBOOK

Nowadays, data analysis is becoming an appealing topic due to the emergence of new data types, dimensions, and sources. This motivates the development of probabilistic/statistical approaches and tools to cope with these data. Different communities of experts, namely statisticians, mathematicians, computer scientists, engineers, econometricians, and psychologists are more and more interested in facing this challenge. As a consequence, there is a clear need to build bridges between all these communities for Data Science. This book contains more than fifty selected recent contributions aiming to establish the above referred bridges. These contributions address very different and relevant aspects such as imprecise probabilities, information theory, random sets and random fuzzy sets, belief functions, possibility theory, dependence modelling and copulas, clustering, depth concepts, dimensionality reduction of complex data and robustness.


Reasoning Web. Causality, Explanations and Declarative Knowledge

Reasoning Web. Causality, Explanations and Declarative Knowledge

Author: Leopoldo Bertossi

Publisher: Springer Nature

Published: 2023-04-27

Total Pages: 219

ISBN-13: 303131414X

DOWNLOAD EBOOK

The purpose of the Reasoning Web Summer School is to disseminate recent advances on reasoning techniques and related issues that are of particular interest to Semantic Web and Linked Data applications. It is primarily intended for postgraduate students, postdocs, young researchers, and senior researchers wishing to deepen their knowledge. As in the previous years, lectures in the summer school were given by a distinguished group of expert lecturers. The broad theme of this year's summer school was “Reasoning in Probabilistic Models and Machine Learning” and it covered various aspects of ontological reasoning and related issues that are of particular interest to Semantic Web and Linked Data applications. The following eight lectures were presented during the school: Logic-Based Explainability in Machine Learning; Causal Explanations and Fairness in Data; Statistical Relational Extensions of Answer Set Programming; Vadalog: Its Extensions and Business Applications; Cross-Modal Knowledge Discovery, Inference, and Challenges; Reasoning with Tractable Probabilistic Circuits; From Statistical Relational to Neural Symbolic Artificial Intelligence; Building Intelligent Data Apps in Rel using Reasoning and Probabilistic Modelling.


Statistical Foundations of Data Science

Statistical Foundations of Data Science

Author: Jianqing Fan

Publisher: CRC Press

Published: 2020-09-21

Total Pages: 942

ISBN-13: 0429527616

DOWNLOAD EBOOK

Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.


Advanced Statistical Methods in Data Science

Advanced Statistical Methods in Data Science

Author: Ding-Geng Chen

Publisher: Springer

Published: 2016-11-30

Total Pages: 229

ISBN-13: 9811025940

DOWNLOAD EBOOK

This book gathers invited presentations from the 2nd Symposium of the ICSA- CANADA Chapter held at the University of Calgary from August 4-6, 2015. The aim of this Symposium was to promote advanced statistical methods in big-data sciences and to allow researchers to exchange ideas on statistics and data science and to embraces the challenges and opportunities of statistics and data science in the modern world. It addresses diverse themes in advanced statistical analysis in big-data sciences, including methods for administrative data analysis, survival data analysis, missing data analysis, high-dimensional and genetic data analysis, longitudinal and functional data analysis, the design and analysis of studies with response-dependent and multi-phase designs, time series and robust statistics, statistical inference based on likelihood, empirical likelihood and estimating functions. The editorial group selected 14 high-quality presentations from this successful symposium and invited the presenters to prepare a full chapter for this book in order to disseminate the findings and promote further research collaborations in this area. This timely book offers new methods that impact advanced statistical model development in big-data sciences.


Foundations of Statistics for Data Scientists

Foundations of Statistics for Data Scientists

Author: Alan Agresti

Publisher: CRC Press

Published: 2021-11-22

Total Pages: 486

ISBN-13: 1000462919

DOWNLOAD EBOOK

Foundations of Statistics for Data Scientists: With R and Python is designed as a textbook for a one- or two-term introduction to mathematical statistics for students training to become data scientists. It is an in-depth presentation of the topics in statistical science with which any data scientist should be familiar, including probability distributions, descriptive and inferential statistical methods, and linear modeling. The book assumes knowledge of basic calculus, so the presentation can focus on "why it works" as well as "how to do it." Compared to traditional "mathematical statistics" textbooks, however, the book has less emphasis on probability theory and more emphasis on using software to implement statistical methods and to conduct simulations to illustrate key concepts. All statistical analyses in the book use R software, with an appendix showing the same analyses with Python. The book also introduces modern topics that do not normally appear in mathematical statistics texts but are highly relevant for data scientists, such as Bayesian inference, generalized linear models for non-normal responses (e.g., logistic regression and Poisson loglinear models), and regularized model fitting. The nearly 500 exercises are grouped into "Data Analysis and Applications" and "Methods and Concepts." Appendices introduce R and Python and contain solutions for odd-numbered exercises. The book's website has expanded R, Python, and Matlab appendices and all data sets from the examples and exercises.


Statistical Methods in e-Commerce Research

Statistical Methods in e-Commerce Research

Author: Wolfgang Jank

Publisher: John Wiley & Sons

Published: 2008-12-29

Total Pages: 451

ISBN-13: 0470323183

DOWNLOAD EBOOK

This groundbreaking book introduces the application of statistical methodologies to e-Commerce data With the expanding presence of technology in today's economic market, the use of the Internet for buying, selling, and investing is growing more popular and public in nature. Statistical Methods in e-Commerce Research is the first book of its kind to focus on the statistical models and methods that are essential in order to analyze information from electronic-commerce (e-Commerce) transactions, identify the challenges that arise with new e-Commerce data structures, and discover new knowledge about consumer activity. This collection gathers over thirty researchers and practitioners from the fields of statistics, computer science, information systems, and marketing to discuss the growing use of statistical methods in e-Commerce research. From privacy protection to economic impact, the book first identifies the many obstacles that are encountered while collecting, cleaning, exploring, and analyzing e-Commerce data. Solutions to these problems are then suggested using established and newly developed statistical and data mining methods. Finally, a look into the future of this evolving area of study is provided through an in-depth discussion of the emerging methods for conducting e-Commerce research. Statistical Methods in e-Commerce Research successfully bridges the gap between statistics and e-Commerce, introducing a statistical approach to solving challenges that arise in the context of online transactions, while also introducing a wide range of e-Commerce applications and problems where novel statistical methodology is warranted. It is an ideal text for courses on e-Commerce at the upper-undergraduate and graduate levels and also serves as a valuable reference for researchers and analysts across a wide array of subject areas, including economics, marketing, and information systems who would like to gain a deeper understanding of the use of statistics in their work.


New Advances in Statistics and Data Science

New Advances in Statistics and Data Science

Author: Ding-Geng Chen

Publisher: Springer

Published: 2018-01-26

Total Pages: 348

ISBN-13: 9783319694153

DOWNLOAD EBOOK

This book is comprised of the presentations delivered at the 25th ICSA Applied Statistics Symposium held at the Hyatt Regency Atlanta, on June 12-15, 2016. This symposium attracted more than 700 statisticians and data scientists working in academia, government, and industry from all over the world. The theme of this conference was the “Challenge of Big Data and Applications of Statistics,” in recognition of the advent of big data era, and the symposium offered opportunities for learning, receiving inspirations from old research ideas and for developing new ones, and for promoting further research collaborations in the data sciences. The invited contributions addressed rich topics closely related to big data analysis in the data sciences, reflecting recent advances and major challenges in statistics, business statistics, and biostatistics. Subsequently, the six editors selected 19 high-quality presentations and invited the speakers to prepare full chapters for this book, which showcases new methods in statistics and data sciences, emerging theories, and case applications from statistics, data science and interdisciplinary fields. The topics covered in the book are timely and have great impact on data sciences, identifying important directions for future research, promoting advanced statistical methods in big data science, and facilitating future collaborations across disciplines and between theory and practice.