Machine Learning Approaches for Extracting Biological Insights from Heterogeneous Omics Data

Machine Learning Approaches for Extracting Biological Insights from Heterogeneous Omics Data

Author: Tongxin Wang

Publisher:

Published: 2021

Total Pages: 0

ISBN-13:

DOWNLOAD EBOOK

With the breakthrough in biomedical technologies over the last decades, the field of biomedical research has entered the "big data" era. Rapid advancement in high-throughput omics technologies has generated a tremendous amount of data that requires incorporating machine learning algorithms for effective analysis. With the consistent evolution in omics technologies, the data being generated are not only growing in scale but also in complexity and heterogeneity. While the ever-changing and ever-growing omics data keep bringing new computational challenges that demand new computation tools, they also bring new opportunities for a deeper and more comprehensive view into the underlying biomedical problems. To address the computational challenges brought by the continuous development of omics technologies, we focus on developing data-driven approaches that utilize machine learning for better exploiting the omics data for biological insights. Specifically, following the transformation of omics technologies, we develop methodologies, frameworks, and algorithms for omics data with different complexity and heterogeneity, ranging from single-omics to multi-omics data, as well as from bulk sequencing to single-cell sequencing data.


Machine Learning Methods for Multi-Omics Data Integration

Machine Learning Methods for Multi-Omics Data Integration

Author: Abedalrhman Alkhateeb

Publisher: Springer Nature

Published: 2023-12-15

Total Pages: 171

ISBN-13: 303136502X

DOWNLOAD EBOOK

The advancement of biomedical engineering has enabled the generation of multi-omics data by developing high-throughput technologies, such as next-generation sequencing, mass spectrometry, and microarrays. Large-scale data sets for multiple omics platforms, including genomics, transcriptomics, proteomics, and metabolomics, have become more accessible and cost-effective over time. Integrating multi-omics data has become increasingly important in many research fields, such as bioinformatics, genomics, and systems biology. This integration allows researchers to understand complex interactions between biological molecules and pathways. It enables us to comprehensively understand complex biological systems, leading to new insights into disease mechanisms, drug discovery, and personalized medicine. Still, integrating various heterogeneous data types into a single learning model also comes with challenges. In this regard, learning algorithms have been vital in analyzing and integrating these large-scale heterogeneous data sets into one learning model. This book overviews the latest multi-omics technologies, machine learning techniques for data integration, and multi-omics databases for validation. It covers different types of learning for supervised and unsupervised learning techniques, including standard classifiers, deep learning, tensor factorization, ensemble learning, and clustering, among others. The book categorizes different levels of integrations, ranging from early, middle, or late-stage among multi-view models. The underlying models target different objectives, such as knowledge discovery, pattern recognition, disease-related biomarkers, and validation tools for multi-omics data. Finally, the book emphasizes practical applications and case studies, making it an essential resource for researchers and practitioners looking to apply machine learning to their multi-omics data sets. The book covers data preprocessing, feature selection, and model evaluation, providing readers with a practical guide to implementing machine learning techniques on various multi-omics data sets.


Handbook of Machine Learning Applications for Genomics

Handbook of Machine Learning Applications for Genomics

Author: Sanjiban Sekhar Roy

Publisher: Springer Nature

Published: 2022-06-23

Total Pages: 222

ISBN-13: 9811691584

DOWNLOAD EBOOK

Currently, machine learning is playing a pivotal role in the progress of genomics. The applications of machine learning are helping all to understand the emerging trends and the future scope of genomics. This book provides comprehensive coverage of machine learning applications such as DNN, CNN, and RNN, for predicting the sequence of DNA and RNA binding proteins, expression of the gene, and splicing control. In addition, the book addresses the effect of multiomics data analysis of cancers using tensor decomposition, machine learning techniques for protein engineering, CNN applications on genomics, challenges of long noncoding RNAs in human disease diagnosis, and how machine learning can be used as a tool to shape the future of medicine. More importantly, it gives a comparative analysis and validates the outcomes of machine learning methods on genomic data to the functional laboratory tests or by formal clinical assessment. The topics of this book will cater interest to academicians, practitioners working in the field of functional genomics, and machine learning. Also, this book shall guide comprehensively the graduate, postgraduates, and Ph.D. scholars working in these fields.


Machine Learning Approaches to Bioinformatics

Machine Learning Approaches to Bioinformatics

Author: Zheng Rong Yang

Publisher: World Scientific

Published: 2010

Total Pages: 337

ISBN-13: 981428730X

DOWNLOAD EBOOK

This book covers a wide range of subjects in applying machine learning approaches for bioinformatics projects. The book succeeds on two key unique features. First, it introduces the most widely used machine learning approaches in bioinformatics and discusses, with evaluations from real case studies, how they are used in individual bioinformatics projects. Second, it introduces state-of-the-art bioinformatics research methods. Furthermore, the book includes R codes and example data sets to help readers develop their own bioinformatics research skills. The theoretical parts and the practical parts are well integrated for readers to follow the existing procedures in individual research. Unlike most of the bioinformatics textbooks on the market, the content coverage is not limited to just one subject. A broad spectrum of relevant topics in bioinformatics including systematic data mining and computational systems biology researches are brought together in this book, thereby offering an efficient and convenient platform for undergraduate/graduate teaching. An essential textbook for both final year undergraduates and graduate students in universities, as well as a comprehensive handbook for new researchers, this book will also serve as a practical guide for software development in relevant bioinformatics projects.


Systems Analytics and Integration of Big Omics Data

Systems Analytics and Integration of Big Omics Data

Author: Gary Hardiman

Publisher: MDPI

Published: 2020-04-15

Total Pages: 202

ISBN-13: 3039287443

DOWNLOAD EBOOK

A “genotype" is essentially an organism's full hereditary information which is obtained from its parents. A "phenotype" is an organism's actual observed physical and behavioral properties. These may include traits such as morphology, size, height, eye color, metabolism, etc. One of the pressing challenges in computational and systems biology is genotype-to-phenotype prediction. This is challenging given the amount of data generated by modern Omics technologies. This “Big Data” is so large and complex that traditional data processing applications are not up to the task. Challenges arise in collection, analysis, mining, sharing, transfer, visualization, archiving, and integration of these data. In this Special Issue, there is a focus on the systems-level analysis of Omics data, recent developments in gene ontology annotation, and advances in biological pathways and network biology. The integration of Omics data with clinical and biomedical data using machine learning is explored. This Special Issue covers new methodologies in the context of gene–environment interactions, tissue-specific gene expression, and how external factors or host genetics impact the microbiome.


Data Analytics in Bioinformatics

Data Analytics in Bioinformatics

Author: Rabinarayan Satpathy

Publisher: John Wiley & Sons

Published: 2021-01-20

Total Pages: 433

ISBN-13: 111978560X

DOWNLOAD EBOOK

Machine learning techniques are increasingly being used to address problems in computational biology and bioinformatics. Novel machine learning computational techniques to analyze high throughput data in the form of sequences, gene and protein expressions, pathways, and images are becoming vital for understanding diseases and future drug discovery. Machine learning techniques such as Markov models, support vector machines, neural networks, and graphical models have been successful in analyzing life science data because of their capabilities in handling randomness and uncertainty of data noise and in generalization. Machine Learning in Bioinformatics compiles recent approaches in machine learning methods and their applications in addressing contemporary problems in bioinformatics approximating classification and prediction of disease, feature selection, dimensionality reduction, gene selection and classification of microarray data and many more.


Biological Pattern Discovery With R: Machine Learning Approaches

Biological Pattern Discovery With R: Machine Learning Approaches

Author: Zheng Rong Yang

Publisher: World Scientific

Published: 2021-09-17

Total Pages: 462

ISBN-13: 9811240132

DOWNLOAD EBOOK

This book provides the research directions for new or junior researchers who are going to use machine learning approaches for biological pattern discovery. The book was written based on the research experience of the author's several research projects in collaboration with biologists worldwide. The chapters are organised to address individual biological pattern discovery problems. For each subject, the research methodologies and the machine learning algorithms which can be employed are introduced and compared. Importantly, each chapter was written with the aim to help the readers to transfer their knowledge in theory to practical implementation smoothly. Therefore, the R programming environment was used for each subject in the chapters. The author hopes that this book can inspire new or junior researchers' interest in biological pattern discovery using machine learning algorithms.