Computational Methods for Integrative Inference of Genome-scale Gene Regulatory Networks

Computational Methods for Integrative Inference of Genome-scale Gene Regulatory Networks

Author: Alireza Fotuhi Siahpirani

Publisher:

Published: 2019

Total Pages: 156

ISBN-13:

DOWNLOAD EBOOK

Inference of transcriptional regulatory networks is an important filed of research in systems biology, and many computational methods have been developed to infer regulatory networks from different types of genomic data. One of the most popular classes of computational network inference methods is expression based network inference. Given the mRNA levels of genes, these methods reconstruct a network between regulatory genes (called transcription factors) and potential target genes that best explains the input data. However, it has been shown that the networks that are inferred only using expression, have low agreement with experimentally validated physical regulatory interactions. In recent years, many methods have been developed to improve the accuracy of these computational methods by incorporating additional data types. In this dissertation, we describe our contributions towards advancing the state of the art in this field. Our first contribution, is developing a prior-based network inference method, MERLIN-P. MERLIN-P uses both expression of genes, and prior knowledge of interactions between regulatory genes and their potential targets, and infers a network that is supported by both expression and prior knowledge. Using a logistic function, MERLIN-P could incorporate and combine multiple sources of prior knowledge. The inferred networks in yeast, outperform state of the art expression based network inference methods, and perform better or at a par with prior based state of the art method. Our second contribution, is developing a method to estimate transcription factor activity from a noisy prior network, NCA+LASSO. Network Component Analysis (NCA), is a computational method that given expression of target genes and a (potentially incomplete and noisy) network structure that describes the connection of regulatory genes to these target genes, estimates unobserved activity of the regulators (transcription factor activities, TFA). It has been shown that using TFA can improve the quality of inferred networks. However, our prior knowledge in new contexts could be incomplete and noisy, and we do not know to what extent presence of noise in input network affects the quality of estimated TFA. We first show how presence of noise in the input prior network can decrease the quality of estimated TFA, and then show that by adding a regularization term, we can improve the quality of the estimated TFA. We show that using estimated TFA instead of just expression of TFs in network inference, improves the agreement of inferred networks to experimentally validated physical interactions, for all state of the art methods, including MERLIN-P. Our final contribution, is developing a multi-task inference method, Dynamic Regulatory Module Network (DRMN), that simultaneously infers regulatory networks for related cell lines, while taking into account the expected similarity of the cell lines. Many biological contexts are hierarchically related, and leveraging the similarity of these contexts could help us infer more accurate regulatory programs in each context. However, the small number of measurements in each context makes the inference of regulatory networks challenging. By inferring regulatory programs at module level (groups of co-expressed genes), DRMN is able to handle the small number of measurements, while the use of multi-task learning allows for incorporation of hierarchical relationship of contexts. DRMN first infers modules of co-expressed genes in each cell line, then infers a regulatory network for each module, and iteratively updates the inferred modules to reflect both co-expression and co-regulation, and updates the inferred networks to reflect the updated modules. We assess the accuracy of the inferred networks by predicting the expression on hold out genes, and show that the resulting modules and networks, provide insight into the process of differentiation between these related cell lines. For all the developed methods, we validate our results by comparing to known experimentally validated networks, and show that our results provide useful insight into the biological processes under consideration. Specifically, in chapter 2, we evaluated our inferred networks based on both network structure and predictive power, identified TFs that all tested methods fail to recover their target sets, and explored potential reasons that can explain this failure. Additionally, we used our method to infer stress specific networks, and evaluated predictions using stress specific knock-down experiments. In chapter 3, we evaluated our inferred networks based on both network structure and predictive power, and furthermore used our inferred networks to identify potential regulators that could be important for pluripotency state in mESC. We tested the effect of these regulators using shRNA experiments, and experimentally validated some of their predicted targets. Finally, in chapter 4, we evaluated our inferred models based on their predictive power and ability to predict gene expression in hold out data.


Computational Methods for Analyzing and Modeling Gene Regulation and 3D Genome Organization

Computational Methods for Analyzing and Modeling Gene Regulation and 3D Genome Organization

Author: Anastasiya Belyaeva

Publisher:

Published: 2021

Total Pages:

ISBN-13:

DOWNLOAD EBOOK

Biological processes from differentiation to disease progression are governed by gene regulatory mechanisms. Currently large-scale omics and imaging data sets are being collected to characterize gene regulation at every level. Such data sets present new opportunities and challenges for extracting biological insights and elucidating the gene regulatory logic of cells. In this thesis, I present computational methods for the analysis and integration of various data types used for cell profiling. Specifically, I focus on analyzing and linking gene expression with the 3D organization of the genome. First, I describe methodologies for elucidating gene regulatory mechanisms by considering multiple data modalities. I design a computational framework for identifying colocalized and coregulated chromosome regions by integrating gene expression and epigenetic marks with 3D interactions using network analysis. Then, I provide a general framework for data integration using autoencoders and apply it for the integration and translation between gene expression and chromatin images of naive T-cells. Second, I describe methods for analyzing single modalities such as contact frequency data, which measures the spatial organization of the genome, and gene expression data. Given the important role of the 3D genome organization in gene regulation, I present a methodology for reconstructing the 3D diploid conformation of the genome from contact frequency data. Given the ubiquity of gene expression data and the recent advances in single-cell RNA-sequencing technologies as well as the need for causal modeling of gene regulatory mechanisms, I then describe an algorithm as well as a software tool, difference causal inference (DCI), for learning causal gene regulatory networks from gene expression data. DCI addresses the problem of directly learning differences between causal gene regulatory networks given gene expression data from two related conditions. Finally, I shift my focus from basic biology to drug discovery. Given the current COVID19 pandemic, I present a computational drug repurposing platform that enables the identification of FDA approved compounds for drug repurposing and investigation of potential causal drug mechanisms. This framework relies on identifying drugs that reverse the signature of the infection in the space learned by an autoencoder and then uses causal inference to identify putative drug mechanisms.


Gene Regulatory Networks

Gene Regulatory Networks

Author: Guido Sanguinetti

Publisher: Humana

Published: 2018-12-14

Total Pages: 0

ISBN-13: 9781493988815

DOWNLOAD EBOOK

This volume explores recent techniques for the computational inference of gene regulatory networks (GRNs). The chapters in this book cover topics such as methods to infer GRNs from time-varying data; the extraction of causal information from biological data; GRN inference from multiple heterogeneous data sets; non-parametric and hybrid statistical methods; the joint inference of differential networks; and mechanistic models of gene regulation dynamics. Written in the highly successful Methods in Molecular Biology series format, chapters include introductions to their respective topics, descriptions of recently developed methods for GRN inference, applications of these methods on real and/ or simulated biological data, and step-by-step tutorials on the usage of associated software tools. Cutting-edge and thorough, Gene Regulatory Networks: Methods and Protocols is an essential tool for evaluating the current research needed to further address the common challenges faced by specialists in this field.


Computational Modeling Of Gene Regulatory Networks - A Primer

Computational Modeling Of Gene Regulatory Networks - A Primer

Author: Hamid Bolouri

Publisher: World Scientific Publishing Company

Published: 2008-08-13

Total Pages: 341

ISBN-13: 1848168187

DOWNLOAD EBOOK

This book serves as an introduction to the myriad computational approaches to gene regulatory modeling and analysis, and is written specifically with experimental biologists in mind. Mathematical jargon is avoided and explanations are given in intuitive terms. In cases where equations are unavoidable, they are derived from first principles or, at the very least, an intuitive description is provided. Extensive examples and a large number of model descriptions are provided for use in both classroom exercises as well as self-guided exploration and learning. As such, the book is ideal for self-learning and also as the basis of a semester-long course for undergraduate and graduate students in molecular biology, bioengineering, genome sciences, or systems biology./a


Evolutionary Computation in Gene Regulatory Network Research

Evolutionary Computation in Gene Regulatory Network Research

Author: Hitoshi Iba

Publisher: John Wiley & Sons

Published: 2016-01-20

Total Pages: 464

ISBN-13: 1119079772

DOWNLOAD EBOOK

Introducing a handbook for gene regulatory network research using evolutionary computation, with applications for computer scientists, computational and system biologists This book is a step-by-step guideline for research in gene regulatory networks (GRN) using evolutionary computation (EC). The book is organized into four parts that deliver materials in a way equally attractive for a reader with training in computation or biology. Each of these sections, authored by well-known researchers and experienced practitioners, provides the relevant materials for the interested readers. The first part of this book contains an introductory background to the field. The second part presents the EC approaches for analysis and reconstruction of GRN from gene expression data. The third part of this book covers the contemporary advancements in the automatic construction of gene regulatory and reaction networks and gives direction and guidelines for future research. Finally, the last part of this book focuses on applications of GRNs with EC in other fields, such as design, engineering and robotics. • Provides a reference for current and future research in gene regulatory networks (GRN) using evolutionary computation (EC) • Covers sub-domains of GRN research using EC, such as expression profile analysis, reverse engineering, GRN evolution, applications • Contains useful contents for courses in gene regulatory networks, systems biology, computational biology, and synthetic biology • Delivers state-of-the-art research in genetic algorithms, genetic programming, and swarm intelligence Evolutionary Computation in Gene Regulatory Network Research is a reference for researchers and professionals in computer science, systems biology, and bioinformatics, as well as upper undergraduate, graduate, and postgraduate students. Hitoshi Iba is a Professor in the Department of Information and Communication Engineering, Graduate School of Information Science and Technology, at the University of Tokyo, Toyko, Japan. He is an Associate Editor of the IEEE Transactions on Evolutionary Computation and the journal of Genetic Programming and Evolvable Machines. Nasimul Noman is a lecturer in the School of Electrical Engineering and Computer Science at the University of Newcastle, NSW, Australia. From 2002 to 2012 he was a faculty member at the University of Dhaka, Bangladesh. Noman is an Editor of the BioMed Research International journal. His research interests include computational biology, synthetic biology, and bioinformatics.


Computational Methods for the Analysis of Genomic Data and Biological Processes

Computational Methods for the Analysis of Genomic Data and Biological Processes

Author: Francisco A. Gómez Vela

Publisher: MDPI

Published: 2021-02-05

Total Pages: 222

ISBN-13: 3039437712

DOWNLOAD EBOOK

In recent decades, new technologies have made remarkable progress in helping to understand biological systems. Rapid advances in genomic profiling techniques such as microarrays or high-performance sequencing have brought new opportunities and challenges in the fields of computational biology and bioinformatics. Such genetic sequencing techniques allow large amounts of data to be produced, whose analysis and cross-integration could provide a complete view of organisms. As a result, it is necessary to develop new techniques and algorithms that carry out an analysis of these data with reliability and efficiency. This Special Issue collected the latest advances in the field of computational methods for the analysis of gene expression data, and, in particular, the modeling of biological processes. Here we present eleven works selected to be published in this Special Issue due to their interest, quality, and originality.


The Regulatory Genome

The Regulatory Genome

Author: Eric H. Davidson

Publisher: Elsevier

Published: 2010-07-19

Total Pages: 303

ISBN-13: 0080455573

DOWNLOAD EBOOK

Gene regulatory networks are the most complex, extensive control systems found in nature. The interaction between biology and evolution has been the subject of great interest in recent years. The author, Eric Davidson, has been instrumental in elucidating this relationship. He is a world renowned scientist and a major contributor to the field of developmental biology. The Regulatory Genome beautifully explains the control of animal development in terms of structure/function relations of inherited regulatory DNA sequence, and the emergent properties of the gene regulatory networks composed of these sequences. New insights into the mechanisms of body plan evolution are derived from considerations of the consequences of change in developmental gene regulatory networks. Examples of crucial evidence underscore each major concept. The clear writing style explains regulatory causality without requiring a sophisticated background in descriptive developmental biology. This unique text supersedes anything currently available in the market. The only book in the market that is solely devoted to the genomic regulatory code for animal development Written at a conceptual level, including many novel synthetic concepts that ultimately simplify understanding Presents a comprehensive treatment of molecular control elements that determine the function of genes Provides a comparative treatment of development, based on principles rather than description of developmental processes Considers the evolutionary processes in terms of the structural properties of gene regulatory networks Includes 42 full-color descriptive figures and diagrams


Gene Regulatory Network Inference Using Machine Learning Techniques

Gene Regulatory Network Inference Using Machine Learning Techniques

Author: Stephanie Kamgnia Wonkap

Publisher:

Published: 2020

Total Pages: 0

ISBN-13:

DOWNLOAD EBOOK

Systems Biology is a field that models complex biological systems in order to better understand the working of cells and organisms. One of the systems modeled is the gene regulatory network that plays the critical role of controlling an organism's response to changes in its environment. Ideally, we would like a model of the complete gene regulatory network. In recent years, several advances in technology have permitted the collection of an unprecedented amount and variety of data such as genomes, gene expression data, time-series data, and perturbation data. This has stimulated research into computational methods that reconstruct, or infer, models of the gene regulatory network from the data. Many solutions have been proposed, yet there remain open challenges in utilising the range of available data as it is inherently noisy, and must be integrated by the inference techniques. The thesis seeks to contribute to this discourse by investigating challenges of performance, scale, and data integration. We propose a new algorithm BENIN that views network inference as feature selection to address issues of scale, that uses elastic net regression for improved performance, and adapts elastic net to integrate different types of biological data. The BENIN algorithm is benchmarked on a synthetic dataset from the DREAM4 challenge, and on real expression data for the human HeLa cell cycle. On the DREAM4 dataset BENIN out-performed all DREAM4 competitors on the size 100 subchallenge, and is also competitive with more recent state-of-the-art methods. Moreover, on the HeLa cell cycle data, BENIN could infer known regulatory interactions and propose new interactions that warrant further experimental investigation. Keys words: gene regulatory network, network inference, feature selection, elastic net regression.