Probability Models for DNA Sequence Evolution

Probability Models for DNA Sequence Evolution

Author: Rick Durrett

Publisher: Springer Science & Business Media

Published: 2013-03-09

Total Pages: 246

ISBN-13: 1475762852

DOWNLOAD EBOOK

"What underlying forces are responsible for the observed patterns of variability, given a collection of DNA sequences?" In approaching this question a number of probability models are introduced and anyalyzed.Throughout the book, the theory is developed in close connection with data from more than 60 experimental studies that illustrate the use of these results.


Statistical Methods in Molecular Evolution

Statistical Methods in Molecular Evolution

Author: Rasmus Nielsen

Publisher: Springer Science & Business Media

Published: 2006-05-06

Total Pages: 503

ISBN-13: 0387277331

DOWNLOAD EBOOK

In the field of molecular evolution, inferences about past evolutionary events are made using molecular data from currently living species. With the availability of genomic data from multiple related species, molecular evolution has become one of the most active and fastest growing fields of study in genomics and bioinformatics. Most studies in molecular evolution rely heavily on statistical procedures based on stochastic process modelling and advanced computational methods including high-dimensional numerical optimization and Markov Chain Monte Carlo. This book provides an overview of the statistical theory and methods used in studies of molecular evolution. It includes an introductory section suitable for readers that are new to the field, a section discussing practical methods for data analysis, and more specialized sections discussing specific models and addressing statistical issues relating to estimation and model choice. The chapters are written by the leaders of field and they will take the reader from basic introductory material to the state-of-the-art statistical methods. This book is suitable for statisticians seeking to learn more about applications in molecular evolution and molecular evolutionary biologists with an interest in learning more about the theory behind the statistical methods applied in the field. The chapters of the book assume no advanced mathematical skills beyond basic calculus, although familiarity with basic probability theory will help the reader. Most relevant statistical concepts are introduced in the book in the context of their application in molecular evolution, and the book should be accessible for most biology graduate students with an interest in quantitative methods and theory. Rasmus Nielsen received his Ph.D. form the University of California at Berkeley in 1998 and after a postdoc at Harvard University, he assumed a faculty position in Statistical Genomics at Cornell University. He is currently an Ole Rømer Fellow at the University of Copenhagen and holds a Sloan Research Fellowship. His is an associate editor of the Journal of Molecular Evolution and has published more than fifty original papers in peer-reviewed journals on the topic of this book. From the reviews: "...Overall this is a very useful book in an area of increasing importance." Journal of the Royal Statistical Society "I find Statistical Methods in Molecular Evolution very interesting and useful. It delves into problems that were considered very difficult just several years ago...the book is likely to stimulate the interest of statisticians that are unaware of this exciting field of applications. It is my hope that it will also help the 'wet lab' molecular evolutionist to better understand mathematical and statistical methods." Marek Kimmel for the Journal of the American Statistical Association, September 2006 "Who should read this book? We suggest that anyone who deals with molecular data (who does not?) and anyone who asks evolutionary questions (who should not?) ought to consult the relevant chapters in this book." Dan Graur and Dror Berel for Biometrics, September 2006 "Coalescence theory facilitates the merger of population genetics theory with phylogenetic approaches, but still, there are mostly two camps: phylogeneticists and population geneticists. Only a few people are moving freely between them. Rasmus Nielsen is certainly one of these researchers, and his work so far has merged many population genetic and phylogenetic aspects of biological research under the umbrella of molecular evolution. Although Nielsen did not contribute a chapter to his book, his work permeates all its chapters. This book gives an overview of his interests and current achievements in molecular evolution. In short, this book should be on your bookshelf." Peter Beerli for Evolution, 60(2), 2006


Bayesian Modeling in Bioinformatics

Bayesian Modeling in Bioinformatics

Author: Dipak K. Dey

Publisher: CRC Press

Published: 2010-09-03

Total Pages: 466

ISBN-13: 1420070185

DOWNLOAD EBOOK

Bayesian Modeling in Bioinformatics discusses the development and application of Bayesian statistical methods for the analysis of high-throughput bioinformatics data arising from problems in molecular and structural biology and disease-related medical research, such as cancer. It presents a broad overview of statistical inference, clustering, and c


Codon Evolution

Codon Evolution

Author: Gina M. Cannarozzi

Publisher: Oxford University Press

Published: 2012-02-23

Total Pages: 297

ISBN-13: 019960116X

DOWNLOAD EBOOK

The second part of the book focuses on codon usage bias.


Accounting for Dependent Evolution Among Sites

Accounting for Dependent Evolution Among Sites

Author: Chris Anthony Nasrallah

Publisher:

Published: 2012

Total Pages: 142

ISBN-13:

DOWNLOAD EBOOK

Models of the evolution of DNA sequences typically assume that each position of the sequence evolves independently of all others. This assumption is unrealistic in most cases and is made either for simplicity, computational tractability, or because the nature of the dependence may not be well understood. Proteins and RNAs present instances in which the three dimensional structure of the molecules are essential for function, and introduce dependence among sites in clearly defined ways. Here I explore models that can account for dependence among sites, use them to explore the evolution of DNA sequences containing dependence both within a population and between species, and develop a new substitution model that can be used to make inferences about the strength of natural selection acting on these sequences. In the first chapter I demonstrate the importance of accounting for dependent evolution among sites for phylogenetic inference. Using a realistic model of the evolution of proteins and RNAs based on known structures, I simulate the evolution of DNA sequences in which the evolution at each site can depend on many other positions in the sequence. Using these simulated data I show that phylogenetic methods that assume sites evolve independently are impaired in their ability to infer the true topology relating the species, and I quantify the error in this estimation as a function of the strength of the dependence, the tree length, the topology, and the specific type of molecular structure. This underscores the importance of accounting for such dependent evolution among sites in studies of molecular evolution. In the second chapter I explore the dynamics of the substitution process within a population rather than between species. One of the central questions when accounting for epistatic interactions among sites is how two changes, which when taken together are neutral, can spread in a population when a single change in isolation is deleterious. This process of compensatory evolution has been explored by population genetics theory in the case when natural selection acting against the intermediate state is very strong. Here I explore the case in which natural selection against the intermediate states is moderate to weak using forward time population genetic simulations of the simplest possible case of two dependent sites. I show that when selection is weak the two substitutions can be made one at a time, that as selection increases the substitutions are made more frequently in tandem, and how these patterns are functions of population size, mutation rate, and recombination. In the third chapter I utilize the insights about the dynamics of compensatory evolution within a population from the second chapter to reexamine the evolution of dependent sites between species. I develop a new substitution model for the analysis of RNA that accounts for the probability of the different pathways to compensatory substitution. This model is interpretive, in that parameters have direct meaning with respect to the strength of natural selection acting against deleterious intermediate states. I implement this model in a Bayesian framework for parameter estimation, and demonstrate its utility for making inferences about historical selective pressures on RNA sequences using a 5S ribosomal RNA dataset. This represents the first probabilistic evolutionary model that both accounts for dependent evolution among sites and connects population genetic dynamics with substitution patterns between species. Taken together, these studies reveal a great deal about the nature of the evolutionary process when sites are not independent. They explore these processes both within a population and between species, and then use insights from one to better inform the other, attempting to connect these two historically separate approaches to the study of evolution. The advances here are not limited to RNA and proteins, but are generally applicable to any instance in which epistatic interactions can be found, from speciation genetics to the evolution of functional morphology.


Biological Sequence Analysis

Biological Sequence Analysis

Author: Richard Durbin

Publisher: Cambridge University Press

Published: 1998-04-23

Total Pages: 372

ISBN-13: 113945739X

DOWNLOAD EBOOK

Probabilistic models are becoming increasingly important in analysing the huge amount of data being produced by large-scale DNA-sequencing efforts such as the Human Genome Project. For example, hidden Markov models are used for analysing biological sequences, linguistic-grammar-based probabilistic models for identifying RNA secondary structure, and probabilistic evolutionary models for inferring phylogenies of sequences from different organisms. This book gives a unified, up-to-date and self-contained account, with a Bayesian slant, of such methods, and more generally to probabilistic methods of sequence analysis. Written by an interdisciplinary team of authors, it aims to be accessible to molecular biologists, computer scientists, and mathematicians with no formal knowledge of the other fields, and at the same time present the state-of-the-art in this new and highly important field.


The Phylogenetic Handbook

The Phylogenetic Handbook

Author: Marco Salemi

Publisher: Cambridge University Press

Published: 2009-03-26

Total Pages: 750

ISBN-13: 0521877105

DOWNLOAD EBOOK

A broad, hands on guide with detailed explanations of current methodology, relevant exercises and popular software tools.