The only single, up-to-date source for Grid issues in bioinformatics and biology Bioinformatics is fast emerging as an important discipline for academic research and industrial applications, creating a need for the use of Grid computing techniques for large-scale distributed applications. This book successfully presents Grid algorithms and their real-world applications, provides details on modern and ongoing research, and explores software frameworks that integrate bioinformatics and computational biology. Additional coverage includes: * Bio-ontology and data mining * Data visualization * DNA assembly, clustering, and mapping * Molecular evolution and phylogeny * Gene expression and micro-arrays * Molecular modeling and simulation * Sequence search and alignment * Protein structure prediction * Grid infrastructure, middleware, and tools for bio data Grid Computing for Bioinformatics and Computational Biology is an indispensable resource for professionals in several research and development communities including bioinformatics, computational biology, Grid computing, data mining, and more. It also serves as an ideal textbook for undergraduate- and graduate-level courses in bioinformatics and Grid computing.
Discover how to streamline complex bioinformatics applications with parallel computing This publication enables readers to handle more complex bioinformatics applications and larger and richer data sets. As the editor clearly shows, using powerful parallel computing tools can lead to significant breakthroughs in deciphering genomes, understanding genetic disease, designing customized drug therapies, and understanding evolution. A broad range of bioinformatics applications is covered with demonstrations on how each one can be parallelized to improve performance and gain faster rates of computation. Current parallel computing techniques and technologies are examined, including distributed computing and grid computing. Readers are provided with a mixture of algorithms, experiments, and simulations that provide not only qualitative but also quantitative insights into the dynamic field of bioinformatics. Parallel Computing for Bioinformatics and Computational Biology is a contributed work that serves as a repository of case studies, collectively demonstrating how parallel computing streamlines difficult problems in bioinformatics and produces better results. Each of the chapters is authored by an established expert in the field and carefully edited to ensure a consistent approach and high standard throughout the publication. The work is organized into five parts: * Algorithms and models * Sequence analysis and microarrays * Phylogenetics * Protein folding * Platforms and enabling technologies Researchers, educators, and students in the field of bioinformatics will discover how high-performance computing can enable them to handle more complex data sets, gain deeper insights, and make new discoveries.
The book constitutes the refereed proceedings of the International Workshop on Distributed, High-Performance and Grid Computing in Computational Biology, GCCB 2006, held in Eilat, Israel in January 2007 in conjunction with the 5th European Conference on Computational Biology, ECCB 2006. The 13 revised full papers presented were carefully reviewed and selected from many high quality submissions.
Emerging Trends in Computational Biology, Bioinformatics, and Systems Biology discusses the latest developments in all aspects of computational biology, bioinformatics, and systems biology and the application of data-analytics and algorithms, mathematical modeling, and simu- lation techniques. • Discusses the development and application of data-analytical and theoretical methods, mathematical modeling, and computational simulation techniques to the study of biological and behavioral systems, including applications in cancer research, computational intelligence and drug design, high-performance computing, and biology, as well as cloud and grid computing for the storage and access of big data sets. • Presents a systematic approach for storing, retrieving, organizing, and analyzing biological data using software tools with applications to general principles of DNA/RNA structure, bioinformatics and applications, genomes, protein structure, and modeling and classification, as well as microarray analysis. • Provides a systems biology perspective, including general guidelines and techniques for obtaining, integrating, and analyzing complex data sets from multiple experimental sources using computational tools and software. Topics covered include phenomics, genomics, epigenomics/epigenetics, metabolomics, cell cycle and checkpoint control, and systems biology and vaccination research. • Explains how to effectively harness the power of Big Data tools when data sets are so large and complex that it is difficult to process them using conventional database management systems or traditional data processing applications. - Discusses the development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological and behavioral systems. - Presents a systematic approach for storing, retrieving, organizing and analyzing biological data using software tools with applications. - Provides a systems biology perspective including general guidelines and techniques for obtaining, integrating and analyzing complex data sets from multiple experimental sources using computational tools and software.
Proceedings of the 2019 International Conference on Bioinformatics & Computational Biology (BIOCOMP'19) held July 29th - August 1st, 2019 in Las Vegas, Nevada.
The first comprehensive overview of preprocessing, mining, and postprocessing of biological data Molecular biology is undergoing exponential growth in both the volume and complexity of biological data and knowledge discovery offers the capacity to automate complex search and data analysis tasks. This book presents a vast overview of the most recent developments on techniques and approaches in the field of biological knowledge discovery and data mining (KDD) providing in-depth fundamental and technical field information on the most important topics encountered. Written by top experts, Biological Knowledge Discovery Handbook: Preprocessing, Mining, and Postprocessing of Biological Data covers the three main phases of knowledge discovery (data preprocessing, data processing also known as data mining and data postprocessing) and analyzes both verification systems and discovery systems. BIOLOGICAL DATA PREPROCESSING Part A: Biological Data Management Part B: Biological Data Modeling Part C: Biological Feature Extraction Part D Biological Feature Selection BIOLOGICAL DATA MINING Part E: Regression Analysis of Biological Data Part F Biological Data Clustering Part G: Biological Data Classification Part H: Association Rules Learning from Biological Data Part I: Text Mining and Application to Biological Data Part J: High-Performance Computing for Biological Data Mining Combining sound theory with practical applications in molecular biology, Biological Knowledge Discovery Handbook is ideal for courses in bioinformatics and biological KDD as well as for practitioners and professional researchers in computer science, life science, and mathematics.
Covers the fundamentals and techniques of multiple biological sequence alignment and analysis, and shows readers how to choose the appropriate sequence analysis tools for their tasks This book describes the traditional and modern approaches in biological sequence alignment and homology search. This book contains 11 chapters, with Chapter 1 providing basic information on biological sequences. Next, Chapter 2 contains fundamentals in pair-wise sequence alignment, while Chapters 3 and 4 examine popular existing quantitative models and practical clustering techniques that have been used in multiple sequence alignment. Chapter 5 describes, characterizes and relates many multiple sequence alignment models. Chapter 6 describes how traditionally phylogenetic trees have been constructed, and available sequence knowledge bases can be used to improve the accuracy of reconstructing phylogeny trees. Chapter 7 covers the latest methods developed to improve the run-time efficiency of multiple sequence alignment. Next, Chapter 8 covers several popular existing multiple sequence alignment server and services, and Chapter 9 examines several multiple sequence alignment techniques that have been developed to handle short sequences (reads) produced by the Next Generation Sequencing technique (NSG). Chapter 10 describes a Bioinformatics application using multiple sequence alignment of short reads or whole genomes as input. Lastly, Chapter 11 provides a review of RNA and protein secondary structure prediction using the evolution information inferred from multiple sequence alignments. • Covers the full spectrum of the field, from alignment algorithms to scoring methods, practical techniques, and alignment tools and their evaluations • Describes theories and developments of scoring functions and scoring matrices •Examines phylogeny estimation and large-scale homology search Multiple Biological Sequence Alignment: Scoring Functions, Algorithms and Applications is a reference for researchers, engineers, graduate and post-graduate students in bioinformatics, and system biology and molecular biologists. Ken Nguyen, PhD, is an associate professor at Clayton State University, GA, USA. He received his PhD, MSc and BSc degrees in computer science all from Georgia State University. His research interests are in databases, parallel and distribute computing and bioinformatics. He was a Molecular Basis of Disease fellow at Georgia State and is the recipient of the highest graduate honor at Georgia State, the William M. Suttles Graduate Fellowship. Xuan Guo, PhD, is a postdoctoral associate at Oak Ridge National Lab, USA. He received his PhD degree in computer science from Georgia State University in 2015. His research interests are in bioinformatics, machine leaning, and cloud computing. He is an editorial assistant of International Journal of Bioinformatics Research and Applications. Yi Pan, PhD, is a Regents' Professor of Computer Science and an Interim Associate Dean and Chair of Biology at Georgia State University. He received his BE and ME in computer engineering from Tsinghua University in China and his PhD in computer science from the University of Pittsburgh. Dr. Pan's research interests include parallel and distributed computing, optical networks, wireless networks and bioinformatics. He has published more than 180 journal papers with about 60 papers published in various IEEE/ACM journals. He is co-editor along with Albert Y. Zomaya of the Wiley Series in Bioinformatics.
Grid Technologies for E-Health: Applications for Telemedicine Services and Delivery examines innovations to further improve medical management using grid computing. A defining collection of field advancements, this publication discusses the significance of automation and IT resources in healthcare technology previously infeasible due to computing and data-integration constraints.
Introduces readers to core algorithmic techniques for next-generation sequencing (NGS) data analysis and discusses a wide range of computational techniques and applications This book provides an in-depth survey of some of the recent developments in NGS and discusses mathematical and computational challenges in various application areas of NGS technologies. The 18 chapters featured in this book have been authored by bioinformatics experts and represent the latest work in leading labs actively contributing to the fast-growing field of NGS. The book is divided into four parts: Part I focuses on computing and experimental infrastructure for NGS analysis, including chapters on cloud computing, modular pipelines for metabolic pathway reconstruction, pooling strategies for massive viral sequencing, and high-fidelity sequencing protocols. Part II concentrates on analysis of DNA sequencing data, covering the classic scaffolding problem, detection of genomic variants, including insertions and deletions, and analysis of DNA methylation sequencing data. Part III is devoted to analysis of RNA-seq data. This part discusses algorithms and compares software tools for transcriptome assembly along with methods for detection of alternative splicing and tools for transcriptome quantification and differential expression analysis. Part IV explores computational tools for NGS applications in microbiomics, including a discussion on error correction of NGS reads from viral populations, methods for viral quasispecies reconstruction, and a survey of state-of-the-art methods and future trends in microbiome analysis. Computational Methods for Next Generation Sequencing Data Analysis: Reviews computational techniques such as new combinatorial optimization methods, data structures, high performance computing, machine learning, and inference algorithms Discusses the mathematical and computational challenges in NGS technologies Covers NGS error correction, de novo genome transcriptome assembly, variant detection from NGS reads, and more This text is a reference for biomedical professionals interested in expanding their knowledge of computational techniques for NGS data analysis. The book is also useful for graduate and post-graduate students in bioinformatics.