Genomics in the Cloud

Author: Geraldine A. Van der Auwera

Publisher: O'Reilly Media

Published: 2020-04-02

Total Pages: 496

ISBN-10: 1491975164

Data in the genomics field is booming. In just a few years, organizations such as the National Institutes of Health (NIH) will host 50+ petabytes (over 50 million gigabytes) of genomic data, and they’re turning to cloud infrastructure to make that data available to the research community. How do you adapt analysis tools and protocols to access and analyze that volume of data in the cloud? With this practical book, researchers will learn how to work with genomics algorithms using open source tools including the Genome Analysis Toolkit (GATK), Docker, WDL, and Terra. Geraldine Van der Auwera, longtime custodian of the GATK user community, and Brian O’Connor of the UC Santa Cruz Genomics Institute guide you through the process. You’ll learn by working with real data and genomics algorithms from the field. This book covers: essential genomics and computing technology background; basic cloud computing operations; getting started with GATK, plus three major GATK Best Practices pipelines; automating analysis with scripted workflows using WDL and Cromwell; scaling up workflow execution in the cloud, including parallelization and cost optimization; interactive analysis in the cloud using Jupyter notebooks; and secure collaboration and computational reproducibility using Terra.


Genomics in the Cloud

Author: Geraldine A. Van der Auwera

Publisher: O'Reilly Media, Inc.

Published: 2020-04-02

Total Pages: 570

ISBN-10: 1491975148

Data in the genomics field is booming. In just a few years, organizations such as the National Institutes of Health (NIH) will host 50+ petabytes (over 50 million gigabytes) of genomic data, and they’re turning to cloud infrastructure to make that data available to the research community. How do you adapt analysis tools and protocols to access and analyze that volume of data in the cloud? With this practical book, researchers will learn how to work with genomics algorithms using open source tools including the Genome Analysis Toolkit (GATK), Docker, WDL, and Terra. Geraldine Van der Auwera, longtime custodian of the GATK user community, and Brian O’Connor of the UC Santa Cruz Genomics Institute guide you through the process. You’ll learn by working with real data and genomics algorithms from the field. This book covers: essential genomics and computing technology background; basic cloud computing operations; getting started with GATK, plus three major GATK Best Practices pipelines; automating analysis with scripted workflows using WDL and Cromwell; scaling up workflow execution in the cloud, including parallelization and cost optimization; interactive analysis in the cloud using Jupyter notebooks; and secure collaboration and computational reproducibility using Terra.


Getting Down to Earth

Author: World Bank

Publisher: World Bank Publications

Published: 2022-01-14

Total Pages: 195

ISBN-10: 1464817278

Outdoor air pollution accounts for an estimated 4.2 million deaths worldwide, caused predominantly by exposure to fine aerosols. This report investigates the performance of satellites for predicting outdoor concentrations of PM2.5, the most harmful air pollutant to human health, in low- and middle-income countries.


Water Resources Research in Northwest China

Author: Yaning Chen

Publisher: Springer Science & Business Media

Published: 2014-03-23

Total Pages: 466

ISBN-10: 940178017X

This book examines the possible impacts of climate change on hydrology and water resources in Northwest China, one of the world’s largest arid regions. The first chapter offers an introductory discussion of the physical geography and socioeconomic conditions in the region. Chapters 2 through 7 discuss the climate system and hydrologic system changes in the region, and assess some implications of these changes in relation to potential evapotranspiration, the hydrological cycle and spatiotemporal variations of the snow cover and glaciers as measured via remote sensing, geographic information systems, and statistical analysis. Chapters 8 and 9 focus on model description and experimental design for interpreting the hydro-climatic process, emphasizing the integration of water, climate, and land ecosystems through field observations and computer-based simulations. Chapter 10 examines some extreme hydrological events and presents a study using the historical trend method to investigate the spatial and temporal variability of changing temperature and precipitation extremes in the hyper-arid region of Northwest China. A concluding chapter discusses possible strategies for sustainable watershed management. The contributors are acknowledged experts who bring broad, relevant experience in water resources research in China’s cold and arid regions. The lessons of this volume will prove useful for understanding arid areas elsewhere in the world.