Big Data and High Performance Computing

Author: L. Grandinetti

Publisher: IOS Press

Published: 2015-10-20

Total Pages: 168

ISBN-10: 1614995834

Big Data has been much in the news in recent years, and the advantages conferred by the collection and analysis of large datasets in fields such as marketing, medicine and finance have led to claims that almost any real-world problem could be solved if sufficient data were available. This is of course a very simplistic view, and the usefulness of collecting, processing and storing large datasets must always be seen in terms of the communication, processing and storage capabilities of the computing platforms available. This book presents papers from the International Research Workshop, Advanced High Performance Computing Systems, held in Cetraro, Italy, in July 2014. The papers selected for publication here discuss fundamental aspects of the definition of Big Data, as well as considerations from practice where complex datasets are collected, processed and stored. The concepts, problems, methodologies and solutions presented are of much more general applicability than may be suggested by the particular application areas considered. As a result, the book will be of interest to all those whose work involves the processing of very large datasets, exascale computing and the emerging fields of data science.


Helmholtz Portfolio Theme Large-Scale Data Management and Analysis (LSDMA)

Author: Jung, Christopher

Publisher: KIT Scientific Publishing

Published: 2017-09-20

Total Pages: 274

ISBN-10: 3731506955

The Helmholtz Association funded the "Large-Scale Data Management and Analysis" portfolio theme from 2012 to 2016. Four Helmholtz centres, six universities and another research institution in Germany joined to enable data-intensive science by optimising data life cycles in selected scientific communities. In our Data Life Cycle Labs, data experts performed joint R&D together with scientific communities. The Data Services Integration Team focused on generic solutions applied by several communities.


eScience on Distributed Computing Infrastructure

Author: Marian Bubak

Publisher: Springer

Published: 2014-08-25

Total Pages: 547

ISBN-10: 3319108948

To help researchers from different areas of science understand and unlock the potential of the Polish Grid Infrastructure and to define their requirements and expectations, the following 13 pilot communities have been organized and involved in the PLGrid Plus project: Acoustics, AstroGrid-PL, Bioinformatics, Ecology, Energy Sector, Health Sciences, HEPGrid, Life Science, Materials, Metallurgy, Nanotechnologies, Quantum Chemistry and Molecular Physics, and SynchroGrid. The book describes the experience and scientific results achieved by the project partners. Chapters 1 to 8 provide a general overview of research and development activities in the framework of the project, with emphasis on services for different scientific areas, and an update on the status of the PL-Grid infrastructure, describing new developments in security and middleware. Chapters 9 to 13 discuss new environments and services which may be applied by all scientific communities. Chapters 14 to 36 present how the PLGrid Plus environments, tools and services are used in advanced domain-specific computer simulations; these chapters present computational models, new algorithms, and ways in which they are implemented. The book also provides a glossary of terms and concepts. This book may serve as a resource for researchers, developers and system administrators working on efficient exploitation of available e-infrastructures, promoting collaboration and exchange of ideas in the process of constructing a common European e-infrastructure.


Automated Optimization Methods for Scientific Workflows in e-Science Infrastructures

Author: Sonja Holl

Publisher: Forschungszentrum Jülich

Published: 2014

Total Pages: 207

ISBN-10: 389336949X

Scientific workflows have emerged as a key technology that assists scientists with the design, management, execution, sharing and reuse of in silico experiments. Workflow management systems simplify the management of scientific workflows by providing graphical interfaces for their development, monitoring and analysis. Nowadays, e-Science combines such workflow management systems with large-scale data and computing resources into complex research infrastructures. For instance, e-Science allows the conveyance of best-practice research in collaborations by providing workflow repositories, which facilitate the sharing and reuse of scientific workflows. However, scientists still face limitations when reusing workflows. One of the most common challenges is the need to select appropriate applications and their individual execution parameters. If scientists do not want to rely on default or experience-based parameters, the best-effort option is to test different workflow set-ups using either trial-and-error approaches or parameter sweeps. These methods may be inefficient or time-consuming, respectively, especially when tuning a large number of parameters. Therefore, scientists require an effective and efficient mechanism that automatically tests different workflow set-ups in an intelligent way and helps them to improve their scientific results. This thesis addresses the limitation described above by defining and implementing an approach for the optimization of scientific workflows. In the course of this work, scientists' needs are investigated and requirements are formulated, resulting in an appropriate optimization concept. In a following step, this concept is prototypically implemented by extending a workflow management system with an optimization framework, including the general mechanisms required to conduct workflow optimization. As optimization is an ongoing research topic, different algorithms are provided by pluggable extensions (plugins) that can be loosely coupled with the framework, resulting in a generic and quickly extendable system. In this thesis, an exemplary plugin is introduced which applies a Genetic Algorithm for parameter optimization. To accelerate workflow optimization, and thereby make it feasible at all, e-Science infrastructures are utilized for the parallel execution of scientific workflows. This is enabled by additional extensions that allow applications and workflows to be executed on distributed computing resources. The implementation, and with it the general approach to workflow optimization, is experimentally verified by four use cases in the life science domain. All workflows were significantly improved, which demonstrates the advantage of the proposed workflow optimization. Finally, a new collaboration-based approach is introduced that harnesses optimization provenance to make optimization faster and more robust in the future.


Recent Trends in Computer Networks and Distributed Systems Security

Author: Gregorio Martinez Perez

Publisher: Springer

Published: 2014-02-07

Total Pages: 583

ISBN-10: 3642545254

This book constitutes the refereed proceedings of the Second International Conference on Security in Computer Networks and Distributed Systems, SNDS 2014, held in Trivandrum, India, in March 2014. The 32 revised full papers presented together with 9 short papers and 8 workshop papers were carefully reviewed and selected from 129 submissions. The papers are organized in topical sections on security and privacy in networked systems; multimedia security; cryptosystems, algorithms, primitives; system and network security; short papers. The workshop papers were presented at the following workshops: Second International Workshop on Security in Self-Organising Networks (SelfNet 2014); Workshop on Multidisciplinary Perspectives in Cryptology and Information Security (CIS 2014); Second International Workshop on Trust and Privacy in Cyberspace (CyberTrust 2014).


Euro-Par 2013: Parallel Processing Workshops

Author: Dieter an Mey

Publisher: Springer

Published: 2014-04-10

Total Pages: 928

ISBN-10: 3642544207

This book constitutes the thoroughly refereed post-conference proceedings of the workshops of the 19th International Conference on Parallel Computing, Euro-Par 2013, held in Aachen, Germany, in August 2013. The 99 papers presented were carefully reviewed and selected from 145 submissions. The papers come from seven workshops that have been co-located with Euro-Par in previous years:

- BigDataCloud (Second Workshop on Big Data Management in Clouds)
- HeteroPar (11th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms)
- HiBB (Fourth Workshop on High Performance Bioinformatics and Biomedicine)
- OMHI (Second Workshop on On-chip Memory Hierarchies and Interconnects)
- PROPER (Sixth Workshop on Productivity and Performance)
- Resilience (Sixth Workshop on Resiliency in High Performance Computing with Clusters, Clouds, and Grids)
- UCHPC (Sixth Workshop on UnConventional High Performance Computing)

as well as six newcomers:

- DIHC (First Workshop on Dependability and Interoperability in Heterogeneous Clouds)
- FedICI (First Workshop on Federative and Interoperable Cloud Infrastructures)
- LSDVE (First Workshop on Large Scale Distributed Virtual Environments on Clouds and P2P)
- MHPC (Workshop on Middleware for HPC and Big Data Systems)
- PADABS (First Workshop on Parallel and Distributed Agent Based Simulations)
- ROME (First Workshop on Runtime and Operating Systems for the Many-core Era)

All these workshops focus on the promotion and advancement of all aspects of parallel and distributed computing.


Parallel Processing and Applied Mathematics

Author: Roman Wyrzykowski

Publisher: Springer

Published: 2014-05-05

Total Pages: 817

ISBN-10: 3642552242

This two-volume set (LNCS 8384 and 8385) constitutes the refereed proceedings of the 10th International Conference on Parallel Processing and Applied Mathematics, PPAM 2013, held in Warsaw, Poland, in September 2013. The 143 revised full papers presented in both volumes were carefully reviewed and selected from numerous submissions. The papers cover important fields of parallel/distributed/cloud computing and applied mathematics, such as numerical algorithms and parallel scientific computing; parallel non-numerical algorithms; tools and environments for parallel/distributed/cloud computing; applications of parallel computing; applied mathematics, evolutionary computing and metaheuristics.


High-Performance Scientific Computing

Author: Edoardo Di Napoli

Publisher: Springer

Published: 2017-03-01

Total Pages: 267

ISBN-10: 3319538624

This book constitutes the thoroughly refereed post-conference proceedings of the First JARA High-Performance Computing Symposium, JARA-HPC 2016, held in Aachen, Germany, in October 2016. The 21 full papers presented were carefully reviewed and selected from 26 submissions. They cover many diverse topics, such as coupling methods and strategies in Computational Fluid Dynamics (CFD), performance portability and applications in HPC, as well as provenance tracking for large-scale simulations.