Mining Very Large Databases with Parallel Processing

Mining Very Large Databases with Parallel Processing

Author: Alex A. Freitas

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 211

ISBN-13: 1461555213

DOWNLOAD EBOOK

Mining Very Large Databases with Parallel Processing addresses the problem of large-scale data mining. It is an interdisciplinary text, describing advances in the integration of three computer science areas, namely `intelligent' (machine learning-based) data mining techniques, relational databases and parallel processing. The basic idea is to use concepts and techniques of the latter two areas - particularly parallel processing - to speed up and scale up data mining algorithms. The book is divided into three parts. The first part presents a comprehensive review of intelligent data mining techniques such as rule induction, instance-based learning, neural networks and genetic algorithms. Likewise, the second part presents a comprehensive review of parallel processing and parallel databases. Each of these parts includes an overview of commercially-available, state-of-the-art tools. The third part deals with the application of parallel processing to data mining. The emphasis is on finding generic, cost-effective solutions for realistic data volumes. Two parallel computational environments are discussed, the first excluding the use of commercial-strength DBMS, and the second using parallel DBMS servers. It is assumed that the reader has a knowledge roughly equivalent to a first degree (BSc) in accurate sciences, so that (s)he is reasonably familiar with basic concepts of statistics and computer science. The primary audience for Mining Very Large Databases with Parallel Processing is industry data miners and practitioners in general, who would like to apply intelligent data mining techniques to large amounts of data. The book will also be of interest to academic researchers and postgraduate students, particularly database researchers, interested in advanced, intelligent database applications, and artificial intelligence researchers interested in industrial, real-world applications of machine learning.


Proceedings 1989 VLDB Conference

Proceedings 1989 VLDB Conference

Author: Petrus Maria Gerardus Apers

Publisher: Morgan Kaufmann

Published: 1989-12

Total Pages: 488

ISBN-13: 9781558601017

DOWNLOAD EBOOK

Proceedings of the 15th International Conference (see title), August 1989, Amsterdam, The Netherlands. Contains forty-five papers from worldwide contributors which explore fundamental issues and current developments parallelism, interfaces, statistics, and programming languages.


Practical Guide to Large Database Migration

Practical Guide to Large Database Migration

Author: Preston Zhang

Publisher: CRC Press

Published: 2019-03-27

Total Pages: 262

ISBN-13: 042974952X

DOWNLOAD EBOOK

It is a major challenge to migrate very large databases from one system, say for example, to transfer critical data from Oracle to SQL Server. One has to consider several issues such as loss of data being transferred, the security of the data, the cost and effort, technical aspects of the software involved, etc. There a very few books that provide practical tools and the methodology to migrate data from one vendor to another. This book introduces the concepts in database migration with large sample databases. It provides step by step guides and screenshots for database migration tools. Many examples are shown for migrating Oracle, SQL Server and MySQL databases.


Effective Databases for Text & Document Management

Effective Databases for Text & Document Management

Author: Shirley A. Becker

Publisher: IGI Global

Published: 2003-01-01

Total Pages: 390

ISBN-13: 9781931777476

DOWNLOAD EBOOK

"Focused on the latest research on text and document management, this guide addresses the information management needs of organizations by providing the most recent findings. How the need for effective databases to house information is impacting organizations worldwide and how some organizations that possess a vast amount of data are not able to use the data in an economic and efficient manner is demonstrated. A taxonomy for object-oriented databases, metrics for controlling database complexity, and a guide to accommodating hierarchies in relational databases are provided. Also covered is how to apply Java-triggers for X-Link management and how to build signatures."


Advances in Database Technology - EDBT 2004

Advances in Database Technology - EDBT 2004

Author: Elisa Bertino

Publisher: Springer

Published: 2004-02-12

Total Pages: 895

ISBN-13: 3540247416

DOWNLOAD EBOOK

The 9th International Conference on Extending Database Technology, EDBT 2004, was held in Heraklion, Crete, Greece, during March 14–18, 2004. The EDBT series of conferences is an established and prestigious forum for the exchange of the latest research results in data management. Held every two years in an attractive European location, the conference provides unique opp- tunities for database researchers, practitioners, developers, and users to explore new ideas, techniques, and tools, and to exchange experiences. The previous events were held in Venice, Vienna, Cambridge, Avignon, Valencia, Konstanz, and Prague. EDBT 2004 had the theme “new challenges for database technology,” with the goal of encouraging researchers to take a greater interest in the current exciting technological and application advancements and to devise and address new research and development directions for database technology. From its early days, database technology has been challenged and advanced by new uses and applications, and it continues to evolve along with application requirements and hardware advances. Today’s DBMS technology faces yet several new challenges. Technological trends and new computation paradigms, and applications such as pervasive and ubiquitous computing, grid computing, bioinformatics, trust management, virtual communities, and digital asset management, to name just a few, require database technology to be deployed in a variety of environments and for a number of di?erent purposes. Such an extensive deployment will also require trustworthy, resilient database systems, as well as easy-to-manage and ?exible ones, to which we can entrust our data in whatever form they are.


Principles of Distributed Database Systems

Principles of Distributed Database Systems

Author: M. Tamer Özsu

Publisher: Springer Science & Business Media

Published: 2011-02-24

Total Pages: 856

ISBN-13: 1441988343

DOWNLOAD EBOOK

This third edition of a classic textbook can be used to teach at the senior undergraduate and graduate levels. The material concentrates on fundamental theories as well as techniques and algorithms. The advent of the Internet and the World Wide Web, and, more recently, the emergence of cloud computing and streaming data applications, has forced a renewal of interest in distributed and parallel data management, while, at the same time, requiring a rethinking of some of the traditional techniques. This book covers the breadth and depth of this re-emerging field. The coverage consists of two parts. The first part discusses the fundamental principles of distributed data management and includes distribution design, data integration, distributed query processing and optimization, distributed transaction management, and replication. The second part focuses on more advanced topics and includes discussion of parallel database systems, distributed object management, peer-to-peer data management, web data management, data stream systems, and cloud computing. New in this Edition: • New chapters, covering database replication, database integration, multidatabase query processing, peer-to-peer data management, and web data management. • Coverage of emerging topics such as data streams and cloud computing • Extensive revisions and updates based on years of class testing and feedback Ancillary teaching materials are available.


Active and Real-Time Database Systems (ARTDB-95)

Active and Real-Time Database Systems (ARTDB-95)

Author: Mikael Berndtsson

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 276

ISBN-13: 1447130804

DOWNLOAD EBOOK

The areas of active and real-time databases have seen a tremendous growth of interest in the past few years, particularly with regard to their support of time-critical and embedded applications. ARTDB-95 provided, therefore, an important forum for researchers from both communities to discuss research results, and also to chart new directions for the future. As well as the 11 submitted papers presented at the workshop, this volume also contains 4 invited papers on the following topics: the impact of active databases on commercial practice; the optimization of active database transactions; the need for better language, compiler and tool support for real-time databases; and the origin of time constraints associated with data, events and actions. Together the papers give a comprehensive overview of current research, and will provide invaluable reading for academic and industrial researchers and students at both undergraduate and postgraduate level.