Multi-dimensional clustering for data base organizations
Author: Purdue University. Department of Computer Sciences
Publisher:
Published: 1976
Total Pages:
ISBN-13:
DOWNLOAD EBOOKRead and Download eBook Full
Author: Purdue University. Department of Computer Sciences
Publisher:
Published: 1976
Total Pages:
ISBN-13:
DOWNLOAD EBOOKAuthor: Sakti P. Ghosh
Publisher: Springer Science & Business Media
Published: 2012-12-06
Total Pages: 615
ISBN-13: 1461318815
DOWNLOAD EBOOKFoundations of data organization is a relatively new field of research in comparison to, other branches of science. It is close to twenty years old. In this short life span of this branch of computer science, it has spread to all corners of the world, which is reflected in this book. This book covers new database application areas (databases for advanced applications and CAD/VLSI databases), computational geometry, file allocation & distributed databases, database models (including non traditional database models), database machines, query processing & physical structures for relational databases, besides traditional file organization (hashing, index file organization, mathematical file organization and consecutive retrieval property), in order to identify new trends of database research. The papers in this book originally represent talks given at the International Conference on Foundations of Data Organization, which was held on May 21-24, 1985, in Kyoto, Japan. This conference was held at Kyoto University, and sponsored by the organizing committee of the International Conference on Foundations of Data Organization and the Japan Society for the Promotion of Science. The conference was in cooperation with: ACM SIGMOD, IEEE Computer Society, Information Processing Society of Japan, IBM Research, Kyushu University, Kobe University, IBM Japan, Kyoto Sangyo University and Polish Academy of Sciences. This Conference was the follow-up of the first conference, which was hosted by the Polish Academy of Sciences and held at Warsaw in 1981. The Warsaw conference focused mainly on consecutive retrieval property and it's applications.
Author: Bhattacharyya, Siddhartha
Publisher: IGI Global
Published: 2016-11-29
Total Pages: 471
ISBN-13: 1522517774
DOWNLOAD EBOOKData mining analysis techniques have undergone significant developments in recent years. This has led to improved uses throughout numerous functions and applications. Intelligent Multidimensional Data Clustering and Analysis is an authoritative reference source for the latest scholarly research on the advantages and challenges presented by the use of cluster analysis techniques. Highlighting theoretical foundations, computing paradigms, and real-world applications, this book is ideally designed for researchers, practitioners, upper-level students, and professionals interested in the latest developments in cluster analysis for large data sets.
Author: Akifumi Makinouchi
Publisher: World Scientific
Published: 1992-09-21
Total Pages: 568
ISBN-13: 9814554588
DOWNLOAD EBOOKThis volume contains 64 papers from contributors around the world on a wide range of topics in database systems research. Of special mention are the papers describing the practical experiences of developing and implementing some of the many useful database systems on the market. Readers should find useful new ideas from the proceedings of this international symposium.
Author: Sam S. Lightstone
Publisher:
Published: 2003
Total Pages: 210
ISBN-13:
DOWNLOAD EBOOKAuthor:
Publisher:
Published: 2006
Total Pages: 229
ISBN-13:
DOWNLOAD EBOOKThe generation of multi-dimensional data has proceeded at an explosive rate in many disciplines with the advance of modern technology, which greatly increases the challenges of comprehending and interpreting the resulting mass of data. Existing data analysis techniques have difficulty in handling multi-dimensional data. Multi-dimensional data has been a challenge for data analysis because of the inherent sparsity of the points. first step toward addressing this challenge is the use of clustering techniques, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. Cluster analysis is used to identify homogeneous and well-separated groups of objects in databases. The need to cluster large quantities of multi-dimensional data is widely recognized. It is a classical problem in the database, artificial intelligence, and theoretical literature, and plays an important role in many fields of business and science. There are also a lot of approaches designed for outlier detection. In many situations, clusters and outliers are concepts whose meanings are inseparable to each other, especially for those data sets with noise. Thus, it is necessary to treat clusters and outliers as concepts of the same importance in data analysis. It is well acknowledged that in the real world a large proportion of data has irrelevant features which may cause a reduction in the accuracy of some algorithms. High dimensional data sets continue to pose a challenge to clustering algorithms at a very fundamental level. One of the well known techniques for improving the data analysis performance is the method of dimension reduction which is often used in clustering, classification, and many other machine learning and data mining applications. Many approaches have been proposed to index high-dimensional data sets for efficient querying. Although most of them can efficiently support nearest neighbor search for low dimensional data sets, they degrade rapidly when dimensionality goes higher. Also the dynamic insertion of new data can cause original structures no longer handle the data sets efficiently since it may greatly increase the amount of data accessed for a query. In this dissertation, we study the problems mentioned above. We proposed a novel data pre-processing technique called shrinking which optimizes the inner structure of data inspired by Newton's Universal Law of Gravitation in the real world. We then proposed a shrinking-based clustering algorithm for multi-dimensional data and extended the algorithm to the dimension reduction field, resulting in a shrinking-based dimension reduction algorithm. (Abstract shortened by UMI.).
Author: Maurizio Rafanelli
Publisher: Springer Science & Business Media
Published: 1989-02-08
Total Pages: 468
ISBN-13: 9783540505754
DOWNLOAD EBOOKThe Fourth International Working Conference on Statistical and Scientific Data Base Management (IV SSDBM) held on June 21-23, 1988 in Rome, Italy, continued the series of conferences initiated in California in December 1981. The purpose of this conference was to bring together database researchers, users and system builders, working in this specific field, to discuss the particular points of interest, to propose new solutions to the problems of the domain and to expand the topics of the previous conferences, both from the theoretical and from the applicational point of view. The papers of four scientific sessions dealt with the following topics: knowledge base and expert system, data model, natural language processing, query language, time performance, user interface, heterogeneous data classification, storage constraints, automatic drawing, ranges and trackers, and arithmetic coding. Two other special sessions presented work on progress papers on geographical data modelling, spatial database queries, user interface in an Object Oriented SDB, interpretation of queries, graphical query language and knowledge browsing front ends. The conference also had three invited papers on topics of particular interest such as "Temporal Data", "Statistical Data Management Requirements" and "Knowledge Based Decision Support Systems", included in this volume. The introductory paper by M. Rafanelli provides both an introduction to the general concepts helpful to people outside the field and a survey of all the papers in these Proceedings. Furthermore, there were three open panels. Papers by the chairmen, contributions of the panelists and a summary of the respective discussions are included in this volume, too.
Author: Jacob Kogan
Publisher: Springer Science & Business Media
Published: 2006-02-08
Total Pages: 273
ISBN-13: 3540283498
DOWNLOAD EBOOKClustering is one of the most fundamental and essential data analysis techniques. Clustering can be used as an independent data mining task to discern intrinsic characteristics of data, or as a preprocessing step with the clustering results then used for classification, correlation analysis, or anomaly detection. Kogan and his co-editors have put together recent advances in clustering large and high-dimension data. Their volume addresses new topics and methods which are central to modern data analysis, with particular emphasis on linear algebra tools, opimization methods and statistical techniques. The contributions, written by leading researchers from both academia and industry, cover theoretical basics as well as application and evaluation of algorithms, and thus provide an excellent state-of-the-art overview. The level of detail, the breadth of coverage, and the comprehensive bibliography make this book a perfect fit for researchers and graduate students in data mining and in many other important related application areas.
Author: Francesco Palumbo
Publisher: Springer
Published: 2017-07-04
Total Pages: 346
ISBN-13: 3319557238
DOWNLOAD EBOOKThis edited volume on the latest advances in data science covers a wide range of topics in the context of data analysis and classification. In particular, it includes contributions on classification methods for high-dimensional data, clustering methods, multivariate statistical methods, and various applications. The book gathers a selection of peer-reviewed contributions presented at the Fifteenth Conference of the International Federation of Classification Societies (IFCS2015), which was hosted by the Alma Mater Studiorum, University of Bologna, from July 5 to 8, 2015.