Recent advances in data collection, storage technologies, and computing power have made it possible for companies, government agencies and scientific laboratories to keep and manipulate vast amounts of data relating to their activities. This state-of-the-art monograph discusses essential algorithms for sophisticated data mining methods used with large-scale databases, focusing on two key topics: association rules and sequential pattern discovery. This will be an essential book for practitioners and professionals in computer science and computer engineering.
This comprehensive reference consists of 18 chapters from prominent researchers in the field. Each chapter is self-contained, and synthesizes one aspect of frequent pattern mining. An emphasis is placed on simplifying the content, so that students and practitioners can benefit from the book. Each chapter contains a survey describing key research on the topic, a case study and future directions. Key topics include: Pattern Growth Methods, Frequent Pattern Mining in Data Streams, Mining Graph Patterns, Big Data Frequent Pattern Mining, Algorithms for Data Clustering and more. Advanced-level students in computer science, researchers and practitioners from industry will find this book an invaluable reference.
This book constitutes the refereed proceedings of the 25th Canadian Conference on Artificial Intelligence, Canadian AI 2012, held in Toronto, Canada, in May 2012. The 23 regular papers, 16 short papers, and 4 papers from the Graduate Student Symposium presented were carefully reviewed and selected for inclusion in this book. The papers cover a broad range of topics presenting original work in all areas of artificial intelligence, either theoretical or applied.
This book constitutes the refereed proceedings of the Third European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD'99, held in Prague, Czech Republic in September 1999. The 28 revised full papers and 48 poster presentations were carefully reviewed and selected from 106 full papers submitted. The papers are organized in topical sections on time series, applications, taxonomies and partitions, logic methods, distributed and multirelational databases, text mining and feature selection, rules and induction, and interesting and unusual issues.
This book provides an introduction to the field of periodic pattern mining, reviews state-of-the-art techniques, discusses recent advances, and reviews open-source software. Periodic pattern mining is a popular and emerging research area in the field of data mining. It involves discovering all regularly occurring patterns in temporal databases. One of the major applications of periodic pattern mining is the analysis of customer transaction databases to discover sets of items that have been regularly purchased by customers. Discovering such patterns has several implications for understanding the behavior of customers. Since the first work on periodic pattern mining, numerous studies have been published and great advances have been made in this field. The book consists of three main parts: introduction, algorithms, and applications. The first chapter is an introduction to pattern mining and periodic pattern mining. The concepts of periodicity, periodic support, search space exploration techniques, and pruning strategies are discussed. The main types of algorithms are also presented such as periodic-frequent pattern growth, partial periodic pattern-growth, and periodic high-utility itemset mining algorithm. Challenges and research opportunities are reviewed. The chapters that follow present state-of-the-art techniques for discovering periodic patterns in (1) transactional databases, (2) temporal databases, (3) quantitative temporal databases, and (4) big data. Then, the theory on concise representations of periodic patterns is presented, as well as hiding sensitive information using privacy-preserving data mining techniques. The book concludes with several applications of periodic pattern mining, including applications in air pollution data analytics, accident data analytics, and traffic congestion analytics.
This book presents an overview of techniques for discovering high-utility patterns (patterns with a high importance) in data. It introduces the main types of high-utility patterns, as well as the theory and core algorithms for high-utility pattern mining, and describes recent advances, applications, open-source software, and research opportunities. It also discusses several types of discrete data, including customer transaction data and sequential data. The book consists of twelve chapters, seven of which are surveys presenting the main subfields of high-utility pattern mining, including itemset mining, sequential pattern mining, big data pattern mining, metaheuristic-based approaches, privacy-preserving pattern mining, and pattern visualization. The remaining five chapters describe key techniques and applications, such as discovering concise representations and regular patterns.
This book presents the refereed proceedings of the Fifth International Conference on Extending Database Technology, EDBT'96, held in Avignon, France in March 1996. The 31 full revised papers included were selected from a total of 178 submissions; also included are some industrial-track papers, contributed by partners of several ESPRIT projects. The volume is organized in topical sections on data mining, active databases, design tools, advanced DBMS, optimization, warehousing, system issues, temporal databases, the web and hypermedia, performance, workflow management, database design, and parallel databases.
Liu has written a comprehensive text on Web mining, which consists of two parts. The first part covers the data mining and machine learning foundations, where all the essential concepts and algorithms of data mining and machine learning are presented. The second part covers the key topics of Web mining, where Web crawling, search, social network analysis, structured data extraction, information integration, opinion mining and sentiment analysis, Web usage mining, query log mining, computational advertising, and recommender systems are all treated both in breadth and in depth. His book thus brings all the related concepts and algorithms together to form an authoritative and coherent text. The book offers a rich blend of theory and practice. It is suitable for students, researchers and practitioners interested in Web mining and data mining both as a learning text and as a reference book. Professors can readily use it for classes on data mining, Web mining, and text mining. Additional teaching materials such as lecture slides, datasets, and implemented algorithms are available online.
There is no royal road to science, and only those who do not dread the fatiguing climb of its steep paths have a chance of gaining its luminous summits. Karl Marx A Universial Genius of the 19th Century Many scientists from all over the world during the past two years since the MLDM 2007 have come along on the stony way to the sunny summit of science and have worked hard on new ideas and applications in the area of data mining in pattern r- ognition. Our thanks go to all those who took part in this year's MLDM. We appre- ate their submissions and the ideas shared with the Program Committee. We received over 205 submissions from all over the world to the International Conference on - chine Learning and Data Mining, MLDM 2009. The Program Committee carefully selected the best papers for this year’s program and gave detailed comments on each submitted paper. There were 63 papers selected for oral presentation and 17 papers for poster presentation. The topics range from theoretical topics for classification, clustering, association rule and pattern mining to specific data-mining methods for the different multimedia data types such as image mining, text mining, video mining and Web mining. Among these topics this year were special contributions to subtopics such as attribute discre- zation and data preparation, novelty and outlier detection, and distances and simila- ties.