The book collects the short papers presented at the 13th Scientific Meeting of the Classification and Data Analysis Group (CLADAG) of the Italian Statistical Society (SIS). The meeting has been organized by the Department of Statistics, Computer Science and Applications of the University of Florence, under the auspices of the Italian Statistical Society and the International Federation of Classification Societies (IFCS). CLADAG is a member of the IFCS, a federation of national, regional, and linguistically-based classification societies. It is a non-profit, non-political scientific organization, whose aims are to further classification research.
This book includes a wide selection of papers presented at the 50th Scientific Meeting of the Italian Statistical Society (SIS2021), held virtually on 21-25 June 2021. It covers a wide variety of subjects ranging from methodological and theoretical contributions to applied works and case studies, giving an excellent overview of the interests of the Italian statisticians and their international collaborations. Intended for researchers interested in theoretical and empirical issues, this volume provides interesting starting points for further research.
This book constitutes the thoroughly refereed proceedings of the 7th International Conference on Mathematics and Computation in Music, MCM 2019, held in Madrid, Spain, in June 2019. The 22 full papers and 10 short papers presented were carefully reviewed and selected from 48 submissions. The papers feature research that combines mathematics or computation with music theory, music analysis, composition, and performance. They are organized in topical sections on algebraic and other abstract mathematical approaches to understanding musical objects; remanaging Riemann: mathematical music theory as “experimental philosophy”?; octave division; computer-based approaches to composition and score structuring; models for music cognition and beat tracking; pedagogy of mathematical music theory. The chapter “Distant Neighbors and Interscalar Contiguities” is available open access under a Creative Commons Attribution 4.0 International License via link.springer.com.
As a generalization of simple correspondence analysis, multiple correspondence analysis (MCA) is a powerful technique for handling larger, more complex datasets, including the high-dimensional categorical data often encountered in the social sciences, marketing, health economics, and biomedical research. Until now, however, the literature on the su
Data mining can be defined as the process of selection, explorationand modelling of large databases, in order to discover models andpatterns. The increasing availability of data in the currentinformation society has led to the need for valid tools for itsmodelling and analysis. Data mining and applied statistical methodsare the appropriate tools to extract such knowledge from data.Applications occur in many different fields, including statistics,computer science, machine learning, economics, marketing andfinance. This book is the first to describe applied data mining methodsin a consistent statistical framework, and then show how they canbe applied in practice. All the methods described are eithercomputational, or of a statistical modelling nature. Complexprobabilistic models and mathematical tools are not used, so thebook is accessible to a wide audience of students and industryprofessionals. The second half of the book consists of nine casestudies, taken from the author's own work in industry, thatdemonstrate how the methods described can be applied to realproblems. Provides a solid introduction to applied data mining methods ina consistent statistical framework Includes coverage of classical, multivariate and Bayesianstatistical methodology Includes many recent developments such as web mining,sequential Bayesian analysis and memory based reasoning Each statistical method described is illustrated with real lifeapplications Features a number of detailed case studies based on appliedprojects within industry Incorporates discussion on software used in data mining, withparticular emphasis on SAS Supported by a website featuring data sets, software andadditional material Includes an extensive bibliography and pointers to furtherreading within the text Author has many years experience teaching introductory andmultivariate statistics and data mining, and working on appliedprojects within industry A valuable resource for advanced undergraduate and graduatestudents of applied statistics, data mining, computer science andeconomics, as well as for professionals working in industry onprojects involving large volumes of data - such as in marketing orfinancial risk management.
Cluster analysis finds groups in data automatically. Most methods have been heuristic and leave open such central questions as: how many clusters are there? Which method should I use? How should I handle outliers? Classification assigns new observations to groups given previously classified observations, and also has open questions about parameter tuning, robustness and uncertainty assessment. This book frames cluster analysis and classification in terms of statistical models, thus yielding principled estimation, testing and prediction methods, and sound answers to the central questions. It builds the basic ideas in an accessible but rigorous way, with extensive data examples and R code; describes modern approaches to high-dimensional data and networks; and explains such recent advances as Bayesian regularization, non-Gaussian model-based clustering, cluster merging, variable selection, semi-supervised and robust classification, clustering of functional data, text and images, and co-clustering. Written for advanced undergraduates in data science, as well as researchers and practitioners, it assumes basic knowledge of multivariate calculus, linear algebra, probability and statistics.
The increasing availability of data in our current, information overloaded society has led to the need for valid tools for its modelling and analysis. Data mining and applied statistical methods are the appropriate tools to extract knowledge from such data. This book provides an accessible introduction to data mining methods in a consistent and application oriented statistical framework, using case studies drawn from real industry projects and highlighting the use of data mining methods in a variety of business applications. Introduces data mining methods and applications. Covers classical and Bayesian multivariate statistical methodology as well as machine learning and computational data mining methods. Includes many recent developments such as association and sequence rules, graphical Markov models, lifetime value modelling, credit risk, operational risk and web mining. Features detailed case studies based on applied projects within industry. Incorporates discussion of data mining software, with case studies analysed using R. Is accessible to anyone with a basic knowledge of statistics or data analysis. Includes an extensive bibliography and pointers to further reading within the text. Applied Data Mining for Business and Industry, 2nd edition is aimed at advanced undergraduate and graduate students of data mining, applied statistics, database management, computer science and economics. The case studies will provide guidance to professionals working in industry on projects involving large volumes of data, such as customer relationship management, web design, risk management, marketing, economics and finance.
This book includes 40 peer-reviewed short papers submitted to the Scientific Conference titled Statistics and Information Systems for Policy Evaluation, aimed at promoting new statistical methods and applications for the evaluation of policies and organized by the Association for Applied Statistics (ASA) and the Dept. of Statistics, Computer Science, Applications DiSIA “G. Parenti” of the University of Florence, jointly with the partners AICQ (Italian Association for Quality Culture), AICQ-CN (Italian Association for Quality Culture North and Centre of Italy), AISS (Italian Academy for Six Sigma), ASSIRM (Italian Association for Marketing, Social and Opinion Research), Comune di Firenze, the SIS – Italian Statistical Society, Regione Toscana and Valmon – Evaluation & Monitoring.
This book includes 25 peer-reviewed short papers submitted to the Scientific Opening Conference titled “Statistics and Information Systems for Policy Evaluation”, aimed at promoting new statistical methods and applications for the evaluation of policies and organized by the Association for Applied Statistics (ASA) and the Department of Statistics, Computer Science, Applications DiSIA “G. Parenti” of the University of Florence, jointly with the partners AICQ (Italian Association for Quality Culture), AICQ-CN (Italian Association for Quality Culture North and Centre of Italy), AISS (Italian Academy for Six Sigma), ASSIRM (Italian Association for Marketing, Social and Opinion Research), Comune di Firenze, the SIS – Italian Statistical Society, Regione Toscana and Valmon – Evaluation & Monitoring.
This work is the critical edition of the catalog of Holy Roman Emperor Leopold’s II mineralogical collection. The volume, unpublished and preserved at the Historical Archives of the University of Firenze Museum System, dates to 1765 and describes 242 mineralogical specimens coming primarily from the current Slovak-Hungarian mining district. This edition gives the transcription of the German manuscript and its translation into English together with an organized system of notation to illustrate the complex history of the text, the characterization of the mineralogical species, and the geographical location of the mineral extraction sites. This work represents to date the only published catalog of a mineralogical collection belonging to a member of the Habsburg-Lorraine family.