Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015

Author: Robert Burduk

Publisher: Springer

Published: 2016-03-05

Total Pages: 827

ISBN-10: 3319262270

Computer recognition systems are currently among the most promising directions in artificial intelligence. This book is the most comprehensive study of this field. It contains a collection of 79 carefully selected articles contributed by experts in pattern recognition. It reports on current research with respect to both methodology and applications. In particular, it includes the following sections: Features, learning, and classifiers; Biometrics; Data Stream Classification and Big Data Analytics; Image processing and computer vision; Medical applications; Applications; and RGB-D perception: recent developments and applications. This book is a great reference tool for scientists who deal with the problems of designing computer pattern recognition systems. Its target readers include researchers as well as students of computer science, artificial intelligence, or robotics.


Modal Array Signal Processing: Principles and Applications of Acoustic Wavefield Decomposition

Author: Heinz Teutsch

Publisher: Springer

Published: 2007-05-10

Total Pages: 267

ISBN-10: 3540408967

This book deals with the problem of detecting and localizing multiple simultaneously active wideband acoustic sources by applying the notion of wavefield decomposition using circular and spherical microphone arrays. A rigorous derivation of modal array signal processing algorithms for unambiguous source detection and localization, as well as performance evaluations by means of measurements using an actual real-time capable implementation, are discussed.


Academic Press Library in Signal Processing

Author:

Publisher: Academic Press

Published: 2013-09-14

Total Pages: 1131

ISBN-10: 0123972256

This fourth volume, edited and authored by world leading experts, gives a review of the principles, methods and techniques of important and emerging research topics and technologies in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing.

With this reference source you will:
- Quickly grasp a new area of research
- Understand the underlying principles of a topic and its application
- Ascertain how a topic relates to other areas and learn of the research issues yet to be resolved

- Quick tutorial reviews of important and emerging topics of research in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing
- Presents core principles and shows their application
- Reference content on core principles, technologies, algorithms and applications
- Comprehensive references to journal articles and other literature on which to build further, more specific and detailed knowledge
- Edited by leading people in the field who, through their reputation, have been able to commission experts to write on a particular topic


Computational Phonogram Archiving

Author: Rolf Bader

Publisher: Springer

Published: 2019-01-25

Total Pages: 354

ISBN-10: 3030026957

The future of music archiving and search engines lies in deep learning and big data. Music information retrieval algorithms automatically analyze musical features like timbre, melody, rhythm or musical form, and artificial intelligence then sorts and relates these features. At the first International Symposium on Computational Ethnomusicological Archiving held on November 9 to 11, 2017 at the Institute of Systematic Musicology in Hamburg, Germany, a new Computational Phonogram Archiving standard was discussed as an interdisciplinary approach. Ethnomusicologists, music and computer scientists, systematic musicologists as well as music archivists, composers and musicians presented tools, methods and platforms and shared fieldwork and archiving experiences in the fields of musical acoustics, informatics, music theory as well as on music storage, reproduction and metadata. The Computational Phonogram Archiving standard is also in high demand in the music market as a search engine for music consumers. This book offers a comprehensive overview of the field written by leading researchers around the globe.


Parametric Time-Frequency Domain Spatial Audio

Author: Ville Pulkki

Publisher: John Wiley & Sons

Published: 2017-10-04

Total Pages: 412

ISBN-10: 111925258X

A comprehensive guide that addresses the theory and practice of spatial audio.

This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts, starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction in which spatial audio research is heading.

Parametric Time-frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming, covering the capturing and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. This book teaches readers the tools needed for such processing and provides an overview of existing research. It also shows recent projects and commercial applications built on top of the systems.

- Provides an in-depth presentation of the principles, past developments, state-of-the-art methods, and future research directions of spatial audio technologies
- Includes contributions from leading researchers in the field
- Offers MATLAB codes with selected chapters

An advanced book aimed at readers who are capable of digesting mathematical expressions about digital signal processing and sound field analysis, Parametric Time-frequency Domain Spatial Audio is best suited for researchers in academia and in the audio industry.


Intelligent Data analysis and its Applications, Volume II

Author: Jeng-Shyang Pan

Publisher: Springer

Published: 2014-06-05

Total Pages: 583

ISBN-10: 3319077732

This volume presents the proceedings of the First Euro-China Conference on Intelligent Data Analysis and Applications (ECC 2014), which was hosted by Shenzhen Graduate School of Harbin Institute of Technology and was held in Shenzhen City on June 13-15, 2014. ECC 2014 was technically co-sponsored by Shenzhen Municipal People’s Government, IEEE Signal Processing Society, Machine Intelligence Research Labs, VSB-Technical University of Ostrava (Czech Republic), National Kaohsiung University of Applied Sciences (Taiwan), and Secure E-commerce Transactions (Shenzhen) Engineering Laboratory of Shenzhen Institute of Standards and Technology.


Latent Variable Analysis and Signal Separation

Author: Vincent Vigneron

Publisher: Springer Science & Business Media

Published: 2010-09-27

Total Pages: 672

ISBN-10: 364215994X

This volume collects the papers presented at the 9th International Conference on Latent Variable Analysis and Signal Separation, LVA/ICA 2010. The conference was organized by INRIA, the French National Institute for Computer Science and Control, and was held in Saint-Malo, France, September 27–30, 2010, at the Palais du Grand Large. Ten years after the first workshop on Independent Component Analysis (ICA) in Aussois, France, the series of ICA conferences has shown the liveliness of the community of theoreticians and practitioners working in this field. While ICA and blind signal separation have become mainstream topics, new approaches have emerged to solve problems involving signal mixtures or various other types of latent variables: semi-blind models, matrix factorization using sparse component analysis, non-negative matrix factorization, probabilistic latent semantic indexing, tensor decompositions, independent vector analysis, independent subspace analysis, and so on. To reflect this evolution towards more general latent variable analysis problems in signal processing, the ICA International Steering Committee decided to rename the 9th instance of the conference LVA/ICA. From more than a hundred submitted papers, 25 were accepted as oral presentations and 53 as poster presentations. The content of this volume follows the conference schedule, resulting in 14 chapters. The papers collected in this volume demonstrate that the research activity in the field continues to range from abstract concepts to the most concrete and applicable questions and considerations. Speech and audio, as well as biomedical applications, continue to carry the mass of the applications considered.


Speech Dereverberation

Author: Patrick A. Naylor

Publisher: Springer Science & Business Media

Published: 2010-07-27

Total Pages: 388

ISBN-10: 1849960569

Speech Dereverberation gathers an overview, a mathematical formulation of the problem, and the state-of-the-art solutions for dereverberation. It presents current approaches to the problem of reverberation, provides a review of topics in room acoustics, and describes performance measures for dereverberation. The algorithms are then explained with mathematical analysis and examples that enable the reader to see the strengths and weaknesses of the various techniques, as well as giving an understanding of the questions still to be addressed. Techniques rooted in speech enhancement are included, in addition to a treatment of multichannel blind acoustic system identification and inversion. The TRINICON framework is shown in the context of dereverberation to be a generalization of the signal processing for a range of analysis and enhancement techniques. Speech Dereverberation is suitable for students at master's and doctoral level, as well as established researchers.


Timbre: Acoustics, Perception, and Cognition

Author: Kai Siedenburg

Publisher: Springer

Published: 2019-05-07

Total Pages: 392

ISBN-10: 3030148327

Roughly defined as any property other than pitch, duration, and loudness that allows two sounds to be distinguished, timbre is a foundational aspect of hearing. The remarkable ability of humans to recognize sound sources and events (e.g., glass breaking, a friend’s voice, a tone from a piano) stems primarily from a capacity to perceive and process differences in the timbre of sounds. Timbre raises many important issues in psychology and the cognitive sciences, musical acoustics, speech processing, medical engineering, and artificial intelligence. Current research on timbre perception unfolds along three main fronts. First, researchers explore the principal perceptual processes that orchestrate timbre processing, such as the structure of its perceptual representation, sound categorization and recognition, memory for timbre, and its ability to elicit rich semantic associations, as well as the underlying neural mechanisms. Second, timbre is studied as part of specific scenarios, including the perception of the human voice, as a structuring force in music, as perceived with cochlear implants, and through its role in affecting sound quality and sound design. Third, computational acoustic models are sought through prediction of psychophysical data, physiologically inspired representations, and audio analysis-synthesis techniques. Along these three scientific fronts, significant breakthroughs have been achieved during the last decade. This volume will be the first book dedicated to a comprehensive and authoritative presentation of timbre perception and cognition research and the acoustic modeling of timbre. The volume will serve as a natural complement to the SHAR volumes on the basic auditory parameters of Pitch, edited by Plack, Oxenham, Popper, and Fay, and Loudness, edited by Florentine, Popper, and Fay.
Moreover, through the integration of complementary scientific methods ranging from signal processing to brain imaging, the book has the potential to leverage new interdisciplinary synergies in hearing science. For these reasons, the volume will be exceptionally valuable to various subfields of hearing science, including cognitive auditory neuroscience, psychoacoustics, music perception and cognition, but may even exert significant influence on fields such as musical acoustics, music information retrieval, and acoustic signal processing. It is expected that the volume will have broad appeal to psychologists, neuroscientists, and acousticians involved in research on auditory perception and cognition. Specifically, this book will have a strong impact on hearing researchers with interest in timbre and will serve as the key publication and up-to-date reference on timbre for graduate students, postdoctoral researchers, as well as established scholars.