1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
Author:
Publisher:
Published: 1997
Total Pages: 766
ISBN-13:
DOWNLOAD EBOOKRead and Download eBook Full
Author:
Publisher:
Published: 1997
Total Pages: 766
ISBN-13:
DOWNLOAD EBOOKAuthor: Sameer Singh
Publisher: Springer Science & Business Media
Published: 2012-12-06
Total Pages: 474
ISBN-13: 1447108337
DOWNLOAD EBOOKInternational Conference on Advances in Pattern Recognition (ICAPR 98) at Plymouth represents an important meeting for advanced research in pattern recognition. There is considerable interest in the areas of image processing, medical imaging, speech recognition, document analysis and character recognition, fuzzy data analysis and neural networks. ICAPR 98 is aimed at providing an international platform for invited research in this multi-disciplinary area. It is expected that the conference will grow in future years to include more research contributions that detail state-of the-art research in pattern recognition. ICAPR 98 attracted contributions from different countries of the highest quality. I should like to thank the programme and organising committee for doing an excellent job in organising this conference. The peer reviewed nature of the conference ensured high quality publications in these proceedings. My personal thanks to Mrs. Barbara Davies who served as conference secretary and worked tirelessly in organising the conference. I thank the organising chair for the local arrangements and our should also key-note, plenary and tutorial speakers for their valuable contributions to the conference. I also thank Springer-Verlag for publishing these proceedings that will be a valuable source of research reference for the readers. Finally, I thank all participants who made this conference successful.
Author: Jens Blauert
Publisher: Springer Nature
Published: 2020-08-12
Total Pages: 808
ISBN-13: 3030003868
DOWNLOAD EBOOKSound, devoid of meaning, would not matter to us. It is the information sound conveys that helps the brain to understand its environment. Sound and its underlying meaning are always associated with time and space. There is no sound without spatial properties, and the brain always organizes this information within a temporal–spatial framework. This book is devoted to understanding the importance of meaning for spatial and related further aspects of hearing, including cross-modal inference. People, when exposed to acoustic stimuli, do not react directly to what they hear but rather to what they hear means to them. This semiotic maxim may not always apply, for instance, when the reactions are reflexive. But, where it does apply, it poses a major challenge to the builders of models of the auditory system. Take, for example, an auditory model that is meant to be implemented on a robotic agent for autonomous search-&-rescue actions. Or think of a system that can perform judgments on the sound quality of multimedia-reproduction systems. It becomes immediately clear that such a system needs • Cognitive capabilities, including substantial inherent knowledge • The ability to integrate information across different sensory modalities To realize these functions, the auditory system provides a pair of sensory organs, the two ears, and the means to perform adequate preprocessing of the signals provided by the ears. This is realized in the subcortical parts of the auditory system. In the title of a prior book, the term Binaural Listening is used to indicate a focus on sub-cortical functions. Psychoacoustics and auditory signal processing contribute substantially to this area. The preprocessed signals are then forwarded to the cortical parts of the auditory system where, among other things, recognition, classification, localization, scene analysis, assignment of meaning, quality assessment, and action planning take place. Also, information from different sensory modalities is integrated at this level. Between sub-cortical and cortical regions of the auditory system, numerous feedback loops exist that ultimately support the high complexity and plasticity of the auditory system. The current book concentrates on these cognitive functions. Instead of processing signals, processing symbols is now the predominant modeling task. Substantial contributions to the field draw upon the knowledge acquired by cognitive psychology. The keyword Binaural Understanding in the book title characterizes this shift. Both books, The Technology of Binaural Listening and the current one, have been stimulated and supported by AABBA, an open research group devoted to the development and application of models of binaural hearing. The current book is dedicated to technologies that help explain, facilitate, apply, and support various aspects of binaural understanding. It is organized into five parts, each containing three to six chapters in order to provide a comprehensive overview of this emerging area. Each chapter was thoroughly reviewed by at least two anonymous, external experts. The first part deals with the psychophysical and physiological effects of Forming and Interpreting Aural Objects as well as the underlying models. The fundamental concepts of reflexive and reflective auditory feedback are introduced. Mechanisms of binaural attention and attention switching are covered—as well as how auditory Gestalt rules facilitate binaural understanding. A general blackboard architecture is introduced as an example of how machines can learn to form and interpret aural objects to simulate human cognitive listening. The second part, Configuring and Understanding Aural Space, focuses on the human understanding of complex three-dimensional environments—covering the psychological and biological fundamentals of auditory space formation. This part further addresses the human mechanisms used to process information and interact in complex reverberant environments, such as concert halls and forests, and additionally examines how the auditory system can learn to understand and adapt to these environments. The third part is dedicated to Processing Cross-Modal Inference and highlights the fundamental human mechanisms used to integrate auditory cues with cues from other modalities to localize and form perceptual objects. This part also provides a general framework for understanding how complex multimodal scenes can be simulated and rendered. The fourth part, Evaluating Aural-scene Quality and Speech Understanding, focuses on the object-forming aspects of binaural listening and understanding. It addresses cognitive mechanisms involved in both the understanding of speech and the processing of nonverbal information such as Sound Quality and Quality-of- Experience. The aesthetic judgment of rooms is also discussed in this context. Models that simulate underlying human processes and performance are covered in addition to techniques for rendering virtual environments that can then be used to test these models. The fifth part deals with the Application of Cognitive Mechanisms to Audio Technology. It highlights how cognitive mechanisms can be utilized to create spatial auditory illusions using binaural and other 3D-audio technologies. Further, it covers how cognitive binaural technologies can be applied to improve human performance in auditory displays and to develop new auditory technologies for interactive robots. The book concludes with the application of cognitive binaural technologies to the next generation of hearing aids.
Author: Jose Luis Rojo-Alvarez
Publisher: John Wiley & Sons
Published: 2018-02-05
Total Pages: 665
ISBN-13: 1118611799
DOWNLOAD EBOOKA realistic and comprehensive review of joint approaches to machine learning and signal processing algorithms, with application to communications, multimedia, and biomedical engineering systems Digital Signal Processing with Kernel Methods reviews the milestones in the mixing of classical digital signal processing models and advanced kernel machines statistical learning tools. It explains the fundamental concepts from both fields of machine learning and signal processing so that readers can quickly get up to speed in order to begin developing the concepts and application software in their own research. Digital Signal Processing with Kernel Methods provides a comprehensive overview of kernel methods in signal processing, without restriction to any application field. It also offers example applications and detailed benchmarking experiments with real and synthetic datasets throughout. Readers can find further worked examples with Matlab source code on a website developed by the authors: http://github.com/DSPKM • Presents the necessary basic ideas from both digital signal processing and machine learning concepts • Reviews the state-of-the-art in SVM algorithms for classification and detection problems in the context of signal processing • Surveys advances in kernel signal processing beyond SVM algorithms to present other highly relevant kernel methods for digital signal processing An excellent book for signal processing researchers and practitioners, Digital Signal Processing with Kernel Methods will also appeal to those involved in machine learning and pattern recognition.
Author: Jun Peng
Publisher: BoD – Books on Demand
Published: 2010-09-28
Total Pages: 449
ISBN-13: 9533071141
DOWNLOAD EBOOKThis book "Communications and Networking" focuses on the issues at the lowest two layers of communications and networking and provides recent research results on some of these issues. In particular, it first introduces recent research results on many important issues at the physical layer and data link layer of communications and networking and then briefly shows some results on some other important topics such as security and the application of wireless networks. In summary, this book covers a wide range of interesting topics of communications and networking. The introductions, data, and references in this book will help the readers know more abut this topic and help them explore this exciting and fast-evolving field.
Author: Thomas Fang Zheng
Publisher: Springer
Published: 2017-04-06
Total Pages: 57
ISBN-13: 9811032386
DOWNLOAD EBOOKThis book presents an overview of speaker recognition technologies with an emphasis on dealing with robustness issues. Firstly, the book gives an overview of speaker recognition, such as the basic system framework, categories under different criteria, performance evaluation and its development history. Secondly, with regard to robustness issues, the book presents three categories, including environment-related issues, speaker-related issues and application-oriented issues. For each category, the book describes the current hot topics, existing technologies, and potential research focuses in the future. The book is a useful reference book and self-learning guide for early researchers working in the field of robust speech recognition.
Author: Michael M. Goodwin
Publisher: Springer Science & Business Media
Published: 2012-09-10
Total Pages: 259
ISBN-13: 1441986286
DOWNLOAD EBOOKAdaptive Signal Models: Theory, Algorithms and Audio Applications presents methods for deriving mathematical models of natural signals. The introduction covers the fundamentals of analysis-synthesis systems and signal representations. Some of the topics in the introduction include perfect and near-perfect reconstruction, the distinction between parametric and nonparametric methods, the role of compaction in signal modeling, basic and overcomplete signal expansions, and time-frequency resolution issues. These topics arise throughout the book as do a number of other topics such as filter banks and multiresolution. The second chapter gives a detailed development of the sinusoidal model as a parametric extension of the short-time Fourier transform. This leads to multiresolution sinusoidal modeling techniques in Chapter Three, where wavelet-like approaches are merged with the sinusoidal model to yield improved models. In Chapter Four, the analysis-synthesis residual is considered; for realistic synthesis, the residual must be separately modeled after coherent components (such as sinusoids) are removed. The residual modeling approach is based on psychoacoustically motivated nonuniform filter banks. Chapter Five deals with pitch-synchronous versions of both the wavelet and the Fourier transform; these allow for compact models of pseudo-periodic signals. Chapter Six discusses recent algorithms for deriving signal representations based on time-frequency atoms; primarily, the matching pursuit algorithm is reviewed and extended. The signal models discussed in the book are compact, adaptive, parametric, time-frequency representations that are useful for analysis, coding, modification, and synthesis of natural signals such as audio. The models are all interpreted as methods for decomposing a signal in terms of fundamental time-frequency atoms; these interpretations, as well as the adaptive and parametric natures of the models, serve to link the various methods dealt with in the text. Adaptive Signal Models: Theory, Algorithms and Audio Applications serves as an excellent reference for researchers of signal processing and may be used as a text for advanced courses on the topic.
Author: Shun-Zheng Yu
Publisher: Morgan Kaufmann
Published: 2015-10-22
Total Pages: 209
ISBN-13: 0128027711
DOWNLOAD EBOOKHidden semi-Markov models (HSMMs) are among the most important models in the area of artificial intelligence / machine learning. Since the first HSMM was introduced in 1980 for machine recognition of speech, three other HSMMs have been proposed, with various definitions of duration and observation distributions. Those models have different expressions, algorithms, computational complexities, and applicable areas, without explicitly interchangeable forms. Hidden Semi-Markov Models: Theory, Algorithms and Applications provides a unified and foundational approach to HSMMs, including various HSMMs (such as the explicit duration, variable transition, and residential time of HSMMs), inference and estimation algorithms, implementation methods and application instances. Learn new developments and state-of-the-art emerging topics as they relate to HSMMs, presented with examples drawn from medicine, engineering and computer science. - Discusses the latest developments and emerging topics in the field of HSMMs - Includes a description of applications in various areas including, Human Activity Recognition, Handwriting Recognition, Network Traffic Characterization and Anomaly Detection, and Functional MRI Brain Mapping. - Shows how to master the basic techniques needed for using HSMMs and how to apply them.
Author: Thomas Kaiser
Publisher: Hindawi Publishing Corporation
Published: 2005
Total Pages: 891
ISBN-13: 9775945097
DOWNLOAD EBOOKSmart Antennas—State of the Art brings together the broad expertise of 41 European experts in smart antennas. They provide a comprehensive review and an extensive analysis of the recent progress and new results generated during the last years in almost all fields of smart antennas and MIMO (multiple-input multiple-output) transmission. The following represents a summarized table of content.Receiver: space-time processing, antenna combining, reduced rank processing, robust beamforming, subspace methods, synchronization, equalization, multiuser detection, iterative methods Channel: propagation, measurements and sounding, modelling, channel estimation, direction-of-arrival estimation, subscriber location estimation Transmitter: space-time block coding, channel side information, unified design of linear transceivers, ill-conditioned channels, MIMO-MAC strategies Network Theory: channel capacity, network capacity, multihop networks Technology: antenna design, transceivers, demonstrators and testbeds, future air interfaces Applications and Systems: 3G system and link level aspects, MIMO HSDPA, MIMO-WLAN/UMTS implementation issues This book serves as a reference for scientists and engineers who need to be aware of the leading edge research in multiple-antenna communications, an essential technology for emerging broadband wireless systems.
Author: Ivan Habernal
Publisher: Springer
Published: 2013-08-17
Total Pages: 617
ISBN-13: 3642405851
DOWNLOAD EBOOKThis book constitutes the refereed proceedings of the 16th International Conference on Text, Speech and Dialogue, TSD 2013, held in Pilsen, Czech Republic, in September 2013. The 65 papers presented together with 5 invited talks were carefully reviewed and selected from 148 submissions. The main topics of this year's conference was corpora, texts and transcription, speech analysis, recognition and synthesis, and their intertwining within NL dialogue systems. The topics also included speech recognition, corpora and language resources, speech and spoken language generation, tagging, classification and parsing of text and speech, semantic processing of text and speech, integrating applications of text and speech processing, as well as automatic dialogue systems, and multimodal techniques and modelling.