The use of non-intrusive virtual environments is gaining more and more importance but was focused mainly on addressing the visual sense. However, the human perception consists not only of visual input and thus it would be worthwhile to create multi-modal and interactive virtual environments. This thesis describes the techniques required to include the acoustic component into a virtual environment and furthermore the implementation of a software system, which takes advantage of these techniques to create complex acoustical scenes in real time. The system is based on the binaural technology. It features spatially distributed sound sources which are utilized to create an environment that is as authentic as possible. This comprises a description of the source, including its relevant angle-, distance- and time- dependent radiation, the sound distribution in the virtual scene (room acoustics), the perception-related consideration of all sound field components, as well as the exact reproduction of the artificial sound at the ears of the user. The focus of this thesis is put on the reproduction technology. In this context, an approach for dynamic crosstalk cancellation is presented, which enables a loudspeaker-based reproduction. The required filters are processed in real time on the basis of the position data and measured transfer functions of the outer ear. Furthermore the integration of this spatial audio system into a five-sided Virtual Reality display system is described and evaluated.
In this work the importance of individualization in binaural technique is investigated. The results extend the present knowledge on the efficient measurement of individual head-related transfer functions (HRTFs) and highlight the importance of individual equalization filters in binaural reproduction, using both loudspeakers and headphones. Moreover, an integrated framework for the calculation of such equalization filters is presented. An innovative measurement setup was developed to allow the fast acquisition of individual HRTFs. The hardware was designed to be compatible with the range extrapolation technique. An individual HRTF dataset with 4000 directions can be measured in less than 6 minutes with this new setup. A framework was presented that integrates causality constraints to the regularized frequency domain calculation of crosstalk cancellation (CTC) filter. This framework also addresses the switching of active loudspeakers applying a weighted filter calculation method. A sound localization test showed that individualized CTC systems provide performance similar to that of binaural listening while nonindividualized CTC systems provide a significantly lower localization performance. Finally, a robust individual headphone equalization method was proposed. Perceptual tests showed that, in all but one of the tested situations, no audible differences between the original sound source and its binaural auditory display could be perceived.
The current popular and scientific interest in virtual environments has provided a new impetus for investigating binaural and spatial hearing. However, the many intriguing phenomena of spatial hearing have long made it an exciting area of scientific inquiry. Psychophysical and physiological investigations of spatial hearing seem to be converging on common explanations of underlying mechanisms. These understandings have in turn been incorporated into sophisticated yet mathematically tractable models of binaural interaction. Thus, binaural and spatial hearing is one of the few areas in which professionals are soon likely to find adequate physiological explanations of complex psychological phenomena that can be reasonably and usefully approximated by mathematical and physical models. This volume grew out of the Conference on Binaural and Spatial Hearing, a four-day event held at Wright-Patterson Air Force Base in response to rapid developments in binaural and spatial hearing research and technology. Meant to be more than just a proceedings, it presents chapters that are longer than typical proceedings papers and contain considerably more review material, including extensive bibliographies in many cases. Arranged into topical sections, the chapters represent major thrusts in the recent literature. The authors of the first chapter in each section have been encouraged to take a broad perspective and review the current state of literature. Subsequent chapters in each section tend to be somewhat more narrowly focused, and often emphasize the authors' own work. Thus, each section provides overview, background, and current research on a particular topic. This book is significant in that it reviews the important work during the past 10 to 15 years, and provides greater breadth and depth than most of the previous works.
Immersive Sound: The Art and Science of Binaural and Multi-Channel Audio provides a comprehensive guide to multi-channel sound. With contributions from leading recording engineers, researchers, and industry experts, Immersive Sound includes an in-depth description of the physics and psychoacoustics of spatial audio as well as practical applications. Chapters include the history of 3D sound, binaural reproduction over headphones and loudspeakers, stereo, surround sound, height channels, object-based audio, soundfield (ambisonics), wavefield synthesis, and multi-channel mixing techniques. Knowledge of the development, theory, and practice of spatial and multi-channel sound is essential to those advancing the research and applications in the rapidly evolving fields of 3D sound recording, augmented and virtual reality, gaming, film sound, music production, and post-production.
The field of spatial hearing has exploded in the decade or so since Jens Blauert's classic work on acoustics was first published in English. This revised edition adds a new chapter that describes developments in such areas as auditory virtual reality (an important field of application that is based mainly on the physics of spatial hearing), binaural technology (modeling speech enhancement by binaural hearing), and spatial sound-field mapping. The chapter also includes recent research on the precedence effect that provides clear experimental evidence that cognition plays a significant role in spatial hearing.The remaining four chapters in this comprehensive reference cover auditory research procedures and psychometric methods, spatial hearing with one sound source, spatial hearing with multiple sound sources and in enclosed spaces, and progress and trends from 1972 (the first German edition) to 1983 (the first English edition) -- work that includes research on the physics of the external ear, and the application of signal processing theory to modeling the spatial hearing process. There is an extensive bibliography of more than 900 items.
Sound, devoid of meaning, would not matter to us. It is the information sound conveys that helps the brain to understand its environment. Sound and its underlying meaning are always associated with time and space. There is no sound without spatial properties, and the brain always organizes this information within a temporal–spatial framework. This book is devoted to understanding the importance of meaning for spatial and related further aspects of hearing, including cross-modal inference. People, when exposed to acoustic stimuli, do not react directly to what they hear but rather to what they hear means to them. This semiotic maxim may not always apply, for instance, when the reactions are reflexive. But, where it does apply, it poses a major challenge to the builders of models of the auditory system. Take, for example, an auditory model that is meant to be implemented on a robotic agent for autonomous search-&-rescue actions. Or think of a system that can perform judgments on the sound quality of multimedia-reproduction systems. It becomes immediately clear that such a system needs • Cognitive capabilities, including substantial inherent knowledge • The ability to integrate information across different sensory modalities To realize these functions, the auditory system provides a pair of sensory organs, the two ears, and the means to perform adequate preprocessing of the signals provided by the ears. This is realized in the subcortical parts of the auditory system. In the title of a prior book, the term Binaural Listening is used to indicate a focus on sub-cortical functions. Psychoacoustics and auditory signal processing contribute substantially to this area. The preprocessed signals are then forwarded to the cortical parts of the auditory system where, among other things, recognition, classification, localization, scene analysis, assignment of meaning, quality assessment, and action planning take place. Also, information from different sensory modalities is integrated at this level. Between sub-cortical and cortical regions of the auditory system, numerous feedback loops exist that ultimately support the high complexity and plasticity of the auditory system. The current book concentrates on these cognitive functions. Instead of processing signals, processing symbols is now the predominant modeling task. Substantial contributions to the field draw upon the knowledge acquired by cognitive psychology. The keyword Binaural Understanding in the book title characterizes this shift. Both books, The Technology of Binaural Listening and the current one, have been stimulated and supported by AABBA, an open research group devoted to the development and application of models of binaural hearing. The current book is dedicated to technologies that help explain, facilitate, apply, and support various aspects of binaural understanding. It is organized into five parts, each containing three to six chapters in order to provide a comprehensive overview of this emerging area. Each chapter was thoroughly reviewed by at least two anonymous, external experts. The first part deals with the psychophysical and physiological effects of Forming and Interpreting Aural Objects as well as the underlying models. The fundamental concepts of reflexive and reflective auditory feedback are introduced. Mechanisms of binaural attention and attention switching are covered—as well as how auditory Gestalt rules facilitate binaural understanding. A general blackboard architecture is introduced as an example of how machines can learn to form and interpret aural objects to simulate human cognitive listening. The second part, Configuring and Understanding Aural Space, focuses on the human understanding of complex three-dimensional environments—covering the psychological and biological fundamentals of auditory space formation. This part further addresses the human mechanisms used to process information and interact in complex reverberant environments, such as concert halls and forests, and additionally examines how the auditory system can learn to understand and adapt to these environments. The third part is dedicated to Processing Cross-Modal Inference and highlights the fundamental human mechanisms used to integrate auditory cues with cues from other modalities to localize and form perceptual objects. This part also provides a general framework for understanding how complex multimodal scenes can be simulated and rendered. The fourth part, Evaluating Aural-scene Quality and Speech Understanding, focuses on the object-forming aspects of binaural listening and understanding. It addresses cognitive mechanisms involved in both the understanding of speech and the processing of nonverbal information such as Sound Quality and Quality-of- Experience. The aesthetic judgment of rooms is also discussed in this context. Models that simulate underlying human processes and performance are covered in addition to techniques for rendering virtual environments that can then be used to test these models. The fifth part deals with the Application of Cognitive Mechanisms to Audio Technology. It highlights how cognitive mechanisms can be utilized to create spatial auditory illusions using binaural and other 3D-audio technologies. Further, it covers how cognitive binaural technologies can be applied to improve human performance in auditory displays and to develop new auditory technologies for interactive robots. The book concludes with the application of cognitive binaural technologies to the next generation of hearing aids.
This is the complete practical introduction to virtual reality and multimedia for those wishing to build systems. It covers the foundations and engineering needed to design and construct projects incorporating video, audio and textural elements and including the use of the latest hardware, to create an artificial world for education, information or entertainment. Production and authoring platforms are described, computer animation and hypertext are covered, but those looking for pages of software listings and computerspeak will be disappointed. This book is about the nuts and bolts: sound and video cards, head mounted displays, CrystalEyes glasses, other 3D glasses for entertainment, audio and video production, and realistic auditory and visual stimulation including stereoscopy. The creation of Cyberspace, and strategies to achieve a complete Cyberatmosphere are presented. Three-dimensional sound generation and video techniques that have never previously been published are revealed. This is the handbook for anyone working in the industry, or hoping to enter it. It also provides a guide for those hoping to 'cross-fertilise' the industry, coming from audio, video, computing or engineering backgrounds. A complete technical guide to MM and VR Includes a Hypertext edition of the book with added audio and graphics on CD Hardware, software, video and never before published 3D audio techniques covered
This open access book provides a concise explanation of the fundamentals and background of the surround sound recording and playback technology Ambisonics. It equips readers with the psychoacoustical, signal processing, acoustical, and mathematical knowledge needed to understand the inner workings of modern processing utilities, special equipment for recording, manipulation, and reproduction in the higher-order Ambisonic format. The book comes with various practical examples based on free software tools and open scientific data for reproducible research. The book’s introductory section offers a perspective on Ambisonics spanning from the origins of coincident recordings in the 1930s to the Ambisonic concepts of the 1970s, as well as classical ways of applying Ambisonics in first-order coincident sound scene recording and reproduction that have been practiced since the 1980s. As, from time to time, the underlying mathematics become quite involved, but should be comprehensive without sacrificing readability, the book includes an extensive mathematical appendix. The book offers readers a deeper understanding of Ambisonic technologies, and will especially benefit scientists, audio-system and audio-recording engineers. In the advanced sections of the book, fundamentals and modern techniques as higher-order Ambisonic decoding, 3D audio effects, and higher-order recording are explained. Those techniques are shown to be suitable to supply audience areas ranging from studio-sized to hundreds of listeners, or headphone-based playback, regardless whether it is live, interactive, or studio-produced 3D audio material.