Perceptual Organization for Speech and Other Auditory Signals

Perceptual Organization for Speech and Other Auditory Signals

Author: Robert Peters

Publisher:

Published: 1967

Total Pages: 82

ISBN-13:

DOWNLOAD EBOOK

A series of experiments that treated auditory perception in humans was conducted. These investigations were at the information processing level and were structured to test hypotheses of sensory filtering, feature detection, the organization of a matching system, and the possible role of the motor theory of speech perception in the perception of speech. The studies include multidimensional scaling investigations, tests of the motor theory of speech perception, studies on subphonemic or distinctive features of speech, and experiments on the perceived order of short auditory events. The results of these studies support the idea that the auditory system operates as a feature detector and that these features may relate to articulatory properties of the vocal tract. Further evidence of features was found in short-term recall of phonemes where the error responses indicated that features were retained where phonomes were forgotten. Investigations of perceived order of short auditory events indicate that similar stimuli are grouped together by the auditory system and, in some instances, are heard in a perceptual order that is different from the actual physical order of the stimuli. (Author).


Auditory Scene Analysis

Auditory Scene Analysis

Author: Albert S. Bregman

Publisher: MIT Press

Published: 1994-09-29

Total Pages: 800

ISBN-13: 9780262521956

DOWNLOAD EBOOK

Auditory Scene Analysis addresses the problem of hearing complex auditory environments, using a series of creative analogies to describe the process required of the human auditory system as it analyzes mixtures of sounds to recover descriptions of individual sounds. In a unified and comprehensive way, Bregman establishes a theoretical framework that integrates his findings with an unusually wide range of previous research in psychoacoustics, speech perception, music theory and composition, and computer modeling.


Computational Auditory Scene Analysis

Computational Auditory Scene Analysis

Author: Deliang Wang

Publisher: Wiley-IEEE Press

Published: 2006-09-29

Total Pages: 432

ISBN-13:

DOWNLOAD EBOOK

Provides a comprehensive and coherent account of the state of the art in CASA, in terms of the underlying principles, the algorithms and system architectures that are employed, and the potential applications of this exciting new technology.


Neural Mechanisms of Perceptual Categorization as Precursors to Speech Perception

Neural Mechanisms of Perceptual Categorization as Precursors to Speech Perception

Author: Einat Liebenthal

Publisher: Frontiers Media SA

Published: 2017-05-03

Total Pages: 188

ISBN-13: 2889451585

DOWNLOAD EBOOK

Perceptual categorization is fundamental to the brain’s remarkable ability to process large amounts of sensory information and efficiently recognize objects including speech. Perceptual categorization is the neural bridge between lower-level sensory and higher-level language processing. A long line of research on the physical properties of the speech signal as determined by the anatomy and physiology of the speech production apparatus has led to descriptions of the acoustic information that is used in speech recognition (e.g., stop consonants place and manner of articulation, voice onset time, aspiration). Recent research has also considered what visual cues are relevant to visual speech recognition (i.e., the visual counter-parts used in lipreading or audiovisual speech perception). Much of the theoretical work on speech perception was done in the twentieth century without the benefit of neuroimaging technologies and models of neural representation. Recent progress in understanding the functional organization of sensory and association cortices based on advances in neuroimaging presents the possibility of achieving a comprehensive and far reaching account of perception in the service of language. At the level of cell assemblies, research in animals and humans suggests that neurons in the temporal cortex are important for encoding biological categories. On the cellular level, different classes of neurons (interneurons and pyramidal neurons) have been suggested to play differential roles in the neural computations underlying auditory and visual categorization. The moment is ripe for a research topic focused on neural mechanisms mediating the emergence of speech representations (including auditory, visual and even somatosensory based forms). Important progress can be achieved by juxtaposing within the same research topic the knowledge that currently exists, the identified lacunae, and the theories that can support future investigations. This research topic provides a snapshot and platform for discussion of current understanding of neural mechanisms underlying the formation of perceptual categories and their relationship to language from a multidisciplinary and multisensory perspective. It includes contributions (reviews, original research, methodological developments) pertaining to the neural substrates, dynamics, and mechanisms underlying perceptual categorization and their interaction with neural processes governing speech perception.


Audiovisual Speech Recognition: Correspondence between Brain and Behavior

Audiovisual Speech Recognition: Correspondence between Brain and Behavior

Author: Nicholas Altieri

Publisher: Frontiers E-books

Published: 2014-07-09

Total Pages: 102

ISBN-13: 2889192512

DOWNLOAD EBOOK

Perceptual processes mediating recognition, including the recognition of objects and spoken words, is inherently multisensory. This is true in spite of the fact that sensory inputs are segregated in early stages of neuro-sensory encoding. In face-to-face communication, for example, auditory information is processed in the cochlea, encoded in auditory sensory nerve, and processed in lower cortical areas. Eventually, these “sounds” are processed in higher cortical pathways such as the auditory cortex where it is perceived as speech. Likewise, visual information obtained from observing a talker’s articulators is encoded in lower visual pathways. Subsequently, this information undergoes processing in the visual cortex prior to the extraction of articulatory gestures in higher cortical areas associated with speech and language. As language perception unfolds, information garnered from visual articulators interacts with language processing in multiple brain regions. This occurs via visual projections to auditory, language, and multisensory brain regions. The association of auditory and visual speech signals makes the speech signal a highly “configural” percept. An important direction for the field is thus to provide ways to measure the extent to which visual speech information influences auditory processing, and likewise, assess how the unisensory components of the signal combine to form a configural/integrated percept. Numerous behavioral measures such as accuracy (e.g., percent correct, susceptibility to the “McGurk Effect”) and reaction time (RT) have been employed to assess multisensory integration ability in speech perception. On the other hand, neural based measures such as fMRI, EEG and MEG have been employed to examine the locus and or time-course of integration. The purpose of this Research Topic is to find converging behavioral and neural based assessments of audiovisual integration in speech perception. A further aim is to investigate speech recognition ability in normal hearing, hearing-impaired, and aging populations. As such, the purpose is to obtain neural measures from EEG as well as fMRI that shed light on the neural bases of multisensory processes, while connecting them to model based measures of reaction time and accuracy in the behavioral domain. In doing so, we endeavor to gain a more thorough description of the neural bases and mechanisms underlying integration in higher order processes such as speech and language recognition.


Dynamics of Speech Production and Perception

Dynamics of Speech Production and Perception

Author: P.L. Divenyi

Publisher: IOS Press

Published: 2006-09-20

Total Pages: 388

ISBN-13: 1607502038

DOWNLOAD EBOOK

The idea that speech is a dynamic process is a tautology: whether from the standpoint of the talker, the listener, or the engineer, speech is an action, a sound, or a signal continuously changing in time. Yet, because phonetics and speech science are offspring of classical phonology, speech has been viewed as a sequence of discrete events-positions of the articulatory apparatus, waveform segments, and phonemes. Although this perspective has been mockingly referred to as "beads on a string", from the time of Henry Sweet's 19th century treatise almost up to our days specialists of speech science and speech technology have continued to conceptualize the speech signal as a sequence of static states interleaved with transitional elements reflecting the quasi-continuous nature of vocal production. This book, a collection of papers of which each looks at speech as a dynamic process and highlights one of its particularities, is dedicated to the memory of Ludmilla Andreevna Chistovich. At the outset, it was planned to be a Chistovich festschrift but, sadly, she passed away a few months before the book went to press. The 24 chapters of this volume testify to the enormous influence that she and her colleagues have had over the four decades since the publication of their 1965 monograph.


Auditory Perception

Auditory Perception

Author: Richard M. Warren

Publisher: Cambridge University Press

Published: 1998-11-28

Total Pages: 255

ISBN-13: 9780521587839

DOWNLOAD EBOOK

This new edition of Auditory Perception: A New Synthesis, a book originally published by Pergamon Press (1982), describes the nature of sound, how it is analyzed by the auditory system, and the rules and principles governing our interpretation of auditory input. It guides the reader through the physics of sound and the anatomy and physiology of the inner ear and nervous system before embarking on an explanation of how experiments reveal the means by which we locate and identify sound sources and events, and how we recognize and interpret the patterns of music and speech. The new material includes discoveries concerning cochlear mechanics and neural transduction, processes involved in the perceptual restoration of portions of signals obliterated by extraneous sounds, and the manner in which sequences of sounds including those of speech and music, are organized into recognizable patterns. In addition, a chapter on speech describes how processes employed for the perception of brief nonverbal sounds are used for the organization of syllables and words, along with an overlay of special linguistic mechanisms. The book comes with an accompanying CD-ROM containing audio demonstrations, allowing the reader to experience directly some of the auditory illusions that have been described, and providing new insight into the mechanisms employed in perceptual organization. Advance undergraduate and graduate students interested in auditory perception in behavioral sciences, psychology, neurobiology, and speech and hearing sciences, will find this book an excellent advanced guide to the subject.


Listening to Speech

Listening to Speech

Author: Steven Greenberg

Publisher: Psychology Press

Published: 2012-12-06

Total Pages: 443

ISBN-13: 1135624909

DOWNLOAD EBOOK

The human species is largely defined by its use of spoken language, so integral is speech communication to behavior and social interaction. Despite its importance in everyday life, comparatively little is known about the auditory mechanisms that underlie the ability to understand language. The current volume examines the perception and processing of speech from the perspective of the hearing system. The chapters in this book describe a comprehensive set of approaches to the scientific study of speech and hearing, ranging from anatomy and physiology, to psychophysics and perception, and computational modeling. The auditory basis of speech is examined within a biological and an evolutionary context, and its relevance to applied domains such as communication disorders and speech technology discussed in detail. This volume will be of interest to scientists, engineers, and clinicians whose professional work pertains to any aspect of spoken language or hearing science.