New Spectral Methods for Analysis of Source/filter Characteristics of Speech Signals

New Spectral Methods for Analysis of Source/filter Characteristics of Speech Signals

Author: Baris Bozkurt

Publisher: Presses univ. de Louvain

Published: 2006

Total Pages: 125

ISBN-13: 2874630136

DOWNLOAD EBOOK

This study proposes a new spectral representation called the Zeros of Z-Transform (ZZT), which is an all-zero representation of the z-transform of the signal. In addition, new chirp group delay processing techniques are developed for analysis of resonances of a signal. The combination of the ZZT representation with the chirp group delay processing algorithms provides a useful domain to study resonance characteristics of source and filter components of speech. Using the two representations, effective algorithms are developed for: source-tract decomposition of speech, glottal flow parameter estimation, formant tracking and feature extraction for speech recognition. The ZZT representation is mainly important for theoretical studies. Studying the ZZT of a signal is essential to be able to develop effective chirp group delay processing methods. Therefore, first the ZZT representation of the source-filter model of speech is studied for providing a theoretical background. We confirm through ZZT representation that anti-causality of the glottal flow signal introduces mixed-phase characteristics in speech signals. The ZZT of windowed speech signals is also studied since windowing cannot be avoided in practical signal processing algorithms and the effect of windowing on ZZT representation is drastic. We show that separate patterns exist in ZZT representations of windowed speech signals for the glottal flow and the vocal tract contributions. A decomposition method for source-tract separation is developed based on these patterns in ZZT. We define chirp group delay as group delay calculated on a circle other than the unit circle in z-plane. The need to compute group delay on a circle other than the unit circle comes from the fact that group delay spectra are often very noisy and cannot be easily processed for formant tracking purposes (the reasons are explained through ZZT representation). In this thesis, we propose methods to avoid such problems by modifying the ZZT of a signal and further computing the chirp group delay spectrum. New algorithms based on processing of the chirp group delay spectrum are developed for formant tracking and feature estimation for speech recognition. The proposed algorithms are compared to state-of-the-art techniques. Equivalent or higher efficiency is obtained for all proposed algorithms. The theoretical parts of the thesis further discuss a mixed-phase model for speech and phase processing problems in detail. Index Terms—spectral representation, source-filter separation, glottal flow estimation, formant tracking, zeros of z-transform, group delay processing, phase processing.


Progress in Nonlinear Speech Processing

Progress in Nonlinear Speech Processing

Author: Yannis Stylianou

Publisher: Springer

Published: 2007-05-24

Total Pages: 280

ISBN-13: 3540715053

DOWNLOAD EBOOK

This book constitutes of the major results of the EU COST (European Cooperation in the field of Scientific and Technical Research) Action 277: NSP, Nonlinear Speech Processing, running from April 2001 to June 2005. Coverage includes such areas as speech analysis for speech synthesis, speech recognition, speech-non speech discrimination and voice quality assessment, speech enhancement, and emotional state detection.


Advances in Nonlinear Speech Processing

Advances in Nonlinear Speech Processing

Author: Mohamed Chetouani

Publisher: Springer Science & Business Media

Published: 2008-01-11

Total Pages: 293

ISBN-13: 3540773460

DOWNLOAD EBOOK

This intriguing book constitutes the thoroughly refereed postproceedings of the International Conference on Non-Linear Speech Processing, NOLISP 2007, held in Paris, France, in May 2007. The 24 revised full papers presented were carefully reviewed and selected from numerous submissions. The papers are organized in topical sections on nonlinear and non-conventional techniques, speech synthesis, speaker recognition, speech recognition, and many other subjects.


Bandwidth Extension of Speech Signals

Bandwidth Extension of Speech Signals

Author: Bernd Iser

Publisher: Springer Science & Business Media

Published: 2008-07-15

Total Pages: 200

ISBN-13: 9780387688992

DOWNLOAD EBOOK

Bandwidth Extension of Speech Signals describes the theory and methods for quality enhancement of clean speech signals and distorted speech signals such as those that have undergone a band limitation, for instance, in a telephone network. Problems and the respective solutions are discussed for the different approaches. The different approaches are evaluated and a real-time implementation of the most promising approach is presented. The book includes topics related to speech coding, pattern- / speech recognition, speech enhancement, statistics and digital signal processing in general.


Secure IT Systems

Secure IT Systems

Author: Hans P. Reiser

Publisher: Springer Nature

Published: 2023-01-01

Total Pages: 390

ISBN-13: 3031222954

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 27th Nordic Conference on Secure IT Systems, NordSec 2022, held in Reykjavic, Iceland, during November 30 – December 2, 2022. The 20 full papers presented in this volume were carefully reviewed and selected from 89 submissions. The NordSec conference series addresses a broad range of topics within IT security and privacy.


Digital Signal Processing and Applications with the C6713 and C6416 DSK

Digital Signal Processing and Applications with the C6713 and C6416 DSK

Author: Rulph Chassaing

Publisher: John Wiley & Sons

Published: 2004-12-20

Total Pages: 542

ISBN-13: 0471704067

DOWNLOAD EBOOK

This book is a tutorial on digital techniques for waveform generation, digital filters, and digital signal processing tools and techniques The typical chapter begins with some theoretical material followed by working examples and experiments using the TMS320C6713-based DSPStarter Kit (DSK) The C6713 DSK is TI's newest signal processor based on the C6x processor (replacing the C6711 DSK)


Digital Signal Processing Handbook on CD-ROM

Digital Signal Processing Handbook on CD-ROM

Author: VIJAY MADISETTI

Publisher: CRC Press

Published: 1999-02-26

Total Pages: 1725

ISBN-13: 0849321352

DOWNLOAD EBOOK

A best-seller in its print version, this comprehensive CD-ROM reference contains unique, fully searchable coverage of all major topics in digital signal processing (DSP), establishing an invaluable, time-saving resource for the engineering community. Its unique and broad scope includes contributions from all DSP specialties, including: telecommunications, computer engineering, acoustics, seismic data analysis, DSP software and hardware, image and video processing, remote sensing, multimedia applications, medical technology, radar and sonar applications


Audio and Speech Processing with MATLAB

Audio and Speech Processing with MATLAB

Author: Paul Hill

Publisher: CRC Press

Published: 2018-12-07

Total Pages: 330

ISBN-13: 0429813961

DOWNLOAD EBOOK

Speech and audio processing has undergone a revolution in preceding decades that has accelerated in the last few years generating game-changing technologies such as truly successful speech recognition systems; a goal that had remained out of reach until very recently. This book gives the reader a comprehensive overview of such contemporary speech and audio processing techniques with an emphasis on practical implementations and illustrations using MATLAB code. Core concepts are firstly covered giving an introduction to the physics of audio and vibration together with their representations using complex numbers, Z transforms and frequency analysis transforms such as the FFT. Later chapters give a description of the human auditory system and the fundamentals of psychoacoustics. Insights, results, and analyses given in these chapters are subsequently used as the basis of understanding of the middle section of the book covering: wideband audio compression (MP3 audio etc.), speech recognition and speech coding. The final chapter covers musical synthesis and applications describing methods such as (and giving MATLAB examples of) AM, FM and ring modulation techniques. This chapter gives a final example of the use of time-frequency modification to implement a so-called phase vocoder for time stretching (in MATLAB). Features A comprehensive overview of contemporary speech and audio processing techniques from perceptual and physical acoustic models to a thorough background in relevant digital signal processing techniques together with an exploration of speech and audio applications. A carefully paced progression of complexity of the described methods; building, in many cases, from first principles. Speech and wideband audio coding together with a description of associated standardised codecs (e.g. MP3, AAC and GSM). Speech recognition: Feature extraction (e.g. MFCC features), Hidden Markov Models (HMMs) and deep learning techniques such as Long Short-Time Memory (LSTM) methods. Book and computer-based problems at the end of each chapter. Contains numerous real-world examples backed up by many MATLAB functions and code.


Video, Speech, and Audio Signal Processing and Associated Standards

Video, Speech, and Audio Signal Processing and Associated Standards

Author: Vijay Madisetti

Publisher: CRC Press

Published: 2018-09-03

Total Pages: 616

ISBN-13: 1420046098

DOWNLOAD EBOOK

Now available in a three-volume set, this updated and expanded edition of the bestselling The Digital Signal Processing Handbook continues to provide the engineering community with authoritative coverage of the fundamental and specialized aspects of information-bearing signals in digital form. Encompassing essential background material, technical details, standards, and software, the second edition reflects cutting-edge information on signal processing algorithms and protocols related to speech, audio, multimedia, and video processing technology associated with standards ranging from WiMax to MP3 audio, low-power/high-performance DSPs, color image processing, and chips on video. Drawing on the experience of leading engineers, researchers, and scholars, the three-volume set contains 29 new chapters that address multimedia and Internet technologies, tomography, radar systems, architecture, standards, and future applications in speech, acoustics, video, radar, and telecommunications. This volume, Video, Speech, and Audio Signal Processing and Associated Standards, provides thorough coverage of the basic foundations of speech, audio, image, and video processing and associated applications to broadcast, storage, search and retrieval, and communications.


Speech Processing in the Auditory System

Speech Processing in the Auditory System

Author: Steven Greenberg

Publisher: Springer Science & Business Media

Published: 2006-05-09

Total Pages: 487

ISBN-13: 0387215751

DOWNLOAD EBOOK

Although speech is the primary behavioral medium by which humans communicate, its auditory basis is poorly understood, having profound implications on efforts to ameliorate the behavioral consequences of hearing impairment and on the development of robust algorithms for computer speech recognition. In this volume, the authors provide an up-to-date synthesis of recent research in the area of speech processing in the auditory system, bringing together a diverse range of scientists to present the subject from an interdisciplinary perspective. Of particular concern is the ability to understand speech in uncertain, potentially adverse acoustic environments, currently the bane of both hearing aid and speech recognition technology. There is increasing evidence that the perceptual stability characteristic of speech understanding is due, at least in part, to elegant transformations of the acoustic signal performed by auditory mechanisms. As a comprehensive review of speech's auditory basis, this book will interest physiologists, anatomists, psychologists, phoneticians, computer scientists, biomedical and electrical engineers, and clinicians.