New Time-frequency Domain Pitch Estimation Methods for Speech Signals Under Low Levels of SNR

Author: Celia Shahnaz

Publisher:

Published: 2009

ISBN-13:

The major objective of this research is to develop novel pitch estimation methods capable of handling speech signals in practical situations where only noise-corrupted speech observations are available. With this objective in mind, the estimation task is carried out using two different approaches. In the first approach, the noisy speech observations are directly employed to develop two new time-frequency domain pitch estimation methods. These methods are based on extracting a pitch-harmonic and finding the corresponding harmonic number required for pitch estimation. In the second approach, considering that voiced speech is the output of a vocal tract system driven by a sequence of pulses separated by the pitch period, an excitation-like signal (ELS) is first generated from the noisy speech or its noise-reduced version, instead of using the noisy speech directly for pitch estimation.

In the first approach, a harmonic cosine autocorrelation (HCAC) model of clean speech in terms of its pitch-harmonics is first introduced. In order to extract a pitch-harmonic, we propose an optimization technique based on least-squares fitting of the autocorrelation function (ACF) of the noisy speech to the HCAC model. In the proposed optimization, an initial estimate of the pitch-harmonic is obtained from the maximum peak of the smoothed FFT power spectrum. By exploiting the extracted pitch-harmonic along with the fast Fourier transform (FFT) based power spectrum of noisy speech, we then deduce a harmonic measure and a harmonic-to-noise-power ratio (HNPR) to determine the desired harmonic number of the extracted pitch-harmonic. In addition to the HCAC model, where the cross-product terms of different harmonics are neglected, we derive a compact yet accurate harmonic sinusoidal autocorrelation (HSAC) model for the clean speech signal. The new HSAC model is then used in the least-squares model-fitting optimization technique to extract a pitch-harmonic.

In the second approach, we first develop a pitch estimation method by using an excitation-like signal (ELS) generated from the noisy speech. To this end, a technique based on the principle of homomorphic deconvolution is proposed for extracting the vocal-tract system (VTS) parameters from the noisy speech, which are utilized to perform inverse filtering of the noisy speech to produce a residual signal (RS). In order to reduce the effect of noise on the RS, a noise-compensation scheme is introduced in the autocorrelation domain. The noise-compensated ACF of the RS is then employed to generate a squared Hilbert envelope (SHE) as the ELS of the voiced speech. With a view to further overcoming the adverse effect of noise on the ELS, a new symmetric normalized magnitude difference function of the ELS is proposed for eventual pitch estimation.

The cepstrum has been widely used in speech signal processing but has limited capability of handling noise. One potential solution could be the introduction of a noise reduction block prior to pitch estimation based on the conventional cepstrum, a framework already available in many practical applications, such as mobile communication and hearing aids. Motivated by the advantages of the existing framework and considering the superiority of our ELS to the speech itself in providing clues for pitch information, we develop a cepstrum-based pitch estimation method by using the ELS obtained from the noise-reduced speech. For this purpose, we propose a noise subtraction scheme in the frequency domain, which takes into account the possible cross-correlation between speech and noise and has the advantage that the noise estimate is updated with time and adjusted at each frame. The enhanced speech thus obtained is utilized to extract the VTS parameters via the homomorphic deconvolution technique. An RS is then produced by inverse-filtering the enhanced speech with the extracted VTS parameters. It is found that, unlike in the previous ELS-based method, the SHE computed from the RS of the enhanced speech without noise compensation is sufficient to represent an ELS. Finally, in order to tackle the undesirable effect of noise on the ELS at a very low SNR and overcome the limitation of the conventional cepstrum in handling different types of noise, a time-frequency domain pseudo cepstrum of the ELS of the enhanced speech, incorporating information from both the magnitude and phase spectra of the ELS, is proposed for pitch estimation. (Abstract shortened by UMI.)
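For readers unfamiliar with autocorrelation-based pitch estimation, the following is a minimal sketch of the conventional ACF peak-picking baseline. It is not the HCAC/HSAC least-squares model fitting described in the abstract. The function names, the 60 to 400 Hz search range, and the synthetic 200 Hz test frame are all assumptions made for illustration.

```python
import math


def autocorrelation(x, max_lag):
    """Biased ACF: r[k] = sum_n x[n] * x[n + k] for k = 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def estimate_pitch_acf(frame, fs, f_min=60.0, f_max=400.0):
    """Return a pitch estimate in Hz from the strongest ACF peak.

    The search is restricted to lags whose frequencies lie between
    f_min and f_max (an assumed range for voiced speech).
    """
    lag_min = int(fs / f_max)
    lag_max = int(fs / f_min)
    r = autocorrelation(frame, lag_max)
    best_lag = max(range(lag_min, lag_max + 1), key=lambda k: r[k])
    return fs / best_lag


# Hypothetical noise-free test frame: three harmonics of 200 Hz at 8 kHz.
fs = 8000
f0 = 200.0
frame = [
    sum(math.cos(2 * math.pi * h * f0 * n / fs) / h for h in (1, 2, 3))
    for n in range(480)
]
pitch = estimate_pitch_acf(frame, fs)
```

On clean, strictly periodic input the strongest ACF peak falls at the pitch period. With additive noise the peak can shift or a multiple of the period can win, which is the kind of failure the model-fitting and ELS-based methods in the abstract are meant to address.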


Green and Smart Technology with Sensor Applications

Author: Hyun-seob Cho

Publisher: Springer

Published: 2012-11-07

Total Pages: 429

ISBN-13: 3642352510

This book comprises the refereed proceedings of two International Conferences, on Green and Smart Technology, GST 2012, and on Sensor and Its Applications, SIA 2012, held in Jeju Island, Korea, in November/December 2012. The papers presented were carefully reviewed and selected from numerous submissions and focus on the various aspects of green and smart technology with sensor applications.


Time-Frequency Signal Analysis with Applications

Author: Ljubisa Stankovic

Publisher: Artech House

Published: 2014-05-10

Total Pages: 673

ISBN-13: 1608076520

"The culmination of more than twenty years of research, this authoritative resource provides you with a practical understanding of time-frequency signal analysis. The book offers in-depth coverage of critical concepts and principles, along with discussions on key applications in a wide range of signal processing areas, from communications and optics... to radar and biomedicine. Supported with over 140 illustrations and more than 1,700 equations, this detailed reference explores the topics you need to understand for your work in the field, such as Fourier analysis, linear time-frequency representations, quadratic time-frequency distributions, higher-order time-frequency representations, and analysis of non-stationary noisy signals. This unique book also serves as an excellent text for courses in this area, featuring numerous examples and problems at the end of each chapter."


Window Functions and Their Applications in Signal Processing

Author: K. M. M. Prabhu

Publisher: CRC Press

Published: 2018-09-03

Total Pages: 404

ISBN-13: 1466515848

Window functions (otherwise known as weighting functions, tapering functions, or apodization functions) are mathematical functions that are zero-valued outside the chosen interval. They are well established as a vital part of digital signal processing. Window Functions and Their Applications in Signal Processing presents an exhaustive and detailed account of window functions and their applications in signal processing, focusing on the areas of digital spectral analysis, design of FIR filters, pulse compression radar, and speech signal processing. Comprehensively reviewing previous research and recent developments, this book: provides suggestions on how to choose a window function for particular applications; discusses Fourier analysis techniques and pitfalls in the computation of the DFT; introduces window functions in the continuous-time and discrete-time domains; considers two implementation strategies of window functions, in the time domain and in the frequency domain; and explores well-known applications of window functions in the fields of radar, sonar, biomedical signal analysis, audio processing, and synthetic aperture radar.
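As a small illustration of what a tapering function does, here is a sketch of the symmetric Hann window applied to a short frame. It is not taken from the book; the helper names `hann_window` and `apply_window` and the 9-sample frame are assumptions chosen for the example.

```python
import math


def hann_window(length):
    """Symmetric Hann window: w[n] = 0.5 - 0.5 * cos(2*pi*n / (length - 1))."""
    if length < 2:
        # Degenerate case: a window of length 0 or 1 applies no taper.
        return [1.0] * length
    return [
        0.5 - 0.5 * math.cos(2.0 * math.pi * n / (length - 1))
        for n in range(length)
    ]


def apply_window(samples, window):
    """Multiply a frame by the window, sample by sample."""
    return [s * w for s, w in zip(samples, window)]


window = hann_window(9)
tapered = apply_window([1.0] * 9, window)  # constant frame, tapered to zero at both ends
```

The window rises smoothly from zero at the first sample to one at the centre and back to zero at the last sample, so a frame multiplied by it has no abrupt edges. That is what reduces spectral leakage when the frame is passed to a DFT.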


Index to IEEE Publications

Author: Institute of Electrical and Electronics Engineers

Publisher:

Published: 1979

Total Pages: 752

ISBN-13:

Issues for 1973- cover the entire IEEE technical literature.


The Estimation and Tracking of Frequency

Author: B. G. Quinn

Publisher: Cambridge University Press

Published: 2001-02-05

Total Pages: 282

ISBN-13: 9780521804462

This book presents practical techniques for estimating frequencies of signals. Includes Matlab code. For researchers.


Introduction to Digital Speech Processing

Author: Lawrence R. Rabiner

Publisher: Now Publishers Inc

Published: 2007

Total Pages: 212

ISBN-13: 1601980701

Provides the reader with a practical introduction to the wide range of important concepts that comprise the field of digital speech processing. Students of speech research and researchers working in the field can use this as a reference guide.


Speech Coding and Synthesis

Author: W. Bastiaan Kleijn

Publisher: Elsevier Science & Technology

Published: 1995

Total Pages: 784

ISBN-13:

The fields of speech coding and synthesis have developed rapidly over the last decade. Text-to-speech systems now produce reasonable quality speech, and currently available speech coders can transmit good quality speech at below 10 kb/s. This, in combination with the ever-increasing speed of microprocessors and signal processing hardware, has resulted in a large number of practical applications. These applications in turn have stimulated research, and the number of papers published on speech coding and synthesis has grown rapidly. The need to reflect periodically on such developments has inspired the publication of this book. Topics such as the effect of cross channel errors on coded speech and the determination of a proper pitch contour for synthesized speech are included. Readers unfamiliar with the fields of speech coding and speech synthesis, as well as those already working in these areas, will find the book of interest.