Speaker Classification I

Speaker Classification I

Author: Christian Müller

Publisher: Springer

Published: 2007-08-28

Total Pages: 363

ISBN-13: 354074200X

DOWNLOAD EBOOK

This volume and its companion volume LNAI 4441 constitute a state-of-the-art survey in the field of speaker classification. Together they address such intriguing issues as how speaker characteristics are manifested in voice and speaking behavior. The nineteen contributions in this volume are organized into topical sections covering fundamentals, characteristics, applications, methods, and evaluation.


Speaker Classification II

Speaker Classification II

Author: C. Müller

Publisher: Springer

Published: 2007-09-13

Total Pages: 317

ISBN-13: 3540741224

DOWNLOAD EBOOK

This two-volume set constitutes a state-of-the-art survey in the field of speaker classification, addressing many critical questions. The twenty-two articles of the second volume cover a number of areas, including gender recognition systems, emotion recognition, text-dependent speaker verification systems, an analysis of both speaker and verbal content information, and accent identification.


Group Decision Making under Multiple Criteria

Group Decision Making under Multiple Criteria

Author: Ching-Lai Hwang

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 416

ISBN-13: 3642615805

DOWNLOAD EBOOK

This monograph is intended for an advanced undergraduate or graduate course of engineering and management science. as well as for persons in business. industry. military or in any field. who want an introductory and a capsule look into the methods of group decision making under multiple criteria. This is a sequel to our previous works entitled "Multiple Objective Decision Making--Methods and Applications (No. 164 of the Lecture Notes). and "Multiple Attribute Decision Making--Methods and Applications (No. 186 of the Lecture Notes). Moving from a single decision maker (the consideration of Lecture Notes 164 and 186) to a multiple decision maker setting introduces a great deal of complexity into the analysis. The problem is no longer the selection of the most preferred alternative among the nondominated solutions according to one individual's (single decision maker's) preference structure. The analysis is extended to account for the conflicts among different interest groups who have different objectives. goals. and so forth. Group decision making under multiple criteria includes such diverse and interconnected fields as preference analysis. utility theory. social choice theory. committee decision theory. theory of voting. game theory. expert evaluation analysis. aggregation of qualitative factors. economic equilibrium theory. etc; these are simplified and systematically classified for beginners. This work is to provide readers with a capsule look into the existing methods. their characteristics. and applicability in the complexity of group decision making.


Speaker Classification II

Speaker Classification II

Author: C. Müller

Publisher: Springer Science & Business Media

Published: 2007-08-15

Total Pages: 317

ISBN-13: 3540741216

DOWNLOAD EBOOK

This two-volume set constitutes a state-of-the-art survey in the field of speaker classification, addressing many critical questions. The twenty-two articles of the second volume cover a number of areas, including gender recognition systems, emotion recognition, text-dependent speaker verification systems, an analysis of both speaker and verbal content information, and accent identification.


The Native Speaker Concept

The Native Speaker Concept

Author: Neriko Musha Doerr

Publisher: Walter de Gruyter

Published: 2009

Total Pages: 401

ISBN-13: 3110220946

DOWNLOAD EBOOK

Presents a fresh look at the 'native speaker' by situating him/her in wider sociopolitical contexts. Using anthropological frameworks and ethnographic data from around the world, this book addresses the questions of who qualifies as a 'native speaker' and his/her social relations in the regime of standardization in multilingual situations.


Fundamentals of Speaker Recognition

Fundamentals of Speaker Recognition

Author: Homayoon Beigi

Publisher: Springer Science & Business Media

Published: 2011-12-09

Total Pages: 984

ISBN-13: 0387775927

DOWNLOAD EBOOK

An emerging technology, Speaker Recognition is becoming well-known for providing voice authentication over the telephone for helpdesks, call centres and other enterprise businesses for business process automation. "Fundamentals of Speaker Recognition" introduces Speaker Identification, Speaker Verification, Speaker (Audio Event) Classification, Speaker Detection, Speaker Tracking and more. The technical problems are rigorously defined, and a complete picture is made of the relevance of the discussed algorithms and their usage in building a comprehensive Speaker Recognition System. Designed as a textbook with examples and exercises at the end of each chapter, "Fundamentals of Speaker Recognition" is suitable for advanced-level students in computer science and engineering, concentrating on biometrics, speech recognition, pattern recognition, signal processing and, specifically, speaker recognition. It is also a valuable reference for developers of commercial technology and for speech scientists. Please click on the link under "Additional Information" to view supplemental information including the Table of Contents and Index.


Robust Speaker Recognition in Noisy Environments

Robust Speaker Recognition in Noisy Environments

Author: K. Sreenivasa Rao

Publisher: Springer

Published: 2014-06-21

Total Pages: 149

ISBN-13: 3319071300

DOWNLOAD EBOOK

This book discusses speaker recognition methods to deal with realistic variable noisy environments. The text covers authentication systems for; robust noisy background environments, functions in real time and incorporated in mobile devices. The book focuses on different approaches to enhance the accuracy of speaker recognition in presence of varying background environments. The authors examine: (a) Feature compensation using multiple background models, (b) Feature mapping using data-driven stochastic models, (c) Design of super vector- based GMM-SVM framework for robust speaker recognition, (d) Total variability modeling (i-vectors) in a discriminative framework and (e) Boosting method to fuse evidences from multiple SVM models.


A Framework for Enhancing Speaker Age and Gender Classification by Using a New Feature Set and Deep Neural Network Architectures

A Framework for Enhancing Speaker Age and Gender Classification by Using a New Feature Set and Deep Neural Network Architectures

Author: Arafat Abumallouh

Publisher:

Published: 2016

Total Pages: 96

ISBN-13: 9780355631753

DOWNLOAD EBOOK

Abstract : Speaker age and gender classification is one of the most challenging problems in speech processing. Recently with developing technologies, identifying a speaker age and gender has become a necessity for speaker verification and identification systems such as identifying suspects in criminal cases, improving human-machine interaction, and adapting music for awaiting people queue. Although many studies have been carried out focusing on feature extraction and classifier design for improvement, classification accuracies are still not satisfactory. The key issue in identifying speaker’s age and gender is to generate robust features and to design an in-depth classifier. Age and gender information is concealed in speaker’s speech, which is liable for many factors such as, background noise, speech contents, and phonetic divergences.In this work, different methods are proposed to enhance the speaker age and gender classification based on the deep neural networks (DNNs) as a feature extractor and classifier. First, a model for generating new features from a DNN is proposed. The proposed method uses the Hidden Markov Model toolkit (HTK) tool to find tied-state triphones for all utterances, which are used as labels for the output layer in the DNN. The DNN with a bottleneck layer is trained in an unsupervised manner for calculating the initial weights between layers, then it is trained and tuned in a supervised manner to generate transformed mel-frequency cepstral coefficients (T-MFCCs). Second, the shared class labels method is introduced among misclassified classes to regularize the weights in DNN. Third, DNN-based speakers models using the SDC feature set is proposed. The speakers-aware model can capture the characteristics of the speaker age and gender more effectively than a model that represents a group of speakers. In addition, AGender-Tune system is proposed to classify the speaker age and gender by jointly fine-tuning two DNN models; the first model is pre-trained to classify the speaker age, and second model is pre-trained to classify the speaker gender. Moreover, the new T-MFCCs feature set is used as the input of a fusion model of two systems. The first system is the DNN-based class model and the second system is the DNN-based speaker model. Utilizing the T-MFCCs as input and fusing the final score with the score of a DNN-based class model enhanced the classification accuracies. Finally, the DNN-based speaker models are embedded into an AGender-Tune system to exploit the advantages of each method for a better speaker age and gender classification.The experimental results on a public challenging database showed the effectiveness of the proposed methods for enhancing the speaker age and gender classification and achieved the state of the art on this database.