Deep Learning Based Speech Quality Prediction

Author: Gabriel Mittag

Publisher: Springer Nature

Published: 2022-02-24

Total Pages: 171

ISBN-10: 3030914798

This book shows how to apply recent deep learning methods to the task of speech quality prediction. The author provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment by predicting the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.


Speech and Computer

Author: Alexey Karpov

Publisher: Springer Nature

Published: 2023-12-23

Total Pages: 587

ISBN-10: 303148312X

The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.


Simulating Conversations for the Prediction of Speech Quality

Author: Thilo Michael

Publisher: Springer Nature

Published: 2023-06-30

Total Pages: 157

ISBN-10: 3031318447

This book discusses the simulation of conversations through a novel approach of predicting speech quality based on the interactions of two simulated interlocutors. The author describes the setup of a simulation environment that is capable of simulating human dialogue on the speech level. The impact of delay and bursty packet loss on VoIP conversations is investigated and modeled for use in the simulation. Based on parameters extracted from simulated conversations, the author proposes extensions to the E-model, a parametric model standardized by the International Telecommunication Union, in order to predict the quality of the simulated conversations. The author shows that predictions based on the simulated conversations outperform models that rely on the transmission parameters alone.


Speech Enhancement

Author: Philipos C. Loizou

Publisher: CRC Press

Published: 2013-02-25

Total Pages: 715

ISBN-10: 1466599227

With the proliferation of mobile devices and hearing devices, including hearing aids and cochlear implants, there is a growing and pressing need to design algorithms that can improve speech intelligibility without sacrificing quality. Responding to this need, Speech Enhancement: Theory and Practice, Second Edition introduces readers to the basic pr


Advances in Multimedia Modeling

Author: Susanne Boll

Publisher: Springer

Published: 2009-12-24

Total Pages: 822

ISBN-10: 364211301X

The 16th International Conference on Multimedia Modeling (MMM2010) was held in the famous mountain city Chongqing, China, January 6–8, 2010, and hosted by Southwest University. MMM is a leading international conference for researchers and industry practitioners to share their new ideas, original research results and practical development experiences from all multimedia related areas. MMM2010 attracted more than 160 regular, special session, and demo session submissions from 21 countries/regions around the world. All submitted papers were reviewed by at least two PC members or external reviewers, and most of them were reviewed by three reviewers. The review process was very selective. From the total of 133 submissions to the main track, 43 (32.3%) were accepted as regular papers, 22 (16.5%) as short papers. In all, 15 papers were received for three special sessions, which were by invitation only, and 14 submissions were received for a demo session, with 9 being selected. Authors of accepted papers come from 16 countries/regions. This volume of the proceedings contains the abstracts of three invited talks and all the regular, short, special session and demo papers. The regular papers were categorized into nine sections: 3D modeling; advanced video coding and adaptation; face, gesture and applications; image processing; image retrieval; learning semantic concepts; media analysis and modeling; semantic video concepts; and tracking and motion analysis. Three special sessions were video analysis and event recognition, cross-X multimedia mining in large scale, and mobile computing and applications. The technical program featured three invited talks, parallel oral presentation of all the accepted regular and special session papers, and poster sessions for short and demo papers.


Artificial Neural Networks and Machine Learning – ICANN 2023

Author: Lazaros Iliadis

Publisher: Springer Nature

Published: 2023-10-23

Total Pages: 559

ISBN-10: 3031441958

The 10-volume set LNCS 14254-14263 constitutes the proceedings of the 32nd International Conference on Artificial Neural Networks and Machine Learning, ICANN 2023, which took place in Heraklion, Crete, Greece, during September 26–29, 2023. The 426 full papers, 9 short papers and 9 abstract papers included in these proceedings were carefully reviewed and selected from 947 submissions. ICANN is a dual-track conference, featuring tracks in brain-inspired computing on the one hand, and machine learning on the other, with strong cross-disciplinary interactions and applications.


The 10th International Conference on Computer Engineering and Networks

Author: Qi Liu

Publisher: Springer Nature

Published: 2020-10-05

Total Pages: 1770

ISBN-10: 9811584621

This book contains a collection of the papers accepted by CENet2020, the 10th International Conference on Computer Engineering and Networks, held on October 16-18, 2020 in Xi’an, China. The topics include, but are not limited to, Internet of Things and Smart Systems, Artificial Intelligence and Applications, Communication System Detection, Analysis and Application, and Medical Engineering and Information Systems. Each part can be used as an excellent reference by industry practitioners, university faculties, research fellows and undergraduates as well as graduate students who need to build a knowledge base of the most current advances and state-of-practice in the topics covered by these conference proceedings. This will enable them to produce, maintain, and manage systems with high levels of trustworthiness and complexity.


Text, Speech, and Dialogue

Author: Kamil Ekštein

Publisher: Springer Nature

Published: 2023-08-22

Total Pages: 383

ISBN-10: 303140498X

This book constitutes the refereed proceedings of the 26th International Conference on Text, Speech, and Dialogue, TSD 2023, held in Pilsen, Czech Republic, during September 4–6, 2023. The 31 full papers presented together with the abstracts of 3 keynote talks were carefully reviewed and selected from 64 submissions. The conference attracts researchers not only from Central and Eastern Europe but also from other parts of the world. One of its goals has always been to bring together NLP researchers with various interests from different parts of the world and to promote their cooperation. One of the ambitions of the conference is not only to deal with dialogue systems but also to improve dialogue among researchers in areas of NLP, i.e., among the “text”, the “speech” and the “dialogue” people.