Speech and Computer

Author: Alexey Karpov

Publisher: Springer Nature

Published: 2023-12-23

Total Pages: 587

ISBN-10: 303148312X

The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.


Prosodic Studies

Author: Hongming Zhang

Publisher: Taylor & Francis

Published: 2019-09-23

Total Pages: 391

ISBN-10: 1351212869

Prosody is one of the core components of language and speech, conveying information about syntax, turn-taking in conversation, and utterance types such as questions or statements, as well as speakers' attitudes and feelings. This edited volume presents studies of prosody in Asian languages alongside examples from other languages. It brings together the most recent research in the field and also charts its influence on such diverse fields as multimedia communication and SLA. Intended for a wide audience of linguists and researchers in neighbouring disciplines, such as computational scientists, psycholinguists, and specialists in language acquisition, Prosodic Studies is also ideal for scholars and researchers working in intonation who want complementary information on specifics.


Advances in Signal Processing and Intelligent Recognition Systems

Author: Sabu M. Thampi

Publisher: Springer

Published: 2017-09-12

Total Pages: 471

ISBN-10: 3319679341

This edited volume gathers a selection of refereed and revised papers originally presented at the Third International Symposium on Signal Processing and Intelligent Recognition Systems (SIRS’17), held on September 13–16, 2017 in Manipal, India. The papers offer stimulating insights into biometrics, digital watermarking, recognition systems, image and video processing, signal and speech processing, pattern recognition, machine learning and knowledge-based systems. Taken together, they offer a valuable resource for all researchers and scientists engaged in the various fields of signal processing and related areas.


New Approaches for Multidimensional Signal Processing

Author: Roumen Kountchev

Publisher: Springer Nature

Published: 2022-12-02

Total Pages: 287

ISBN-10: 9811978425

This book is a collection of papers presented at the International Workshop on New Approaches for Multidimensional Signal Processing (NAMSP 2022), held at Technical University of Sofia, Sofia, Bulgaria, during 23–25 June 2022. The book covers research papers in the field of N-dimensional multicomponent image processing, multidimensional image representation and super-resolution, 3D image processing and reconstruction, MD computer vision systems, multidimensional multimedia systems, neural networks for MD image processing, data-based MD image retrieval and knowledge data mining, watermarking, hiding and encryption of MD images, MD image processing in robot systems, tensor-based data processing, 3D and multi-view visualization, forensic analysis systems for MD images and many more.


Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments

Author: Xiao-Lei Zhang

Publisher: Elsevier

Published: 2024-09-04

Total Pages: 282

ISBN-10: 0443248575

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments provides a detailed discussion of deep learning-based robust speech processing and its applications. The book begins with the basics of deep learning and common deep network models, followed by front-end algorithms for deep learning-based speech denoising, speech detection, single-channel speech enhancement, multi-channel speech enhancement, and multi-speaker speech separation, and then the applications of deep learning-based speech denoising in speaker verification and speech recognition.

- Provides a comprehensive introduction to the development of deep learning-based robust speech processing
- Covers speech detection, speech enhancement, dereverberation, multi-speaker speech separation, robust speaker verification, and robust speech recognition
- Opens with a historical overview and then covers methods that demonstrate outstanding performance in practical applications


ICDSMLA 2020

Author: Amit Kumar

Publisher: Springer Nature

Published: 2021-11-08

Total Pages: 1600

ISBN-10: 9811636907

This book gathers selected high-impact articles from the 2nd International Conference on Data Science, Machine Learning & Applications 2020. It highlights the latest developments in the areas of artificial intelligence, machine learning, soft computing, human–computer interaction and various data science and machine learning applications. It brings together scientists and researchers from different universities and industries around the world to showcase a broad range of perspectives, practices and technical expertise.


Deep Learning Approaches for Spoken and Natural Language Processing

Author: Virender Kadyan

Publisher: Springer Nature

Published: 2022-01-01

Total Pages: 171

ISBN-10: 3030797783

This book provides insights into how deep learning techniques impact language and speech processing applications. The authors discuss the promise, the limits and the new challenges of deep learning. The book covers the major differences between the various applications of deep learning and classical machine learning techniques. Its main objective is to present a comprehensive survey of the major applications and research-oriented articles based on deep learning techniques that focus on natural language and speech signal processing. The book is relevant to academicians, research scholars, industrial experts, scientists and postgraduate students who work in the field of speech signal and natural language processing and would like to add deep learning to enhance the capabilities of their work. It discusses current research challenges and future perspectives on how deep learning techniques can be applied to improve NLP and speech processing applications; presents the research trends and future directions of language and speech processing; and includes theoretical research, experimental results, and applications of deep learning.


Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges

Author: Jean-Jacques Rousseau

Publisher: Springer Nature

Published: 2023-07-29

Total Pages: 723

ISBN-10: 3031376609

This 4-volume set constitutes the proceedings of the workshops of the 26th International Conference on Pattern Recognition, ICPR 2022, held in Montreal, QC, Canada, in August 2022. The 167 full papers presented in these 4 volumes were carefully reviewed and selected from numerous submissions. The ICPR workshops covered domains related to pattern recognition, artificial intelligence, computer vision, and image and sound analysis. The workshops’ contributions reflected the most recent applications related to healthcare, biometrics, ethics, multimodality, cultural heritage, imagery, affective computing, etc.