Crowdsourcing for Speech Processing

Author: Maxine Eskenazi

Publisher: John Wiley & Sons

Published: 2013-02-15

Total Pages: 343

ISBN-10: 1118541251

Provides an insightful and practical introduction to crowdsourcing as a means of rapidly processing speech data. Intended for those who want to get started in the domain and learn how to set up a task, what interfaces are available, and how to assess the work, as well as for those who have already used crowdsourcing and want to create better tasks and obtain better assessments of the work of the crowd. It includes screenshots showing examples of good and poor interfaces, and case studies in speech processing tasks that go through the task creation process, review the options in the interface and in the choice of medium (MTurk or other), and explain the choices made. Addresses important aspects of this new technique that should be mastered before attempting a crowdsourcing application. Offers speech researchers the hope that they can spend much less time dealing with the data gathering/annotation bottleneck, leaving them to focus on the scientific issues. Readers will directly benefit from the book’s successful examples of how crowdsourcing was implemented for speech processing, discussions of interface and processing choices that worked and choices that didn’t, and guidelines on how to play and record speech over the internet, how to design tasks, and how to assess workers. Essential reading for researchers and practitioners in speech research groups involved in speech processing.


Influencing Factors in Speech Quality Assessment using Crowdsourcing

Author: Rafael Zequeira Jiménez

Publisher: Springer Nature

Published: 2022-04-04

Total Pages: 129

ISBN-10: 3030933105

This book evaluates the impact of relevant factors affecting the results of speech quality assessment studies carried out via crowdsourcing. The author describes how these factors relate to the test structure, the effect of environmental background noise, and the influence of language differences. He details multiple user-centered studies that were conducted to derive guidelines for the reliable collection of speech quality scores in crowdsourcing. Specifically, the studies address questions such as the optimal number of speech samples to include in a listening task, the influence of environmental background noise on speech quality ratings, methods for classifying background noise from web audio recordings, and the impact of language proficiency on the user's perception of speech quality. Ultimately, the results of these studies contributed to ITU-T Recommendation P.808, which defines the guidelines for conducting speech quality studies in crowdsourcing.


Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2017

Author: Aboul Ella Hassanien

Publisher: Springer

Published: 2017-08-30

Total Pages: 932

ISBN-10: 3319648616

This book gathers the proceedings of the 3rd International Conference on Advanced Intelligent Systems and Informatics 2017 (AISI2017), which took place in Cairo, Egypt from September 9 to 11, 2017. This international and interdisciplinary conference, which highlighted essential research and developments in the field of informatics and intelligent systems, was organized by the Scientific Research Group in Egypt (SRGE). The book’s content is divided into five main sections: Intelligent Language Processing, Intelligent Systems, Intelligent Robotics Systems, Informatics, and the Internet of Things.


Macrotask Crowdsourcing

Author: Vassillis-Javed Khan

Publisher: Springer

Published: 2019-08-06

Total Pages: 279

ISBN-10: 3030123340

Crowdsourcing is an emerging paradigm that promises to transform several domains: creative work, business work, cultural cooperation, etc. Crowdsourcing reflects the close-knit interplay between the latest computer technologies, the rapidly changing work model of the 21st century, and the very nature of people. This interplay makes for an exciting but at the same time challenging new field to investigate through the lens of a diverse set of disciplines, ranging from the technical to the social and from the theoretical to the applied. Early research has focused on an aspect of crowdsourcing known as micro-tasking. Micro-tasks are simple tasks (like image annotations) that anyone could perform. An emerging area is how to use crowdsourcing to solve problems that go beyond simple tasks towards more complex ones that require collaboration and creativity. In juxtaposition to micro-task crowdsourcing, this book investigates macro-task crowdsourcing and its potential.


Text, Speech, and Dialogue

Author: Ivan Habernal

Publisher: Springer

Published: 2013-08-17

Total Pages: 617

ISBN-10: 3642405851

This book constitutes the refereed proceedings of the 16th International Conference on Text, Speech and Dialogue, TSD 2013, held in Pilsen, Czech Republic, in September 2013. The 65 papers presented together with 5 invited talks were carefully reviewed and selected from 148 submissions. The main topics of this year's conference were corpora, texts and transcription, speech analysis, recognition and synthesis, and their intertwining within NL dialogue systems. The topics also included speech recognition, corpora and language resources, speech and spoken language generation, tagging, classification and parsing of text and speech, semantic processing of text and speech, integrating applications of text and speech processing, as well as automatic dialogue systems, and multimodal techniques and modelling.


Evaluation in the Crowd. Crowdsourcing and Human-Centered Experiments

Author: Daniel Archambault

Publisher: Springer

Published: 2017-09-27

Total Pages: 200

ISBN-10: 3319664352

As the outcome of the Dagstuhl Seminar 15481 on Crowdsourcing and Human-Centered Experiments, this book is a primer for computer science researchers who intend to use crowdsourcing technology for human-centered experiments. The focus of this Dagstuhl seminar, held at Dagstuhl Castle in November 2015, was to discuss experiences and methodological considerations when using crowdsourcing platforms to run human-centered experiments to test the effectiveness of visual representations. The inspiring Dagstuhl atmosphere fostered discussions and brought together researchers from different research directions. The papers provide information on crowdsourcing technology and experimental methodologies, comparisons between crowdsourcing and lab experiments, the use of crowdsourcing for visualisation, psychology, QoE and HCI empirical studies, and finally the nature of crowdworkers and their work, their motivation and demographic background, as well as the relationships among people forming the crowdsourcing community.


Emerging Technologies in Data Mining and Information Security

Author: Ajith Abraham

Publisher: Springer

Published: 2018-09-01

Total Pages: 872

ISBN-10: 9811315019

The book features research papers presented at the International Conference on Emerging Technologies in Data Mining and Information Security (IEMIS 2018) held at the University of Engineering & Management, Kolkata, India, on February 23–25, 2018. It comprises high-quality research by academics and industrial experts in the field of computing and communication, including full-length papers, research-in-progress papers, and case studies related to all areas of data mining, machine learning, IoT and information security.


Advances in Speech and Language Technologies for Iberian Languages

Author: Alberto Abad

Publisher: Springer

Published: 2016-11-11

Total Pages: 296

ISBN-10: 3319491695

This book constitutes the refereed proceedings of the IberSPEECH 2016 Conference, held in Lisbon, Portugal, in November 2016. The 27 papers presented were carefully reviewed and selected from 48 submissions. The selected articles in this volume are organized into four topics: Speech Production, Analysis, Coding and Synthesis; Automatic Speech Recognition; Paralinguistic Speaker Trait Characterization; and Speech and Language Technologies in Different Application Fields.


Applications and Usability of Interactive TV

Author: María J. Abásolo

Publisher: Springer Nature

Published: 2022-12-16

Total Pages: 154

ISBN-10: 3031222105

This book constitutes thoroughly refereed and revised selected papers from the 10th Iberoamerican Conference on Applications and Usability of Interactive TV, jAUTI 2021, held in Sangolqui, Ecuador, during December 2–3, 2021. The 9 full papers included in this book were carefully reviewed and selected from 25 submissions. They are organized in topical sections as follows: usability and UX; interaction techniques and accessibility; and technologies, services, and applications for interactive digital TV.


The Oxford Handbook of Affective Computing

Author: Rafael A. Calvo

Publisher: Oxford Library of Psychology

Published: 2015

Total Pages: 625

ISBN-10: 0199942234

The Oxford Handbook of Affective Computing is a definitive reference in the burgeoning field of affective computing (AC), a multidisciplinary field encompassing computer science, engineering, psychology, education, neuroscience, and other disciplines. AC research explores how affective factors influence interactions between humans and technology, how affect sensing and affect generation techniques can inform our understanding of human affect, and how to design, implement, and evaluate systems involving affect at their core. The volume features 41 chapters and is divided into five sections: history and theory, detection, generation, methodologies, and applications. Section 1 begins with the making of AC and a historical review of the science of emotion. The following chapters discuss the theoretical underpinnings of AC from an interdisciplinary viewpoint. Section 2 examines affect detection or recognition, a commonly investigated area. Section 3 focuses on aspects of affect generation, including the synthesis of emotion and its expression via facial features, speech, postures, and gestures. Cultural issues are also discussed. Section 4 focuses on methodological issues in AC research, including data collection techniques, multimodal affect databases, formats for the representation of emotion, crowdsourcing techniques, machine learning approaches, affect elicitation techniques, useful AC tools, and ethical issues. Finally, Section 5 highlights applications of AC in such domains as formal and informal learning, games, robotics, virtual reality, autism research, health care, cyberpsychology, music, deception, and reflective writing. This compendium will prove suitable for use as a textbook and serve as a valuable resource for everyone with an interest in AC.