This book presents the outcomes of the 9th International Workshop on Spoken Dialogue Systems (IWSDS), “Towards creating more human-like conversational agent technologies”. It compiles and provides a synopsis of current global research to push forward the state of the art in dialogue technologies, including advances in the context of the classical problems of language understanding, dialogue management and language generation, as well as cognitive topics related to the human nature of conversational phenomena, such as humor, empathy and social context understanding and awareness.
Get a broad overview of the different modalities of immersive video technologies—from omnidirectional video to light fields and volumetric video—from a multimedia processing perspective. From capture to representation, coding, and display, video technologies have been evolving significantly and in many different directions over the last few decades, with the ultimate goal of providing a truly immersive experience to users. After setting up a common background for these technologies, based on the plenoptic function theoretical concept, Immersive Video Technologies offers a comprehensive overview of the leading technologies enabling visual immersion, including omnidirectional (360 degrees) video, light fields, and volumetric video. Following the critical components of the typical content production and delivery pipeline, the book presents acquisition, representation, coding, rendering, and quality assessment approaches for each immersive video modality. The text also reviews current standardization efforts and explores new research directions. With this book the reader will a) gain a broad understanding of immersive video technologies that use three different modalities: omnidirectional video, light fields, and volumetric video; b) learn about the most recent scientific results in the field, including the recent learning-based methodologies; and c) understand the challenges and perspectives for immersive video technologies. - Describes the whole content processing chain for the main immersive video modalities (omnidirectional video, light fields, and volumetric video) - Offers a common theoretical background for immersive video technologies based on the concept of plenoptic function - Presents some exemplary applications of immersive video technologies
The two-volume set LNCS 11295 and 11296 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2019, held in Thessaloniki, Greece, in January 2019. Of the 172 submitted full papers, 49 were selected for oral presentation and 47 for poster presentation; in addition, 6 demonstration papers, 5 industry papers, 6 workshop papers, and 6 papers for the Video Browser Showdown 2019 were accepted. All papers presented were carefully reviewed and selected from 204 submissions.
This 8-volumes set constitutes the refereed of the 25th International Conference on Pattern Recognition Workshops, ICPR 2020, held virtually in Milan, Italy and rescheduled to January 10 - 11, 2021 due to Covid-19 pandemic. The 416 full papers presented in these 8 volumes were carefully reviewed and selected from about 700 submissions. The 46 workshops cover a wide range of areas including machine learning, pattern analysis, healthcare, human behavior, environment, surveillance, forensics and biometrics, robotics and egovision, cultural heritage and document analysis, retrieval, and women at ICPR2020.
This book constitutes the conference proceedings of the 9th Pacific Rim Symposium on Image and Video Technology, PSIVT 2019, held in Sydney, NSW, Australia, in November 2019. A total of 31 papers were carefully reviewed and selected from 55 submissions. The main conference comprises 11 major subject areas that span the field of image and video technology, namely imaging and graphics hardware and visualization, image/video coding and transmission, image/video processing and analysis, image/video retrieval and scene understanding, applications of image and video technology, biomedical image processing and analysis, biometrics and image forensics, computational photography and arts, computer and robot vision, pattern recognition, and video surveillance.
This book focuses on the challenges and the recent findings in vision intelligence incorporating high performance computing applications. The contents provide in-depth discussions on a range of emerging multidisciplinary topics like computer vision, image processing, artificial intelligence, machine learning, cloud computing, IoT, and big data. The book also includes illustrations of algorithms, architecture, applications, software systems, and data analytics within the scope of the discussed topics. This book will help students, researchers, and technology professionals discover latest trends in the fields of computer vision and artificial intelligence.
Enables readers to understand the latest developments in speech enhancement/transmission due to advances in computational power and device miniaturization The Second Edition of Digital Speech Transmission and Enhancement has been updated throughout to provide all the necessary details on the latest advances in the theory and practice in speech signal processing and its applications, including many new research results, standards, algorithms, and developments which have recently appeared and are on their way into state-of-the-art applications. Besides mobile communications, which constituted the main application domain of the first edition, speech enhancement for hearing instruments and man-machine interfaces has gained significantly more prominence in the past decade, and as such receives greater focus in this updated and expanded 2nd edition. In the Second Edition of Digital Speech Transmission and Enhancement, readers can expect to find information and novel methods on: Low-latency spectral analysis-synthesis, single-channel and dual-channel algorithms for noise reduction and dereverberation. Multi-microphone processing methods, which are now widely used in applications such as mobile phones, hearing aids, and man-computer interfaces. Algorithms for near-end listening enhancement, which provide a significantly increased speech intelligibility for users at the noisy receiving side of their mobile phone. Fundamentals of speech signal processing, estimation and machine learning, speech coding, error concealment by soft decoding, and artificial bandwidth extension of speech signals Digital Speech Transmission and Enhancement is a single-source, comprehensive guide to the fundamental issues, algorithms, standards, and trends in speech signal processing and speech communication technology, and as such is an invaluable resource for engineers, researchers, academics, and graduate students in the areas of communications, electrical engineering, and information technology.
The three-volume set LNCS 101164, 11165, and 11166 constitutes the refereed proceedings of the 19th Pacific-Rim Conference on Multimedia, PCM 2018, held in Hefei, China, in September 2018. The 209 regular papers presented together with 20 special session papers were carefully reviewed and selected from 452 submissions. The papers cover topics such as: multimedia content analysis; multimedia signal processing and communications; and multimedia applications and services.
This volume provides methods on the study of the systems of the brain. Chapters are divided into four parts covering; discriminative touch, proprioception and kinaesthesis, affective touch, individual differences due to atypical development, ageing, illusions and sensory substitution, microneurography, electrophysiology, brain imaging, and brain stimulation. In Neuromethods series style, chapters include the kind of detail and key advice from the specialists needed to get successful results in your research center and clinical investigation. Thorough and comprehensive, Somatosensory Research Methods aims to be comprehensive guide for researchers.