This book presents the outcomes of the 9th International Workshop on Spoken Dialogue Systems (IWSDS), “Towards creating more human-like conversational agent technologies”. It compiles and provides a synopsis of current global research to push forward the state of the art in dialogue technologies, including advances in the context of the classical problems of language understanding, dialogue management and language generation, as well as cognitive topics related to the human nature of conversational phenomena, such as humor, empathy and social context understanding and awareness.
Get a broad overview of the different modalities of immersive video technologies, from omnidirectional video to light fields and volumetric video, from a multimedia processing perspective. From capture to representation, coding, and display, video technologies have been evolving significantly and in many different directions over the last few decades, with the ultimate goal of providing a truly immersive experience to users. After setting up a common background for these technologies, based on the theoretical concept of the plenoptic function, Immersive Video Technologies offers a comprehensive overview of the leading technologies enabling visual immersion, including omnidirectional (360-degree) video, light fields, and volumetric video. Following the critical components of the typical content production and delivery pipeline, the book presents acquisition, representation, coding, rendering, and quality assessment approaches for each immersive video modality. The text also reviews current standardization efforts and explores new research directions. With this book the reader will a) gain a broad understanding of immersive video technologies that use three different modalities: omnidirectional video, light fields, and volumetric video; b) learn about the most recent scientific results in the field, including the recent learning-based methodologies; and c) understand the challenges and perspectives for immersive video technologies.
- Describes the whole content processing chain for the main immersive video modalities (omnidirectional video, light fields, and volumetric video)
- Offers a common theoretical background for immersive video technologies based on the concept of the plenoptic function
- Presents some exemplary applications of immersive video technologies
This book constitutes the conference proceedings of the 9th Pacific Rim Symposium on Image and Video Technology, PSIVT 2019, held in Sydney, NSW, Australia, in November 2019. A total of 31 papers were carefully reviewed and selected from 55 submissions. The main conference comprises 11 major subject areas that span the field of image and video technology, namely imaging and graphics hardware and visualization, image/video coding and transmission, image/video processing and analysis, image/video retrieval and scene understanding, applications of image and video technology, biomedical image processing and analysis, biometrics and image forensics, computational photography and arts, computer and robot vision, pattern recognition, and video surveillance.
The two-volume set LNCS 11295 and 11296 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2019, held in Thessaloniki, Greece, in January 2019. Of the 172 submitted full papers, 49 were selected for oral presentation and 47 for poster presentation; in addition, 6 demonstration papers, 5 industry papers, 6 workshop papers, and 6 papers for the Video Browser Showdown 2019 were accepted. All papers presented were carefully reviewed and selected from 204 submissions.
This 8-volume set constitutes the refereed proceedings of the 25th International Conference on Pattern Recognition Workshops, ICPR 2020, held virtually in Milan, Italy, and rescheduled to January 10-11, 2021 due to the COVID-19 pandemic. The 416 full papers presented in these 8 volumes were carefully reviewed and selected from about 700 submissions. The 46 workshops cover a wide range of areas including machine learning, pattern analysis, healthcare, human behavior, environment, surveillance, forensics and biometrics, robotics and egovision, cultural heritage and document analysis, retrieval, and women at ICPR2020.
This book focuses on the challenges and recent findings in vision intelligence incorporating high-performance computing applications. The contents provide in-depth discussions on a range of emerging multidisciplinary topics such as computer vision, image processing, artificial intelligence, machine learning, cloud computing, IoT, and big data. The book also includes illustrations of algorithms, architecture, applications, software systems, and data analytics within the scope of the discussed topics. This book will help students, researchers, and technology professionals discover the latest trends in the fields of computer vision and artificial intelligence.
This volume provides methods on the study of the systems of the brain. Chapters are divided into four parts covering: discriminative touch, proprioception and kinaesthesis, affective touch, individual differences due to atypical development, ageing, illusions and sensory substitution, microneurography, electrophysiology, brain imaging, and brain stimulation. In Neuromethods series style, chapters include the kind of detail and key advice from the specialists needed to get successful results in your research center and clinical investigation. Thorough and comprehensive, Somatosensory Research Methods aims to be a valuable guide for researchers.
The three-volume set LNCS 11164, 11165, and 11166 constitutes the refereed proceedings of the 19th Pacific-Rim Conference on Multimedia, PCM 2018, held in Hefei, China, in September 2018. The 209 regular papers presented together with 20 special session papers were carefully reviewed and selected from 452 submissions. The papers cover topics such as multimedia content analysis, multimedia signal processing and communications, and multimedia applications and services.
Brain-Computer Interface covers the research prospects and recent advancements in the brain-computer interface using deep learning. The brain-computer interface (BCI) is an emerging technology that is becoming more functional in practice. The aim is to establish, through experiences with electronic devices, a communication channel bridging the human neural networks within the brain to the external world. One such use is creating communication or control applications for locked-in patients who have no control over their bodies. Recently, the possible application areas have been expanding, from communication to marketing, recovery, care, mental state monitoring, and entertainment. Machine learning algorithms have advanced BCI technology in the last few decades, and performance standards have greatly improved in terms of classification accuracy. For BCI to be effective in the real world, however, some problems remain to be solved. Research focusing on deep learning is anticipated to bring solutions in this regard. Alongside the growth of BCI, deep learning has been applied in various fields such as computer vision and natural language processing, outperforming conventional approaches to machine learning. As a result, a significant number of researchers in engineering, technology, and other industries have shown interest in deep learning architectures such as the convolutional neural network (CNN), recurrent neural network (RNN), and generative adversarial network (GAN). Audience: Researchers and industrialists working in brain-computer interface, deep learning, machine learning, medical image processing, and electrical engineering, as well as data scientists and analysts, machine learning engineers, and information technologists.