This book features selected research papers presented at the First International Conference on Computing, Communications, and Cyber-Security (IC4S 2019), held on 12–13 October 2019. The conference was organized by Northwest Group of Institutions, Punjab, India; Southern Federal University, Russia; and IAC Educational Trust, India, with KEC, Ghaziabad and ITS College, Ghaziabad as academic partners. The book includes innovative work from researchers, leading innovators and professionals in the areas of communication and network technologies, advanced computing technologies, data analytics and intelligent learning, the latest electrical and electronics trends, and security and privacy issues.
We are delighted to introduce the proceedings of the 13th edition of the European Alliance for Innovation (EAI) International Conference on Mobile Multimedia Communications (MOBIMEDIA 2020). The conference brought together researchers, developers and practitioners from around the world who are developing and leveraging the fields of multimedia coding, mobile communications and networking. Advancing these fields requires an interdisciplinary approach in which multimedia, networking and physical layer issues are addressed jointly. The research challenges that need to be carefully examined when designing new mobile media architectures include basic theories, key technologies and artificial intelligence for next-generation wireless communications; intelligent technologies for subspace learning and clustering of high-dimensional data; security and safety; communication networks and coding analysis; electromagnetic and media access control; D2D and IoT; multimedia platform and analysis; new energy and smart city; vision and image analysis; systems and applications; case studies and prediction; and educational applications. Great effort is also needed to design applications that take into account how the user perceives the overall quality of the provided service. Within this scope, MOBIMEDIA 2020 was intended to provide a unique international forum for researchers from industry and academia to study new technologies, applications and standards. Original unpublished contributions were solicited that can improve knowledge and practice in the integrated design of efficient technologies and the provision of advanced mobile multimedia applications.
Signal Processing and Machine Learning Theory, authored by world-leading experts, reviews the principles, methods and techniques of essential and advanced signal processing theory. These theories and tools are the driving engines of many current and emerging research topics and technologies, such as machine learning, autonomous vehicles, the internet of things, future wireless communications, and medical imaging.
- Provides quick tutorial reviews of important and emerging topics of research in signal processing-based tools
- Presents core principles in signal processing theory and shows their applications
- Discusses some emerging signal processing tools applied in machine learning methods
- References content on core principles, technologies, algorithms and applications
- Includes references to journal articles and other literature on which to build further, more specific, and detailed knowledge
Get a broad overview of the different modalities of immersive video technologies, from omnidirectional video to light fields and volumetric video, from a multimedia processing perspective. From capture to representation, coding, and display, video technologies have evolved significantly and in many different directions over the last few decades, with the ultimate goal of providing a truly immersive experience to users. After establishing a common background for these technologies, based on the theoretical concept of the plenoptic function, Immersive Video Technologies offers a comprehensive overview of the leading technologies enabling visual immersion, including omnidirectional (360-degree) video, light fields, and volumetric video. Following the critical components of the typical content production and delivery pipeline, the book presents acquisition, representation, coding, rendering, and quality assessment approaches for each immersive video modality. The text also reviews current standardization efforts and explores new research directions. With this book the reader will a) gain a broad understanding of immersive video technologies that use three different modalities: omnidirectional video, light fields, and volumetric video; b) learn about the most recent scientific results in the field, including recent learning-based methodologies; and c) understand the challenges and perspectives for immersive video technologies.
- Describes the whole content processing chain for the main immersive video modalities (omnidirectional video, light fields, and volumetric video)
- Offers a common theoretical background for immersive video technologies based on the concept of the plenoptic function
- Presents some exemplary applications of immersive video technologies
This book constitutes the proceedings of the 22nd International Conference on Speech and Computer, SPECOM 2020, held in St. Petersburg, Russia, in October 2020. The 65 papers presented were carefully reviewed and selected from 160 submissions. The papers present current research in the area of computer speech processing, including speech science, speech technology, natural language processing, human-computer interaction, language identification, multimedia processing, human-machine interaction, deep learning for audio processing, computational paralinguistics, affective computing, speech and language resources, speech translation systems, text mining and sentiment analysis, and voice assistants. Due to the COVID-19 pandemic, SPECOM 2020 was held as a virtual event.
An essential and unique bridge between the theories of signal processing, machine learning, and artificial intelligence (AI) in music, this book provides a holistic overview of foundational ideas in music, from the physical and mathematical properties of sound to symbolic representations. Combining signals and language models in one place, it explores how sound may be represented and manipulated by computer systems, and how our devices may come to recognize particular sonic patterns as musically meaningful or creative through the lens of information theory. The book introduces popular fundamental ideas in AI at a comfortable pace and gradually incorporates more complex discussions of implementations and their implications for musical creativity. Each chapter is accompanied by guided programming activities designed to familiarize readers with the practical implications of the theory discussed, without the frustrations of free-form coding. Surveying state-of-the-art methods in applications of deep neural networks to audio and sound computing, and offering a research perspective that suggests future challenges in music and AI research, this book appeals to students of AI and music as well as industry professionals in the fields of machine learning, music, and AI.
The five-volume set LNCS 14355, 14356, 14357, 14358 and 14359 constitutes the refereed proceedings of the 12th International Conference on Image and Graphics, ICIG 2023, held in Nanjing, China, during September 22–24, 2023. The 166 papers presented in the proceedings set were carefully reviewed and selected from 409 submissions. They are organized in topical sections as follows: computer vision and pattern recognition; computer graphics and visualization; compression, transmission, retrieval; artificial intelligence; biological and medical image processing; color and multispectral processing; computational imaging; multi-view and stereoscopic processing; multimedia security; surveillance and remote sensing; and virtual reality. ICIG is a biennial conference that focuses on innovative technologies for image, video and graphics processing and on fostering innovation, entrepreneurship, and networking. ICIG 2023 featured world-class plenary speakers, exhibits, and high-quality peer-reviewed oral and poster presentations.
This book constitutes the refereed proceedings of the 17th International Conference on Communications and Networking, ChinaCom 2022, held on November 19-20, 2022. Due to the COVID-19 pandemic, the conference was held virtually. The 31 full papers presented were carefully selected from 83 submissions. The papers are organized in topical sections on Signal Processing and Communication Optimization; Scheduling and Transmission Optimization; Network Communication Performance Enhancement; Deep Learning Applications and Optimization; Deep Learning and Network Performance Optimization; and Edge Computing and Artificial Intelligence Applications.