Excellent textbook of multimedia signal processing also dealing with the optimization of multimedia communication systems. It covers the theoretical background of one- and multidimensional signal processing, statistical analysis and modelling, coding and information theory as well as estimation and classification theory.
Multimedia signals include different data types (text, sound, graphics, picture, animations, video, etc.), which can be time-dependent (sound, video and animation) or spatially-dependent (images, text and graphics). Hence, the multimedia systems represent an interdisciplinary cross-section of the following areas: digital signal processing, computer architecture, computer networks and telecommunications. Multimedia Signals and Systems is an introductory text, designed for students or professionals and researchers in other fields, with a need to learn the basics of signals and systems. A considerable emphasis is placed on the analysis and processing of multimedia signals (audio, images, video). Additionally, the book connects these principles to other important elements of multimedia systems such as the analysis of optical media, computer networks, QoS, and digital watermarking.
"This book offers an in-depth explanation of multimedia technologies within their many specific application areas as well as presenting developing trends for the future"--Provided by publisher.
Multimedia Content Analysis: Theory and Applications covers the latest in multimedia content analysis and applications based on such analysis. As research has progressed, it has become clear that this field has to appeal to other disciplines such as psycho-physics, media production, etc. This book consists of invited chapters that cover the entire range of the field. Some of the topics covered include low-level audio-visual analysis based retrieval and indexing techniques, the TRECVID effort, video browsing interfaces, content creation and content analysis, and multimedia analysis-based applications, among others. The chapters are written by leading researchers in the multimedia field.
-Presents state-of-the-art in visual media retrieval. -Coverage of adaptive content-based retrieval systems and techniques in image and video database applications. -Includes a novel machine-controlled interactive retrieval (MCIR) method that optimizes image search in distributed digital libraries over the Internet.
This book is the condensed result of an extensive European project developing the future of 3D-Television. The book describes the state of the art in relevant topics: Capture of 3D scene for input to 3DTV system; Abstract representation of captured 3D scene information in digital form; Specifying data exchange format; Transmission of coded data; Conversion of 3DTV data for holographic and other displays; Equipment to decode and display 3DTV signal.
This is the proceedings of the 11th International Workshop on Structural and Syntactic Pattern Recognition, SSPR 2006 and the 6th International Workshop on Statistical Techniques in Pattern Recognition, SPR 2006, held in Hong Kong, August 2006 alongside the Conference on Pattern Recognition, ICPR 2006. 38 revised full papers and 61 revised poster papers are included, together with 4 invited papers covering image analysis, character recognition, bayesian networks, graph-based methods and more.
Foundations of Voice and Speech Quality Perception starts out with the fundamental question of: "How do listeners perceive voice and speech quality and how can these processes be modeled?" Any quantitative answers require measurements. This is natural for physical quantities but harder to imagine for perceptual measurands. This book approaches the problem by actually identifying major perceptual dimensions of voice and speech quality perception, defining units wherever possible and offering paradigms to position these dimensions into a structural skeleton of perceptual speech and voice quality. The emphasis is placed on voice and speech quality assessment of systems in artificial scenarios. Many scientific fields are involved. This book bridges the gap between two quite diverse fields, engineering and humanities, and establishes the new research area of Voice and Speech Quality Perception.
Telecommunication systems and human-machine interfaces have begun using multiple microphones and loudspeakers to render interaction more lifelike, and more efficient. This raises acoustic signal processing problems under multiple-input multiple-output (MIMO) scenarios, encompassing distant speech acquisition, sound source localization and tracking, echo and noise control, source separation and speech dereverberation, and many others. The book opens with an acoustic MIMO paradigm, establishing fundamentals, and linking acoustic MIMO signal processing with classical signal processing and communication theories. The second part of the book presents a novel analysis of acoustic applications carried out in the paradigm to reinforce the fundamentals of acoustic MIMO signal processing.