Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing

Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing

Author: Tong Zhang

Publisher: Springer Science & Business Media

Published: 2013-03-09

Total Pages: 145

ISBN-13: 1475733399

DOWNLOAD EBOOK

Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing is an up-to-date overview of audio and video content analysis. Included is extensive treatment of audiovisual data segmentation, indexing and retrieval based on multimodal media content analysis, and content-based management of audio data. In addition to the commonly studied audio types such as speech and music, the authors have included hybrid types of sounds that contain more than one kind of audio component such as speech or environmental sound with music in the background. Emphasis is also placed on semantic-level identification and classification of environmental sounds. The authors introduce a new generic audio retrieval system on top of the audio archiving schemes. Both theoretical analysis and implementation issues are presented. The developing MPEG-7 standards are explored. Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing will be especially useful to researchers and graduate level students designing and developing fully functional audiovisual systems for audio/video content parsing of multimedia streams.


Adaptive Multimedia Retrieval. Context, Exploration and Fusion

Adaptive Multimedia Retrieval. Context, Exploration and Fusion

Author: Marcin Detyniecki

Publisher: Springer

Published: 2012-01-06

Total Pages: 230

ISBN-13: 3642271693

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 8th International Conference on Adaptive Multimedia Retrieval, AMR 2010, held in Linz, Austria, in August 2010. The 14 revised full papers and the invited contribution presented were carefully reviewed and selected from numerous submissions. The papers are organized in topical sections on Context-based personalization; media information fusion; video retrieval; audio and music retrieval; adaptive similarities; and finding and organizing.


Advances in Computers

Advances in Computers

Author: Marvin Zelkowitz

Publisher: Academic Press

Published: 2010-03-02

Total Pages: 368

ISBN-13: 0123810205

DOWNLOAD EBOOK

This is volume 78 of Advances in Computers. This series, which began publication in 1960, is the oldest continuously published anthology that chronicles the ever- changing information technology field. In these volumes we publish from 5 to 7 chapters, three times per year, that cover the latest changes to the design, development, use and implications of computer technology on society today. Covers the full breadth of innovations in hardware, software, theory, design, and applications. Many of the in-depth reviews have become standard references that continue to be of significant, lasting value in this rapidly expanding field.


Distributed Multimedia Database Technologies Supported by MPEG-7 and MPEG-21

Distributed Multimedia Database Technologies Supported by MPEG-7 and MPEG-21

Author: Harald Kosch

Publisher: CRC Press

Published: 2003-11-24

Total Pages: 276

ISBN-13: 0203009339

DOWNLOAD EBOOK

A multimedia system needs a mechanism to communicate with its environment, the Internet, clients, and applications. MPEG-7 provides a standard metadata format for global communication, but lacks the framework to let the various players in a system interact. MPEG-21 closes this gap by establishing an infrastructure for a distributed multimedia frame


Information and Communication Technology for Intelligent Systems

Information and Communication Technology for Intelligent Systems

Author: Suresh Chandra Satapathy

Publisher: Springer

Published: 2018-12-14

Total Pages: 743

ISBN-13: 981131747X

DOWNLOAD EBOOK

The book gathers papers addressing state-of-the-art research in all areas of Information and Communication Technologies and their applications in intelligent computing, cloud storage, data mining and software analysis. It presents the outcomes of the third International Conference on Information and Communication Technology for Intelligent Systems, which was held on April 6–7, 2018, in Ahmedabad, India. Divided into two volumes, the book discusses the fundamentals of various data analytics and algorithms, making it a valuable resource for researchers’ future studies.


Multimedia Information Extraction

Multimedia Information Extraction

Author: Mark T. Maybury

Publisher: John Wiley & Sons

Published: 2012-07-11

Total Pages: 436

ISBN-13: 111821952X

DOWNLOAD EBOOK

The advent of increasingly large consumer collections of audio (e.g., iTunes), imagery (e.g., Flickr), and video (e.g., YouTube) is driving a need not only for multimedia retrieval but also information extraction from and across media. Furthermore, industrial and government collections fuel requirements for stock media access, media preservation, broadcast news retrieval, identity management, and video surveillance. While significant advances have been made in language processing for information extraction from unstructured multilingual text and extraction of objects from imagery and video, these advances have been explored in largely independent research communities who have addressed extracting information from single media (e.g., text, imagery, audio). And yet users need to search for concepts across individual media, author multimedia artifacts, and perform multimedia analysis in many domains. This collection is intended to serve several purposes, including reporting the current state of the art, stimulating novel research, and encouraging cross-fertilization of distinct research disciplines. The collection and integration of a common base of intellectual material will provide an invaluable service from which to teach a future generation of cross disciplinary media scientists and engineers.


Radio Resource Management for Multimedia QoS Support in Wireless Networks

Radio Resource Management for Multimedia QoS Support in Wireless Networks

Author: Huan Chen

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 267

ISBN-13: 1461504694

DOWNLOAD EBOOK

Due to the great success and enormous impact of IP networks, In ternet access (such as sending and receiving e-mails) and web brows ing have become the ruling paradigm for next generation wireless systems. On the other hand, great technological and commercial success of services and applications is being witnessed in mobile wire less communications with examples of cellular, pes voice telephony and wireless LANs. The service paradigm has thus shifted from the conventional voice service to seamlessly integrated high quality mul timedia transmission over broadband wireless mobile networks. The multimedia content may include data, voice, audio, image, video and so on. With availability of more powerful portable devices, such as PDA, portable computer and cellular phone, coupled with the easier access to the core network (using a mobile device), the number of mobile users and the demand for multimedia-based applications is increasing rapidly. As a result, there is an urgent need for a sys tem that supports heterogeneous multimedia services and provides seamless access to the desired resources via wireless connections. Therefore, the convergence of multimedia communication and wireless mobile networking technologies into the next generation wireless multimedia (WMM) networks with the vision of "anytime, anywhere, anyform" information system is the certain trend in the foreseeable future. However, successful combination of these two technologies presents many challenges such as available spectral bandwidth, energy efficiency, seamless end-to-end communication, robustness, security, etc.


Automatic Classification and Indexing of Audio Broadcast Data

Automatic Classification and Indexing of Audio Broadcast Data

Author: P. Dhanalakshmi

Publisher:

Published: 2010

Total Pages: 0

ISBN-13:

DOWNLOAD EBOOK

Audio classification has been a focus area in the research of audio processing and pattern recognition. Automatic audio classification is very useful to audio indexing, content-based audio retrieval and online audio distribution, but the extraction of the most common and salient themes from unstructured raw audio data is a major challenge. The paper presents effective algorithms to automatically classify audio clips into one of the six classes: music, news, sports, advertisement, cartoon and movie. For these categories, a number of acoustic features that include linear predictive coefficients (LPC), linear predictive cepstral coefficients (LPCC) and Mel frequency cepstral coefficients (MFCC) are extracted to characterize the audio content. The auto associative neural network model (AANN) is used to capture the distribution of the acoustic feature vectors. The AANN model captures the distribution of the acoustic features of a class, and the back propagation learning algorithm is used to adjust the weights of the network to minimize the mean square error for each feature vector. This work also proposes an efficient audio indexing system which indexes movie clips using K-means clustering algorithm. Experimental results indicate that the proposed algorithms can produce satisfactory results.


Introduction to Video Search Engines

Introduction to Video Search Engines

Author: David C. Gibbon

Publisher: Springer Science & Business Media

Published: 2008-09-20

Total Pages: 282

ISBN-13: 3540793372

DOWNLOAD EBOOK

The evolution of technology has set the stage for the rapid growth of the video Web: broadband Internet access is ubiquitous, and streaming media protocols, systems, and encoding standards are mature. In addition to Web video delivery, users can easily contribute content captured on low cost camera phones and other consumer products. The media and entertainment industry no longer views these developments as a threat to their established business practices, but as an opportunity to provide services for more viewers in a wider range of consumption contexts. The emergence of IPTV and mobile video services offers unprecedented access to an ever growing number of broadcast channels and provides the flexibility to deliver new, more personalized video services. Highly capable portable media players allow us to take this personalized content with us, and to consume it even in places where the network does not reach. Video search engines enable users to take advantage of these emerging video resources for a wide variety of applications including entertainment, education and communications. However, the task of information extr- tion from video for retrieval applications is challenging, providing opp- tunities for innovation. This book aims to first describe the current state of video search engine technology and second to inform those with the req- site technical skills of the opportunities to contribute to the development of this field. Today’s Web search engines have greatly improved the accessibility and therefore the value of the Web.