Layered Wyner-Ziv Video Coding for Noisy Channels

Layered Wyner-Ziv Video Coding for Noisy Channels

Author: Qian Xu

Publisher:

Published: 2005

Total Pages:

ISBN-13:

DOWNLOAD EBOOK

The growing popularity of video sensor networks and video celluar phones has generated the need for low-complexity and power-efficient multimedia systems that can handle multiple video input and output streams. While standard video coding techniques fail to satisfy these requirements, distributed source coding is a promising technique for "uplink" applications. Wyner-Ziv coding refers to lossy source coding with side information at the decoder. Based on recent theoretical result on successive Wyner-Ziv coding, we propose in this thesis a practical layered Wyner-Ziv video codec using the DCT, nested scalar quantizer, and irregular LDPC code based Slepian-Wolf coding (or lossless source coding with side information) for noiseless channel. The DCT is applied as an approximation to the conditional KLT, which makes the components of the transformed block conditionally independent given the side information. NSQ is a binning scheme that facilitates layered bit-plane coding of the bin indices while reducing the bit rate. LDPC code based Slepian-Wolf coding exploits the correlation between the quantized version of the source and the side information to achieve further compression. Different from previous works, an attractive feature of our proposed system is that video encoding is done only once but decoding allowed at many lower bit rates without quality loss. For Wyner-Ziv coding over discrete noisy channels, we present a Wyner-Ziv video codec using IRA codes for Slepian-Wolf coding based on the idea of two equivalent channels. For video streaming applications where the channel is packet based, we apply unequal error protection scheme to the embedded Wyner-Ziv coded video stream to find the optimal source-channel coding trade-off for a target transmission rate over packet erasure channel.


Layered Wyner-Ziv Video Coding

Layered Wyner-Ziv Video Coding

Author: Qian Xu

Publisher:

Published: 2010

Total Pages:

ISBN-13:

DOWNLOAD EBOOK

Following recent theoretical works on successive Wyner-Ziv coding, we propose a practical layered Wyner-Ziv video coder using the DCT, nested scalar quantization, and irregular LDPC code based Slepian-Wolf coding (or lossless source coding with side information at the decoder). Our main novelty is to use the base layer of a standard scalable video coder (e.g., MPEG-4/H.26L FGS or H.263+) as the decoder side information and perform layered Wyner-Ziv coding for quality enhancement. Similar to FGS coding, there is no performance difference between layered and monolithic Wyner-Ziv coding when the enhancement bitstream is generated in our proposed coder. Using an H.26L coded version as the base layer, experiments indicate that Wyner-Ziv coding gives slightly worse performance than FGS coding when the channel (for both the base and enhancement layers) is noiseless. However, when the channel is noisy, extensive simulations of video transmission over wireless networks conforming to the CDMA2000 1X standard show that H.26L base layer coding plus Wyner-Ziv enhancement layer coding are more robust against channel errors than H.26L FGS coding. These results demonstrate that layered Wyner-Ziv video coding is a promising new technique for video streaming over wireless networks. For scalable video transmission over the Internet and 3G wireless networks, we propose a system for receiver-driven layered multicast based on layered Wyner-Ziv video coding and digital fountain coding. Digital fountain codes are near-capacity erasure codes that are ideally suited for multicast applications because of their rate- less property. By combining an error-resilient Wyner-Ziv video coder and rateless fountain codes, our system allows reliable multicast of high-quality video to an arbitrary number of heterogeneous receivers without the requirement of feedback channels. Extending this work on separate source-channel coding, we consider distributed joint source-channel coding by using a single channel code for both video compression (via Slepian-Wolf coding) and packet loss protection. We choose Raptor codes - the best approximation to a digital fountain - and address in detail both encoder and de- coder designs. Simulation results show that, compared to one separate design using Slepian-Wolf compression plus erasure protection and another based on FGS coding plus erasure protection, the proposed joint design provides better video quality at the same number of transmitted packets.


Multimedia Image and Video Processing

Multimedia Image and Video Processing

Author: Ling Guan

Publisher: CRC Press

Published: 2017-12-19

Total Pages: 1064

ISBN-13: 1351833650

DOWNLOAD EBOOK

As multimedia applications have become part of contemporary daily life, numerous paradigm-shifting technologies in multimedia processing have emerged over the last decade. Substantially updated with 21 new chapters, Multimedia Image and Video Processing, Second Edition explores the most recent advances in multimedia research and applications. This edition presents a comprehensive treatment of multimedia information mining, security, systems, coding, search, hardware, and communications as well as multimodal information fusion and interaction. Clearly divided into seven parts, the book begins with a section on standards, fundamental methods, design issues, and typical architectures. It then focuses on the coding of video and multimedia content before covering multimedia search, retrieval, and management. After examining multimedia security, the book describes multimedia communications and networking and explains the architecture design and implementation for multimedia image and video processing. It concludes with a section on multimedia systems and applications. Written by some of the most prominent experts in the field, this updated edition provides readers with the latest research in multimedia processing and equips them with advanced techniques for the design of multimedia systems.


Performance of Wyner-Ziv Video Codec Under Rayleigh Fading Channel

Performance of Wyner-Ziv Video Codec Under Rayleigh Fading Channel

Author:

Publisher:

Published: 2005

Total Pages: 62

ISBN-13:

DOWNLOAD EBOOK

In this thesis, we analyze the performance of a Wyner-Ziv codec over a flat-fading Rayleigh channel. In current interframe video compression systems, the encoder performs predictive coding to exploit the similarities of successive frames. The Wyner-Ziv Theorem on source coding with side information available only at the decoder suggests that an asymmetric video codec, where individual frames are encoded separately, but decoded conditionally (given temporally adjacent frames) could achieve similar efficiency. A transformdomain Wyner-Ziv coding scheme for motion video that uses intraframe encoding, but interframe decoding is implemented. In this system, the transform coefficients of a Wyner-Ziv frame are encoded independently using a scalar quantizer and turbo coder. The decoder uses previously reconstructed frames to generate side information to conditionally decode the Wyner-Ziv frames. In this work, the video is encoded using the Wyner-Ziv video codec that is implemented. The key(even) frames are compressed using a standard H.263 intra-frame encoder and the Wyner-Ziv(odd) frames are coded using Wyner-Ziv coding. Since Wyner-Ziv ceodec acts as a combined source channel coder there is no further channel coding performed on this bitstream. But, the H.263 compressed key frames are protected using RCPC codes. The compressed video is simulated over a flat-fading Rayleigh channel with Additive White Gaussian Noise(AWGN). The experimental results show that Wynwr-Ziv codec performs better than pure intra coding under channel noise and also they offer impressive bitrate savings.


Video Coding for Wireless Communication Systems

Video Coding for Wireless Communication Systems

Author: King N. Ngan

Publisher: CRC Press

Published: 2018-10-03

Total Pages: 560

ISBN-13: 148229009X

DOWNLOAD EBOOK

"Explains the transmission of image and video information over wireless channels. Describes MPEG-4, the latest video coding standard. Discusses error resilient combined source channel image and video coders, and multiple access spread spectrum and future generation wireless video communication systems."


Wyner-Ziv Video Coding

Wyner-Ziv Video Coding

Author: Ghazaleh R. Esmaili

Publisher:

Published: 2011

Total Pages: 82

ISBN-13: 9781267073556

DOWNLOAD EBOOK

Wyner Ziv (WZ) video coding is based on the results of the Slepian-Wolf and Wyner-Ziv theorems where the temporal correlation between neighboring frames is exploited at the decoder. This approach enables designing low-cost, low complexity encoders which are essential for some recent applications such as video surveillance and mobile camera phone. Despite recent advances in WZ video coding, the rate distortion performance is still far from that of predictive coding. One of the main factors affecting the WZ coding performance is the accuracy of modeling the dependency (correlation noise) between a frame to be coded and its estimation (side information) at the decoder. In existing transform domain Wyner-Ziv video coding methods, blocks within a frame are treated uniformly to estimate the correlation noise even though the success of generating side information is different for each block. This thesis proposes a method to estimate the correlation noise by differentiating blocks within a frame based on the accuracy of the side information. A second contribution of this dissertation is exploiting the temporal correlation between key frames which are usually simply intra coded. We propose a frequency band coding mode selection for key frames to exploit similarities between adjacent key frames at the decoder to improve the overall performance. Furthermore, the advantage of applying both schemes in a hierarchical order is investigated. Rate control is an important issue in many video applications. In Wyner-Ziv video coding architectures, the available bit budget for each GOP is shared between key frames and Wyner-Ziv frames. In this dissertation, we propose a model to express the relationship between the quantization step size of key and WZ frames based on their motion activity. Then we apply this model to propose an adaptive algorithm adjusting the quantization step size of key and WZ frames to achieve and maintain a target bit rate. The objective and subjective quality of the proposed method is evaluated.


Distributed Source Coding

Distributed Source Coding

Author: Pier Luigi Dragotti

Publisher: Academic Press

Published: 2009-02-24

Total Pages: 359

ISBN-13: 0080922740

DOWNLOAD EBOOK

The advent of wireless sensor technology and ad-hoc networks has made DSC a major field of interest. Edited and written by the leading players in the field, this book presents the latest theory, algorithms and applications, making it the definitive reference on DSC for systems designers and implementers, researchers, and graduate students. This book gives a clear understanding of the performance limits of distributed source coders for specific classes of sources and presents the design and application of practical algorithms for realistic scenarios. Material covered includes the use of standard channel codes, such as LDPC and Turbo codes, to DSC, and discussion of the suitability of compressed sensing for distributed compression of sparse signals. Extensive applications are presented and include distributed video coding, microphone arrays and securing biometric data. Clear explanation of the principles of distributed source coding (DSC), a technology that has applications in sensor networks, ad-hoc networks, and distributed wireless video systems for surveillance Edited and written by the leading players in the field, providing a complete and authoritative reference Contains all the latest theory, practical algorithms for DSC design and the most recently developed applications


Joint Source-Channel Decoding

Joint Source-Channel Decoding

Author: Pierre Duhamel

Publisher: Academic Press

Published: 2009-11-26

Total Pages: 337

ISBN-13: 0080922449

DOWNLOAD EBOOK

Treats joint source and channel decoding in an integrated way Gives a clear description of the problems in the field together with the mathematical tools for their solution Contains many detailed examples useful for practical applications of the theory to video broadcasting over mobile and wireless networks Traditionally, cross-layer and joint source-channel coding were seen as incompatible with classically structured networks but recent advances in theory changed this situation. Joint source-channel decoding is now seen as a viable alternative to separate decoding of source and channel codes, if the protocol layers are taken into account. A joint source/protocol/channel approach is thus addressed in this book: all levels of the protocol stack are considered, showing how the information in each layer influences the others. This book provides the tools to show how cross-layer and joint source-channel coding and decoding are now compatible with present-day mobile and wireless networks, with a particular application to the key area of video transmission to mobiles. Typical applications are broadcasting, or point-to-point delivery of multimedia contents, which are very timely in the context of the current development of mobile services such as audio (MPEG4 AAC) or video (H263, H264) transmission using recent wireless transmission standards (DVH-H, DVB-SH, WiMAX, LTE). This cross-disciplinary book is ideal for graduate students, researchers, and more generally professionals working either in signal processing for communications or in networking applications, interested in reliable multimedia transmission. This book is also of interest to people involved in cross-layer optimization of mobile networks. Its content may provide them with other points of view on their optimization problem, enlarging the set of tools which they could use. Pierre Duhamel is director of research at CNRS/ LSS and has previously held research positions at Thomson-CSF, CNET, and ENST, where he was head of the Signal and Image Processing Department. He has served as chairman of the DSP committee and associate Editor of the IEEE Transactions on Signal Processing and Signal Processing Letters, as well as acting as a co-chair at MMSP and ICASSP conferences. He was awarded the Grand Prix France Telecom by the French Science Academy in 2000. He is co-author of more than 80 papers in international journals, 250 conference proceedings, and 28 patents. Michel Kieffer is an assistant professor in signal processing for communications at the Université Paris-Sud and a researcher at the Laboratoire des Signaux et Systèmes, Gif-sur-Yvette, France. His research interests are in joint source-channel coding and decoding techniques for the reliable transmission of multimedia contents. He serves as associate editor of Signal Processing (Elsevier). He is co-author of more than 90 contributions to journals, conference proceedings, and book chapters. Treats joint source and channel decoding in an integrated way Gives a clear description of the problems in the field together with the mathematical tools for their solution Contains many detailed examples useful for practical applications of the theory to video broadcasting over mobile and wireless networks


Improving the Rate-Distortion Performance in Distributed Video Coding

Improving the Rate-Distortion Performance in Distributed Video Coding

Author: Yaser Mohammad Taheri

Publisher:

Published: 2017

Total Pages: 136

ISBN-13:

DOWNLOAD EBOOK

Distributed video coding is a coding paradigm, which allows encoding of video frames at a complexity that is substantially lower than that in conventional video coding schemes. This feature makes it suitable for some emerging applications such as wireless surveillance video and mobile camera phones. In distributed video coding, a subset of frames in the video sequence, known as the key frames, are encoded using a conventional intra-frame encoder, such as H264/AVC in the intra mode, and then transmitted to the decoder. The remaining frames, known as the Wyner-Ziv frames, are encoded based on the Wyner-Ziv principle by using the channel codes, such as LDPC codes. In the transform-domain distributed video coding, each Wyner-Ziv frame undergoes a 4x4 block DCT transform and the resulting DCT coefficients are grouped into DCT bands. The bitplaines corresponding to each DCT band are encoded by a channel encoder, for example an LDPCA encoder, one after another. The resulting error-correcting bits are retained in a buffer at the encoder and transmitted incrementally as needed by the decoder. At the decoder, the key frames are first decoded. The decoded key frames are then used to generate a side information frame as an initial estimate of the corresponding Wyner-Ziv frame, usually by employing an interpolation method. The difference between the DCT band in the side information frame and the corresponding one in the Wyner-Ziv frame, referred to as the correlation noise, is often modeled by Laplacian distribution. A soft-input information for each bit in the bitplane is obtained using this correlation noise model and the corresponding DCT band of the side information frame. The channel decoder then uses this soft-input information along with some error-correcting bits sent by the encoder to decode the bitplanes of each DCT band in each of the Wyner-Ziv frames. Hence, an accurate estimation of the correlation noise model parameter(s) and generation of high-quality side information are required for reliable soft-input information for the bitplanes in the decoder, which in turn leads to a more efficient decoding. Consequently, less error-correcting bits need to be transmitted from the encoder to the decoder to decode the bitplanes, leading to a better compression efficiency and rate-distortion performance. The correlation noise is not stationary and its statistics vary within each Wyner-Ziv frame and within its corresponding DCT bands. Hence, it is difficult to find an accurate model for the correlation noise and estimate its parameters precisely at the decoder. Moreover, in existing schemes the parameters of the correlation noise for each DCT band are estimated before the decoder starts to decode the bitplanes of that DCT band and they are not modified and kept unchanged during decoding process of the bitplanes. Another problem of concern is that, since side information frame is generated in the decoder using the temporal interpolation between the previously decoded frames, the quality of the side information frames is generally poor when the motions between the frames are non-linear. Hence, generating a high-quality side information is a challenging problem. This thesis is concerned with the study of accurate estimation of correlation noise model parameters and increasing in the quality of the side information from the standpoint of improving the rate-distortion performance in distributed video coding. A new scheme is proposed for the estimation of the correlation noise parameters wherein the decoder decodes simultaneously all the bitplanes of a DCT band in a Wyner-Ziv frame and then refines the parameters of the correlation noise model of the band in an iterative manner. This process is carried out on an augmented factor graph using a new recursive message passing algorithm, with the side information generated and kept unchanged during the decoding of the Wyner-Ziv frame. Extensive simulations are carried out showing that the proposed decoder leads to an improved rate-distortion performance in comparison to the original DISCOVER codec and in another DVC codec employing side information frame refinement, particularly for video sequences with high motion content. In the second part of this work, a new algorithm for the generation of the side information is proposed to refine the initial side information frame using the additional information obtained after decoding the previous DCT bands of a Wyner-Ziv frame. The simulations are carried out demonstrating that the proposed algorithm provides a performance superior to that of schemes employing the other side information refinement mechanisms. Finally, it is shown that incorporating the proposed algorithm for refining the side information into the decoder proposed in the first part of the thesis leads to a further improvement in the rate-distortion performance of the DVC codec.