Rate Distortion Control in Digital Video Coding

Rate Distortion Control in Digital Video Coding

Author: Haoxiang Zhang

Publisher:

Published: 2007

Total Pages:

ISBN-13:

DOWNLOAD EBOOK

Lossy compression is widely applied for coding visual information in applications such as entertainment in order to achieve a high compression ratio. In this case, the video quality worsens as the compression ratio increases. Rate control tries to use the bit budget properly so the visual distortion is minimized. Rate control for H.264, the state-of-the-art hybrid video coder, is investigated. Based on the Rate-Distortion (R-D) slope analysis, an operational rate distortion optimization scheme for H.264 using Lagrangian multiplier method is proposed. The scheme tries to find the best path of quantization parameter (OP) options at each macroblock. The proposed scheme provides a smoother rate control that is able to cover a wider range of bit rates and for many sequences it outperforms the H.264 (JM92 version) rate control scheme in the sense of PSNR. The Bath University Matching Pursuit (BUMP) project develops a new matching pursuit (MP) technique as an alternative to transform video coders. By combining MP with precision limited quantization (PLO) and multi-pass embedded residual group encoder (MERGE), a very efficient coder is built that is able to produce an embedded bit stream, which is highly desirable for rate control. The problem of optimal bit allocation with a BUMP based video coder is investigated. An ad hoc scheme of simply limiting the maximum atom number shows an obvious performance improvement, which indicates a potential of efficiency improvement. An in depth study on the bit Rate-Atom character has been carried out and a rate estimation model has been proposed. The model gives a theoretical description of how the oit number changes. An adaptive rate estimation algorithm has been proposed. Experiments show that the algorithm provides extremely high estimation accuracy. The proposed R-D source model is then applied to bit allocation in the BUMP based video coder. An R-D slope unifying scheme was applied to optimize the performance of the coder'. It adopts the R-D model and fits well within the BUMP coder. The optimization can be performed in a straightforward way. Experiments show that the proposed method greatly improved performance of BUMP video coder, and outperforms H.264 in low and medium bit rates by up to 2 dB.


Recent Advances on Video Coding

Recent Advances on Video Coding

Author: Javier Del Ser Lorente

Publisher: BoD – Books on Demand

Published: 2011-07-05

Total Pages: 414

ISBN-13: 9533071818

DOWNLOAD EBOOK

This book is intended to attract the attention of practitioners and researchers from industry and academia interested in challenging paradigms of multimedia video coding, with an emphasis on recent technical developments, cross-disciplinary tools and implementations. Given its instructional purpose, the book also overviews recently published video coding standards such as H.264/AVC and SVC from a simulational standpoint. Novel rate control schemes and cross-disciplinary tools for the optimization of diverse aspects related to video coding are also addressed in detail, along with implementation architectures specially tailored for video processing and encoding. The book concludes by exposing new advances in semantic video coding. In summary: this book serves as a technically sounding start point for early-stage researchers and developers willing to join leading-edge research on video coding, processing and multimedia transmission.


Rate-Distortion Based Video Compression

Rate-Distortion Based Video Compression

Author: Guido M. Schuster

Publisher: Springer Science & Business Media

Published: 2013-03-09

Total Pages: 305

ISBN-13: 1475725663

DOWNLOAD EBOOK

One of the most intriguing problems in video processing is the removal of the redundancy or the compression of a video signal. There are a large number of applications which depend on video compression. Data compression represents the enabling technology behind the multimedia and digital television revolution. In motion compensated lossy video compression the original video sequence is first split into three new sources of information, segmentation, motion and residual error. These three information sources are then quantized, leading to a reduced rate for their representation but also to a distorted reconstructed video sequence. After the decomposition of the original source into segmentation, mo tion and residual error information is decided, the key remaining problem is the allocation of the available bits into these three sources of information. In this monograph a theory is developed which provides a solution to this fundamental bit allocation problem. It can be applied to all quad-tree-based motion com pensated video coders which use a first order differential pulse code modulation (DPCM) scheme for the encoding of the displacement vector field (DVF) and a block-based transform scheme for the encoding of the displaced frame differ ence (DFD). An optimal motion estimator which results in the smallest DFD energy for a given bit rate for the encoding of the DVF is also a result of this theory. Such a motion estimator is used to formulate a motion compensated interpolation scheme which incorporates a global smoothness constraint for the DVF.


Rate-Quality Optimized Video Coding

Rate-Quality Optimized Video Coding

Author: Yoo-Sok Saw

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 199

ISBN-13: 1461551250

DOWNLOAD EBOOK

Rate-Quality Optimized Video Coding discusses the matter of optimizing (or negotiating) the data rate of compressed digital video and its quality, which has been a relatively neglected topic in either side of image/video coding and tele-traffic management. Video rate management becomes a technically challenging task since it is required to maintain a certain video quality regardless of the availability of transmission or storage media. This is caused by the broadband nature of digital video and inherent algorithmic features of mainstream video compression schemes, e.g. H.261, H.263 and MPEG series. In order to maximize the media utilization and to enhance video quality, the data rate of compressed video should be regulated within a budget of available media resources while maintaining the video quality as high as possible. In Part I (Chapters 1 to 4) the non-stationarity of digital video is discussed. Since the non-stationary nature is also inherited from algorithmic properties of international video coding standards, which are a combination of statistical coding techniques, the video rate management techniques of these standards are explored. Although there is a series of known video rate control techniques, such as picture rate variation, frame dropping, etc., these techniques do not view the matter as an optimization between rate and quality. From the view of rate-quality optimization, the quantizer is the sole means of controling rate and quality. Thus, quantizers and quantizer control techniques are analyzed, based on the relationship of rate and quality. In Part II (Chapters 5 and 6), as a coherent approach to non-stationary video, established but still thriving nonlinear techniques are applied to video rate-quality optimization such as artificial neural networks including radical basis function networks, and fuzzy logic-based schemes. Conventional linear techniques are also described before the nonlinear techniques are explored. By using these nonlinear techniques, it is shown how they influence and tackle the rate-quality optimization problem. Finally, in Chapter 7 rate-quality optimization issues are reviewed in emerging video communication applications such as video transcoding and mobile video. This chapter discusses some new issues and prospects of rate and quality control in those technology areas. Rate-Quality Optimized Video Coding is an excellent reference and can be used for advanced courses on the topic.


Rate Distortion Theory for Causal Video Coding

Rate Distortion Theory for Causal Video Coding

Author: Lin Zheng

Publisher:

Published: 2012

Total Pages: 156

ISBN-13:

DOWNLOAD EBOOK

Due to the sheer volume of data involved, video coding is an important application of lossy source coding, and has received wide industrial interest and support as evidenced by the development and success of a series of video coding standards. All MPEG-series and H-series video coding standards proposed so far are based upon a video coding paradigm called predictive video coding, where video source frames Xi, i=1,2 ..., N, are encoded in a frame by frame manner, the encoder and decoder for each frame Xi, i =1, 2 ..., N, enlist help only from all previous encoded frames Sj, j=1, 2 ..., i-1. In this thesis, we will look further beyond all existing and proposed video coding standards, and introduce a new coding paradigm called causal video coding, in which the encoder for each frame Xi can use all previous original frames Xj, j=1, 2 ..., i-1, and all previous encoded frames Sj, while the corresponding decoder can use only all previous encoded frames. We consider all studies, comparisons, and designs on causal video coding from an information theoretic point of view. Let R*c(D1 ..., D_N) (R*p(D1 ..., D_N), respectively) denote the minimum total rate required to achieve a given distortion level D1 ..., D_N> 0 in causal video coding (predictive video coding, respectively). A novel computation approach is proposed to analytically characterize, numerically compute, and compare the minimum total rate of causal video coding R*c(D1 ..., D_N) required to achieve a given distortion (quality) level D1 ..., D_N> 0. Specifically, we first show that for jointly stationary and ergodic sources X1 ..., X_N, R*c(D1 ..., D_N) is equal to the infimum of the n-th order total rate distortion function R_{c, n}(D1 ..., D_N) over all n, where R_{c, n}(D1 ..., D_N) itself is given by the minimum of an information quantity over a set of auxiliary random variables. We then present an iterative algorithm for computing R_{c, n}(D1 ..., D_N) and demonstrate the convergence of the algorithm to the global minimum. The global convergence of the algorithm further enables us to not only establish a single-letter characterization of R*c(D1 ..., D_N) in a novel way when the N sources are an independent and identically distributed (IID) vector source, but also demonstrate a somewhat surprising result (dubbed the more and less coding theorem)--under some conditions on source frames and distortion, the more frames need to be encoded and transmitted, the less amount of data after encoding has to be actually sent. With the help of the algorithm, it is also shown by example that R*c(D1 ..., D_N) is in general much smaller than the total rate offered by the traditional greedy coding method by which each frame is encoded in a local optimum manner based on all information available to the encoder of the frame. As a by-product, an extended Markov lemma is established for correlated ergodic sources. From an information theoretic point of view, it is interesting to compare causal video coding and predictive video coding, which all existing video coding standards proposed so far are based upon. In this thesis, by fixing N=3, we first derive a single-letter characterization of R*p(D1, D2, D3) for an IID vector source (X1, X2, X3) where X1 and X2 are independent, and then demonstrate the existence of such X1, X2, X3 for which R*p(D1, D2, D3)>R*c(D1, D2, D3) under some conditions on source frames and distortion. This result makes causal video coding an attractive framework for future video coding systems and standards. The design of causal video coding is also considered in the thesis from an information theoretic perspective by modeling each frame as a stationary information source. We first put forth a concept called causal scalar quantization, and then propose an algorithm for designing optimum fixed-rate causal scalar quantizers for causal video coding to minimize the total distortion among all sources. Simulation results show that in comparison with fixed-rate predictive scalar quantization, fixed-rate causal scalar quantization offers as large as 16% quality improvement (distortion reduction).


Low-complexity Mode Selection for Rate-distortion Optimal Video Coding

Low-complexity Mode Selection for Rate-distortion Optimal Video Coding

Author: Hyungjoon Kim

Publisher:

Published: 2007

Total Pages: 88

ISBN-13: 9781109991765

DOWNLOAD EBOOK

In addition to our theoretical work, we present practical solutions to real-time implementation of encoder modules including our proposed mode selection method on digital signal processors. First, we investigate the features provided by most of the recent digital signal processors, for example, hierarchical memory structure and efficient data transfer between on-chip and off-chip memory, and then present practical approaches for real-time implementation of a video encoder system with efficient use of the features.


Versatile Video Coding

Versatile Video Coding

Author: Humberto Ochoa Dominguez

Publisher: CRC Press

Published: 2022-09-01

Total Pages: 458

ISBN-13: 1000795055

DOWNLOAD EBOOK

Video is the main driver of bandwidth use, accounting for over 80 per cent of consumer Internet traffic. Video compression is a critical component of many of the available multimedia applications, it is necessary for storage or transmission of digital video over today's band-limited networks. The majority of this video is coded using international standards developed in collaboration with ITU-T Study Group and MPEG. The MPEG family of video coding standards begun on the early 1990s with MPEG-1, developed for video and audio storage on CD-ROMs, with support for progressive video. MPEG-2 was standardized in 1995 for applications of video on DVD, standard and high definition television, with support for interlaced and progressive video. MPEG-4 part 2, also known as MPEG-2 video, was standardized in 1999 for applications of low- bit rate multimedia on mobile platforms and the Internet, with the support of object-based or content based coding by modeling the scene as background and foreground. Since MPEG-1, the main video coding standards were based on the so-called macroblocks. However, research groups continued the work beyond the traditional video coding architectures and found that macroblocks could limit the performance of the compression when using high-resolution video. Therefore, in 2013 the high efficiency video coding (HEVC) also known and H.265, was released, with a structure similar to H.264/AVC but using coding units with more flexible partitions than the traditional macroblocks. HEVC has greater flexibility in prediction modes and transform block sizes, also it has a more sophisticated interpolation and de blocking filters. In 2006 the VC-1 was released. VC-1 is a video codec implemented by Microsoft and the Microsoft Windows Media Video (VMW) 9 and standardized by the Society of Motion Picture and Television Engineers (SMPTE). In 2017 the Joint Video Experts Team (JVET) released a call for proposals for a new video coding standard initially called Beyond the HEVC, Future Video Coding (FVC) or known as Versatile Video Coding (VVC). VVC is being built on top of HEVC for application on Standard Dynamic Range (SDR), High Dynamic Range (HDR) and 360° Video. The VVC is planned to be finalized by 2020. This book presents the new VVC, and updates on the HEVC. The book discusses the advances in lossless coding and covers the topic of screen content coding. Technical topics discussed include: Beyond the High Efficiency Video CodingHigh Efficiency Video Coding encoderScreen contentLossless and visually lossless coding algorithmsFast coding algorithmsVisual quality assessmentOther screen content coding algorithmsOverview of JPEG Series


Efficient Algorithms for MPEG Video Compression

Efficient Algorithms for MPEG Video Compression

Author: Dzung Tien Hoang

Publisher: Wiley-Interscience

Published: 2002-02-21

Total Pages: 208

ISBN-13:

DOWNLOAD EBOOK

Video compression is the enabling technology behind many cutting-edge business and Internet applications, including video-conferencing, video-on-demand, and digital cable TV. Coauthored by internationally recognized authorities on the subject, this book takes a close look at the essential tools of video compression, exploring some of the most promising algorithms for converting raw data to a compressed form.