| Literature DB >> 24701145 |
Patrick Seeling1, Martin Reisslein2.
Abstract
Video encoding for multimedia services over communication networks has significantly advanced in recent years with the development of the highly efficient and flexible H.264/AVC video coding standard and its SVC extension. The emerging H.265/HEVC video coding standard as well as 3D video coding further advance video coding for multimedia communications. This paper first gives an overview of these new video coding standards and then examines their implications for multimedia communications by studying the traffic characteristics of long videos encoded with the new coding standards. We review video coding advances from MPEG-2 and MPEG-4 Part 2 to H.264/AVC and its SVC and MVC extensions as well as H.265/HEVC. For single-layer (nonscalable) video, we compare H.265/HEVC and H.264/AVC in terms of video traffic and statistical multiplexing characteristics. Our study is the first to examine the H.265/HEVC traffic variability for long videos. We also illustrate the video traffic characteristics and statistical multiplexing of scalable video encoded with the SVC extension of H.264/AVC as well as 3D video encoded with the MVC extension of H.264/AVC.Entities:
Mesh:
Year: 2014 PMID: 24701145 PMCID: PMC3951063 DOI: 10.1155/2014/189481
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Block diagram of video network transport system. The captured video frames are encoded and smoothed before network transmission. The evaluations of video transmission consider the rate-distortion (RD) characteristics, the rate variability-distortion (VD) characteristics before and after the smoother, and the required smoother buffer and the link bitrate C min requirements.
Summary of main terminology and notations.
| AVC | Advanced Video Coding |
| Avg. video bitrate (bit/s) | Average (mean) of frame sizes of frames in a video sequence divided by frame period |
|
| Number of bidirectional predicted (B) frames between successive I (or P) frames |
| CoV | Coefficient of variation, that is, mean value of a random quantity divided by its standard deviation |
| DCT | Discrete cosine transform |
| Frame size (Byte) | Number of Bytes of information to represent an encoded video frame |
| Frame period (s) | Display duration (in seconds) for a video frame, that is, inverse of frame rate (in frames/second) |
| HEVC | High Efficiency Video Coding |
| MVC | Multiview Video Coding |
| QP | Quantization parameter |
| RD curve | Rate-distortion curve, that is, plot of distortion (typically represented through PSNR video quality) |
| as a function of average video bitrate | |
| PSNR | Peak signal to noise ratio (dB) |
| PSNR video quality (dB) | Average (mean) of PSNR values of encoded frames in a video sequence |
| SVC | Scalable Video Coding |
|
| Number of layers in hierarchical B frame structure; τ = log2(β + 1) for dyadic hierarchy |
| VD curve | Rate variability-distortion curve, that is, plot of rate variability (typically represented by CoV of frame sizes) |
| as a function of video distortion (typically represented by PSNR video quality) |
Figure 2Illustration of classical B frame prediction structure used in MPEG-2 and MPEG-4 Part 2 (without reference arrows for even frames to avoid clutter) and dyadic hierarchical B frame prediction structure of H.264/AVC, H.264 SVC, and H.265/HEVC.
Figure 3Traffic characteristics and link bandwidth requirements for H.264/AVC without and with cascading (C) quantization parameters (QPs) and H.265/HEVC with cascading QPs for Sony video.
Figure 4Traffic characteristics and link bandwidth requirements for H.265/HEVC with cascading QPs for a variety of videos, as well as comparison of H.265/HEVC (with cascaded QPs) with H.264/AVC (with cascaded QPs) for Tears of Steel (ToS) video.
End-to-end delay in frame periods for video encodings with β, β ≥ 1, B frames between successive I/P frames and smoothing over a frames from [17]; encoding, network transport, and decoding are assumed to each requiring one frame period per frame.
| Live video | Prerecorded video | |
|---|---|---|
| Classical B frame prediction | β + 2 |
|
| Hierarchical B frame prediction | β + 2 |
|
Figure 5Illustration of traffic characteristics and link bandwidth requirements for Sony encoded with H.264 SVC with medium-grain scalability (MGS) for base layer QPs B = 35 and 40 and enhancement layer QP E = 25 with and without QP cascading (C), in comparison with H.264/AVC single-layer encoding with cascading QPs.
Figure 6RD and VD characteristics of MVC encodings without and with cascaded QPs (C) of 35 minutes each of 3D videos Alice in Wonderland and IMAX Space Station with full HD 1920 × 1080 pixel resolution. The encoded left and right views are streamed sequentially (S) or are streamed aggregated (A) into multiview frames.