Coding Video

Name: Coding Video
Author: Iain E. Richardson
ISBN: 9781118711781

September 2024

Intermediate to advanced

448 pages

14h 23m

English

Wiley

Cover
Table of Contents
Title Page
Copyright Page
Dedication Page
About the Author
Acknowledgements
About the Companion Website
1 Introduction
1.1 Why Write This Book?
1.2 What Is in the Book?
1.3 How Should You Use This Book?
References
2 Video Coding and Video Quality
2.1 Introduction
2.2 An Overview of Video Coding
2.3 Inputs and Outputs
2.4 Structural Elements
2.5 Prediction
2.6 Transform and Quantisation
2.7 Bitstream Coding
2.8 The Coded Bitstream
2.9 Storing and Transmitting the Coded Bitstream
2.10 The Decoder
2.11 The Video Codec Model
2.12 Video Codec Performance
2.13 Conclusion
References
3 A History of Video Coding and Video Coding Standards
3.1 Introduction
3.2 The Foundations of Video Coding, 1950–1990
3.3 Video Coding Standards and Formats: 1990–2021
3.4 Comparing Video Coding Standards
3.5 Conclusions
References
4 Structures
4.1 Introduction
4.2 Coded Video: Sequence to Picture
4.3 Coded Video: Picture to Basic Unit
4.4 Coded Video: Basic Unit to Block
4.5 HEVC Coding Structures
4.6 Structures in Versatile Video Coding/H.266
4.7 Conclusion
Reference
5 Intra Prediction
5.1 Introduction
5.2 The Intra Prediction Process
5.3 Intra Prediction Modes
5.4 Prediction Block Sizes
5.5 Signalling Intra Prediction Choices
5.6 Choosing a Prediction
5.7 HEVC Intra Prediction
5.8 VVC Intra Prediction
5.9 Conclusions
References
6 Inter Prediction
6.1 Introduction
6.2 Inter Prediction – the Basics
6.3 Forward, Backward and Biprediction
6.4 Inter Prediction Block Sizes
6.5 Motion Vectors
6.6 Sub‐Pixel Interpolation
6.7 Reference Pictures
6.8 Signalling Inter Prediction Choices
6.9 Skip Mode
6.10 Loop Filter
6.11 When Inter Prediction Does Not Find a Good Match
6.12 HEVC Inter Prediction
6.13 Inter Prediction in VVC
6.14 Conclusions
References
7 Transform and Quantisation
7.1 Introduction
7.2 Residual Blocks
7.3 Block Transforms
7.4 Quantisation
7.5 Transform and Quantisation in Practice
7.6 HEVC Transform and Quantisation
7.7 Transform and Quantise in H.266 Versatile Video Coding
7.8 Conclusions
References
8 Entropy Coding
8.1 Introduction
8.2 Entropy Coding for Video Compression
8.3 Pre‐processing
8.4 Probability Models and Context Adaptation
8.5 Variable‐Length Coding
8.6 Arithmetic Coding
8.7 Binary Arithmetic Coding
8.8 Context‐Adaptive Binary Arithmetic Coding (CABAC)
8.9 Entropy Coding in HEVC
8.10 Entropy Coding in H.266/VVC
8.11 Conclusion
References
9 Coded Video Filtering
9.1 Introduction
9.2 Filtering and Video Coding
9.3 Detecting and Correcting Video Coding Artefacts
9.4 HEVC In‐Loop Filtering
9.5 VVC Filtering
9.6 Conclusions
References
10 Storing and Transporting Coded Video
10.1 Introduction
10.2 Storing and Delivering Coded Video
10.3 Coded Video File Formats
10.4 Transport of Coded Video
10.5 Video Rate Control
10.6 Error Handling
10.7 Conclusions
References
11 Implementation and Performance
11.1 Introduction
11.2 Implementing Video Codecs
11.3 Software Implementation
11.4 Hardware Implementation
11.5 Video Codec Performance
11.6 Getting Started with Experiments
11.7 Conclusion
References
12 Conclusions
12.1 What This Book Has and Has Not Covered
12.2 Where Is Video Coding Going Next?
12.3 Where Should You Go Next?
References
Glossary
Index
End User License Agreement

Content preview from Coding Video

4 Structures

4.1 Introduction

4.1.1 How Does a Video Codec Use Structures?

A complete video clip or sequence is processed by a video encoder to create a compressed bitstream. In order to handle the large amount of image data in a sequence of video frames, the encoder splits it up into manageable structures. In this chapter, we will look at how a video encoder does this and how it bridges the gap between a source video, which contains multiple frames each made up of thousands or millions of pixels, and encoded data units. We will also look at the processing and storage capabilities of a practical codec that handles a relatively small region or block of data at a time. Figure 4.1 illustrates how a source video sequence is split up into manageable‐sized structures, such as blocks of pixels. These structures are processed and encoded to produce the compressed bitstream.

A coded video sequence is a series of coded pictures that, when decoded, will play back as a complete video programme or clip. Coded frames or pictures may be organised into multi‐picture structures during encoding, such as Groups of Pictures (GOPs). Each picture may be coded as a single unit or in multiple sections known as slices or tiles. Each slice or tile contains one or more Basic Units, such as Macroblocks or Coding Tree Units.
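The nesting described above (sequence, multi‐picture group, picture, slice or tile, Basic Unit) can be pictured as a simple containment hierarchy. The following is only an illustrative sketch: the class names and fields are hypothetical, chosen for this example, and do not correspond to the syntax elements of any particular standard.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class BasicUnit:
    # Top-left corner and side length in pixels (e.g. a Macroblock or Coding Tree Unit).
    x: int
    y: int
    size: int


@dataclass
class Slice:
    # A slice (or tile) holds one or more Basic Units.
    basic_units: List[BasicUnit] = field(default_factory=list)


@dataclass
class Picture:
    # A picture is coded as one or more slices or tiles.
    slices: List[Slice] = field(default_factory=list)


@dataclass
class GroupOfPictures:
    # A multi-picture structure used during encoding.
    pictures: List[Picture] = field(default_factory=list)


@dataclass
class CodedSequence:
    # A complete coded video programme or clip.
    gops: List[GroupOfPictures] = field(default_factory=list)


def count_basic_units(seq: CodedSequence) -> int:
    """Walk the hierarchy and count every Basic Unit in the sequence."""
    return sum(
        len(s.basic_units)
        for gop in seq.gops
        for pic in gop.pictures
        for s in pic.slices
    )


# Hypothetical example: one group of two pictures, each coded as a single
# slice containing four 16 x 16 Basic Units (a 32 x 32 pixel picture).
def make_picture() -> Picture:
    units = [BasicUnit(x, y, 16) for y in (0, 16) for x in (0, 16)]
    return Picture(slices=[Slice(basic_units=units)])


sequence = CodedSequence(gops=[GroupOfPictures(pictures=[make_picture(), make_picture()])])
total_units = count_basic_units(sequence)
```

A real bitstream carries far more information at each level (headers, parameter sets, prediction and residual data); the sketch shows only the containment relationships.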

The Basic Unit is a unit of data handled by the encoder and decoder. In present‐day codecs, it is a square block of pixels. In the older MPEG‐2 and H.264/AVC standards, it is 16 × 16 pixels; in newer standards it ranges up to 64 × 64 ...
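To get a feel for the Basic Unit sizes mentioned above, it can help to count how many square units are needed to cover one picture. The sketch below is a minimal illustration, not taken from the book: the function names are hypothetical, and the 1920 × 1080 picture size is an assumed example. It rounds up so that partial units at the right and bottom edges are counted; how a given standard actually handles those edge units is outside the scope of this sketch.

```python
from typing import Iterator, Tuple


def basic_unit_grid(width: int, height: int, unit_size: int) -> Tuple[int, int]:
    """Return (columns, rows) of square Basic Units needed to cover a picture.

    Uses integer ceiling division so partially covered edge units are counted.
    """
    cols = -(-width // unit_size)
    rows = -(-height // unit_size)
    return cols, rows


def basic_unit_origins(width: int, height: int, unit_size: int) -> Iterator[Tuple[int, int]]:
    """Yield the top-left (x, y) pixel position of each Basic Unit in raster order."""
    for y in range(0, height, unit_size):
        for x in range(0, width, unit_size):
            yield x, y


# Assumed example picture size: 1920 x 1080 pixels.
grid_16 = basic_unit_grid(1920, 1080, 16)   # 16 x 16 units
grid_64 = basic_unit_grid(1920, 1080, 64)   # 64 x 64 units

print(grid_16, grid_16[0] * grid_16[1])  # (120, 68) 8160
print(grid_64, grid_64[0] * grid_64[1])  # (30, 17) 510
```

Because 1080 is not a multiple of 16 or 64, the last row of units in both cases extends past the bottom edge of the picture.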
