ISO/IEC 14496 Standards for MPEG-4, or “Coding of Audio-Visual Objects”

A child watching a video that follows ISO/IEC 14496 on a couch

In other posts, we discussed the MPEG-1 and MPEG-2 standards, as well as their respective corresponding international standard documents. Initially introduced in 1998, MPEG-4 absorbed the information of its predecessors and expanded upon it. As a video compression suite, MPEG-4 is used for webpages, digital television, and interactive media on the World Wide Web.

The set of technologies associated with MPEG-4 benefit authors by enabling them to provide content with far greater readability, network service providers by giving transparent content, and end users by bringing higher levels of interaction with content. MPEG-4 achieves these goals by representing units of aural, audio, or audiovisual content of natural or synthetic origin. MPEG-4 audiovisual scenes are composed of still images, video objects, and audio objects.

Please note that there is no MPEG-3. MPEG (Moving Picture Experts Group) once had a plan to define a video coding system for High Definition Television (HDTV) applications, but the working group realized that the tools needed for the video coding of HDTV were already contained in the MPEG-2 suite. As a result, the development of MPEG-3 was stopped, and the work was continued with MPEG-2.

Developed by MPEG as a working group of ISO/IEC JTC 1 (JTC1/SC29/WG11), MPEG-4 is specified in the ISO/IEC 14496 series of international standards. As with other MPEG documents, the first three parts cover the systems, video (visual), and audio. There are currently 30 parts in total, all listed below.

Please note that many of these standard documents have amendments, which make technical changes to the existing documents, and corrigenda, which repair editorial errors.

ISO/IEC 14496-1:2010 – Information technology – Coding of audio-visual objects – Part 1: Systems

This standard has two amendments: Amd1:2010 – Usage of LASeR in MPEG-4 systems and Registration Authority for MPEG-4 descriptors and Amd2:2014 – Support for raw audio-visual data.

ISO/IEC 14496-2:2004 – Information technology – Coding of audio-visual objects – Part 2: Visual

This standard has five amendments: Amd1:2004 – Streaming video profile – Amendment 1: Error resilient simple scalable profile, Amd2:2005 – Streaming video profile – Amendment 2: New Levels for Simple Profile, Amd3:2007 – Streaming video profile – Amendment 3: Support for colour spaces (this amendment has one corrigendum: Amd3/Cor1:2008), Amd4:2008 – Streaming video profile – Amendment 4: Simple profile level 6, Amd5:2009 – Simple studio profile levels 5 and 6.

It also has five corrigenda: Cor1:2004, Cor2:2007, Cor3:2008, Cor4:2010, and Cor5:2013.

ISO/IEC 14496-3:2019 – Information Technology – Coding Of Audio-Visual Objects – Part 3: Audio

Parts One through Three are available as the ISO/IEC 14496 – IT Coding of Audio Visual Objects Package.

ISO/IEC 14496-4:2004 – Information technology – Coding of audio-visual objects – Part 4: Conformance testing

This standard has around forty amendments. If you’d like to see a list, please refer to Part 4 Amendments.

It also has several corrigenda: Cor1:2005, Cor2:2007, Cor4:2008, and Cor7:2010.

ISO/IEC 14496-5:2001 – Information technology – Coding of audio-visual objects – Part 5: Reference software

This standard also has around forty amendments. If you’d like to see a list, please refer to Part 5 Amendments.

ISO/IEC 14496-6:2000 – Information technology — Coding of audio-visual objects — Part 6: Delivery Multimedia Integration Framework (DMIF)

ISO/IEC TR 14496-7:2004 – Information technology – Coding of audio-visual objects – Part 7: Optimized reference software for coding of audio-visual objects

ISO/IEC 14496-8:2004 – Information technology – Coding of audio-visual objects – Part 8: Carriage of ISO/IEC 14496 contents over IP networks

ISO/IEC TR 14496-9:2009 – Information technology – Coding of audio-visual objects – Part 9: Reference hardware description

ISO/IEC 14496-10:2022 – Information technology – Coding of audio-visual objects – Part 10: Advanced Video Coding

ISO/IEC 14496-11:2015 – Information technology – Coding of audio-visual objects – Part 11: Scene description and application engine

ISO/IEC 14496-12:2022 – Information technology – Coding of audio-visual objects – Part 12: ISO base media file format

ISO/IEC 14496-13:2004 – Information technology – Coding of audio-visual objects – Part 13: Intellectual Property Management and Protection (IPMP) extensions

ISO/IEC 14496-14:2020 – Information technology – Coding of audio-visual objects – Part 14: MP4 file format

ISO/IEC 14496-15:2022 – Information technology – Coding of audio-visual objects – Part 15: Carriage of network abstraction layer (NAL) unit structured video in the ISO base media file format

ISO/IEC 14496-16:2011 – Information technology – Coding of audio-visual objects – Part 16: Animation Framework eXtension (AFX)

This standard has four amendments: Amd1:2011 – Efficient representation of 3D meshes with multiple attributes, Amd2:2014 – Multi-resolution 3D mesh compression, Amd3:2016 – Printing material and 3D graphics coding for browsers, and Amd4:2017 – Pattern-based 3D mesh coding (PB3DMC).

It also has one corrigendum: Cor1:2015.

ISO/IEC 14496-17:2006 – Information technology – Coding of audio-visual objects – Part 17: Streaming text format

ISO/IEC 14496-18:2004 – Information technology – Coding of audio-visual objects – Part 18: Font compression and streaming

This standard has one amendment: Amd1:2014 – Updated semantics of Decoder Specific Info and font data description for ISOBMFF.

This document also has a corrigendum: Cor1:2007.

ISO/IEC 14496-19:2004 – Information technology – Coding of audio-visual objects – Part 19: Synthesized texture stream

ISO/IEC 14496-20:2008 – Information technology – Coding of audio-visual objects – Part 20: Lightweight Application Scene Representation (LASeR) and Simple Aggregation Format (SAF)

This standard has three amendments: Amd1:2009 – Extensions to support SVGT1.2, Amd2:2010 – Technology for scene adaptation, and Amd3:2010 – Presentation and Modification of Structured Information (PMSI).

It also has one corrigendum: Cor1:2010.

ISO/IEC 14496-21:2006 – Information technology – Coding of audio-visual objects – Part 21: MPEG-J Graphics Framework eXtensions (GFX)

This standard has one corrigendum: Cor1:2007.

ISO/IEC 14496-22:2019 – Information technology – Coding of audio-visual objects – Part 22: Open Font Format

ISO/IEC 14496-23:2008 – Information technology – Coding of audio-visual objects – Part 23: Symbolic Music Representation

ISO/IEC TR 14496-24:2008 – Information technology – Coding of audio-visual objects – Part 24: Audio and systems interaction

ISO/IEC 14496-25:2011 – Information technology – Coding of audio-visual objects – Part 25: 3D Graphics Compression Model

ISO/IEC 14496-26:2010 – Information technology – Coding of audio-visual objects – Part 26: Audio conformance

This standard has several amendments: Amd2:2010 – BSAC conformance for broadcasting, Amd3:2014 – Conformance for Low Delay AAC v2 profile, and Amd4:2016 – AAC Additional Multichannel Conformance Data.

It also has numerous corrigenda: Cor2:2011, Cor3:2011, Cor4:2011, Cor5:2012, Cor6:2013, Cor7:2013, and Cor8:2015.

ISO/IEC 14496-27:2009 – Information technology – Coding of audio-visual objects – Part 27: 3D Graphics conformance

This standard has several amendments: Amd2:2011 – Scalable complexity 3D mesh coding conformance (this amendment has a corrigendum: Amd2:2011/Cor1:2015), Amd3:2011 – Scalable complexity 3D mesh coding conformance in 3DGCM, Amd4:2012 – Conformance for efficient representation of 3D meshes with multiple attributes, Amd5:2015 – Multi-resolution 3D mesh compression, and Amd6:2015 – Pattern-based 3D mesh coding conformance.

ISO/IEC 14496-28:2012 – Information technology – Coding of audio-visual objects – Part 28: Composite font representation

This standard has two corrigenda: Cor1:2013 and Cor2:2014.

ISO/IEC 14496-29:2015 – Information technology – Coding of audio-visual objects – Part 29: Web video coding

ISO/IEC 14496-30:2018 – Information technology – Coding of audio-visual objects – Part 30: Timed text and other visual overlays in ISO base media file format

Parts Thirty-One, Thirty-Two, and Thirty-Three are currently under development.

Share this blog post:

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.