Welcome to the ISO/IEC JTC 1/SC 29/WG 11 web site, also known as MPEG, the Moving Picture Experts Group.

The MPEG acronym is also used to indicate a suite of ISO/IEC digital media standards developed by this JTC 1 Working Group.


The Moving Picture Experts Group

MPEG Work plan

Contents

1 Media coding
1.1 Support for Dynamic Range Control
1.2 Multi-Resolution Frame Compatible Stereoscopic Video with Depth Maps etc.
1.3 Pattern Based 3D Mesh Compression
1.4 Web 3D Graphics Coding
1.5 Metadata for Realistic Material Representation
1.6 Font compression and streaming
1.7 Composite Font Representation
1.8 Open Font Format
1.9 Web Video Coding
1.10 Video Coding for Browsers
1.11 Contract Expression Language
1.12 Media Contract Ontology
1.13 Parser instantiation for BSD
1.14 Media Tool Library
1.15 SAOC Dialog enhancement
1.16 Audio Dynamic Range Control
1.17 Media Context and Control – Control Information
1.18 Media Context and Control – Sensory Information
1.19 Media Context and Control – Virtual World Object Characteristics
1.20 Media Context and Control – Data Formats for Interaction Devices
1.21 Media Context and Control – Common Types and Tools
1.22 HEVC Range Extensions
1.23 HEVC Multiview Extensions
1.24 3D HEVC
1.25 HEVC Scalable Extensions
1.26 Coding of screen content
1.27 3D Audio
1.28 3D Audio Profiles
1.29 HEVC Image Sequences
1.30 Free Viewpoint Television
1.31 Internet Video Coding
1.32 Higher Dynamic Range and Wide Gamut Content Distribution
1.33 Processing and Sharing of Media under User Control
1.34 Genome Compression

2 Composition coding
2.1 Scene Description Based Collaborative Applications
2.2 MMT Composition Information

3 Description coding
3.1 Compact Descriptors for Visual Search
3.2 Compact Descriptors for Video Analysis
3.3 User Description
3.4 Multiple text encodings, extended classification metadata
3.5 Green Metadata
3.6 Storage of traceable media signatures

4 Systems support
4.1 Audio Synchronization
4.2 Signalling of Transport profiles, signalling MVC stereo view association and MIME type registration
4.3 Signalling and Transport of SAOC-DE in AAC
4.4 Coding-independent codepoints
4.5 MMT Forward Error Correction Codes
4.6 MMT Cross Layer Interface
4.7 Uniform signalling for timeline alignment

5 IPMP
5.1 Common Encryption Format for ISO Base Media File Format
5.2 Support of Sparse Encryption

6 Digital Item

7 Transport and File formats
7.1 Carriage of additional audio profiles & levels
7.2 Delivery of Timeline for External Data
7.3 Carriage of Layered HEVC in MPEG-2 TS
7.4 Transport of MVC depth video and HEVC low delay
7.5 Carriage of Green Metadata
7.6 Carriage of 3D Audio
7.7 Enhanced audio support and other improvements
7.8 Font streams and other improvements to file format
7.9 Ordered combination of separate tracks
7.10 Carriage of MVC+D in ISO Base Media File Format
7.11 Enhanced carriage of HEVC
7.12 Carriage of Layered HEVC
7.13 Enhanced audio support
7.14 Timed Metadata Metrics of Media in the ISO Base Media File Format
7.15 Sample Variants in ISOBMFF
7.16 Additional technologies for MPEG Media Transport
7.17 Image File Format
7.18 MMT Implementation Guidelines
7.19 Extended Profiles and time synchronization
7.20 Spatial Relationship Description, Generalized URL parameters and other extensions
7.21 MPEG-DASH Implementation Guidelines
7.22 Server and Network Assisted DASH

8 Multimedia architecture
8.1 MPEG-M API
8.2 MPEG-V Architecture

9 Application formats
9.1 Augmented Reality AF
9.2 Mixed and Augmented Reality Reference Model
9.3 Multimedia Preservation
9.4 Multisensory Effects Application Format
9.5 Publish/Subscribe Application Format (PSAF)
9.6 Adaptive Screen Content Sharing Application Format

10 Reference implementation
10.1 New levels for AAC profiles, uniDRC support and AAC block length Reference Software
10.2 MVC plus depth extension of AVC Reference Software
10.3 Multi-resolution Frame Compatible Stereo Coding extension of AVC Reference Software
10.4 3D extension of AVC Reference Software
10.5 Pattern based 3D mesh compression Reference Software
10.6 Video Coding for Browsers Reference Software
10.7 CEL and MCO Reference Software
10.8 MPEG-7 Visual Reference Software
10.9 CDVS Reference Software
10.10 ARAF reference software
10.11 Media Tool Library Reference Software
10.12 SAOC and SAOC Dialogue Enhancement Conformance
10.13 MPEG-DASH Reference Software
10.14 MPEG-V – Reference Software
10.15 MMT Reference Software
10.16 HEVC Reference Software
10.17 3D Audio Reference Software

11 Conformance
11.1 New levels for AAC profiles and uniDRC support
11.2 Multi-resolution Frame Compatible Stereo Coding extension of AVC Conformance
11.3 3D-AVC Conformance
11.4 Pattern based 3D mesh compression Conformance
11.5 Video Coding for Browsers Conformance
11.6 CDVS Conformance
11.7 CEL and MCO Conformance
11.8 ARAF Conformance
11.9 Media Tool Library Conformance
11.10 SAOC and SAOC Dialogue Enhancement Conformance
11.11 MPEG-V – Conformance
11.12 MMT Conformance
11.13 HEVC Conformance
11.14 3D Audio Conformance

12 Maintenance
12.1 Systems coding standards
12.2 Video coding standards
12.3 Audio coding standards
12.4 3DG coding standards
12.5 Systems description coding standards
12.6 Visual description coding standards
12.7 Audio description coding standards
12.8 MPEG-21 standards
12.9 MPEG-A standards

 

1 Media coding

1.1 Support for Dynamic Range Control

Std: 4 | Pt: 3 | Amd: A5 | Req: 14445

Short description:

This amendment signals and carries Dynamic Range Control data in MPEG-4 Audio elementary streams, which allows Dynamic Range Control to be used with, for example, the AAC family of codecs.

 

1.2 Multi-Resolution Frame Compatible Stereoscopic Video with Depth Maps etc.

Std: 4 | Pt: 10 | Amd: A1 | Req: 14676

Short description:

3V

 

1.3 Pattern Based 3D Mesh Compression

Std: 4 | Pt: 16 | Amd: 4 | Req: -

Short description:

This standard introduces a compressed representation for multi-connected 3D models that exploits repetitive structures discovered in the input models. It detects structures that repeat at various positions, orientations and scaling factors, and organizes the 3D model into a “pattern-instance” representation. A pattern is the representative geometry of a repetitive structure. The connected components belonging to a repetitive structure are called instances of the corresponding pattern; each is represented by the pattern ID and its transformation with respect to the pattern. The instance transformation consists of four parts: reflection, translation, rotation and, possibly, uniform scaling.
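The pattern-instance idea can be illustrated with a minimal Python sketch. It is not the coding scheme of the standard: it works in 2D for brevity, the helper name `rebuild_instance` is hypothetical, and the order in which the four parts are applied (reflection, rotation, uniform scaling, translation) is an assumption made for this example.

```python
import math

def rebuild_instance(pattern, reflect, angle_deg, scale, translation):
    """Rebuild one instance from its pattern (hypothetical helper, 2D only).

    Assumed order: reflection, rotation, uniform scaling, translation.
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    tx, ty = translation
    vertices = []
    for x, y in pattern:
        if reflect:
            x = -x  # mirror across the y axis
        xr = cos_t * x - sin_t * y  # rotate about the origin
        yr = sin_t * x + cos_t * y
        vertices.append((scale * xr + tx, scale * yr + ty))
    return vertices

# One pattern stored once; two instances stored as (pattern ID, transformation).
patterns = {0: [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]}
instances = [
    (0, dict(reflect=False, angle_deg=0.0, scale=1.0, translation=(0.0, 0.0))),
    (0, dict(reflect=False, angle_deg=90.0, scale=2.0, translation=(5.0, 0.0))),
]
model = [rebuild_instance(patterns[pid], **t) for pid, t in instances]
```

The saving comes from storing the pattern geometry once and only a handful of transformation parameters per instance.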

 

1.4 Web 3D Graphics Coding

Std: 4 | Pt: 16 | Amd: E5 | Req: -

Short description:

Adaptation of a 3DG codec for inclusion in a web browser.

 

1.5 Metadata for Realistic Material Representation

Std: 4 | Pt: 16 | Amd: - | Req: -

Short description:

This part defines realistic material representations that specify makeup information on a human face. Combined with the spectrum light source and the spectrum textures of objects specified in MPEG-4 Part 11, this information makes it possible to reproduce a realistic avatar appearance.

 

1.6 Font compression and streaming

Std: 4 | Pt: 18 | Amd: 1 | Req: -

Short description:

Use of decoder-specific information for font data streams, and tools to carry font data streams in ISO/IEC 14496-12.

 

1.7 Composite Font Representation

Std: 4 | Pt: 22 | Amd: - | Req: -

Short description:

The advances in developing the Unicode Standard and the addition of new characters that represent most of the world's languages and writing systems resulted in a significant increase of the Unicode character repertoire to more than 100,000 characters. However, due to the finite size of many bitfields, the existing ISO/IEC 14496-22:2009 specification only provides the capabilities for a single font to support up to 64K glyphs. The Composite Font Standard is intended to resolve the existing limitation by providing a solution that would allow linking of existing OFF fonts (and, possibly, fonts in other formats) into a single Composite Font to be used as a virtual font by any compliant implementation.
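As a rough illustration of the linking idea, the sketch below models a composite font as an ordered list of component fonts, each limited to 64K glyphs, and resolves a code point by asking each component in turn. The class names, the dictionary-based character map and the first-match rule are assumptions for this example; they are not the syntax or the lookup rules of the Composite Font specification.

```python
MAX_GLYPHS = 65536  # the "64K glyphs" limit of a single font

class ComponentFont:
    """A stand-in for one existing font: maps code points to glyph IDs."""
    def __init__(self, name, cmap):
        if len(set(cmap.values())) > MAX_GLYPHS:
            raise ValueError("a single font cannot exceed 64K glyphs")
        self.name = name
        self.cmap = dict(cmap)

class CompositeFont:
    """A virtual font that links several component fonts (hypothetical model)."""
    def __init__(self, components):
        self.components = list(components)

    def lookup(self, codepoint):
        # The first component that covers the code point wins.
        for font in self.components:
            glyph_id = font.cmap.get(codepoint)
            if glyph_id is not None:
                return font.name, glyph_id
        return None  # not covered by any component

latin = ComponentFont("Latin", {0x41: 1, 0x42: 2})
cjk = ComponentFont("CJK", {0x4E2D: 7})
virtual_font = CompositeFont([latin, cjk])
```

Because each component keeps its own glyph numbering, the combined repertoire can exceed 64K glyphs even though no single font does.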

 

1.8 Open Font Format

Std: 4 | Pt: 22 | Amd: E3 | Req: 14330

Short description:

The third edition of the OFF standard consolidates all previously issued corrigenda and amendments and extends the existing technology by providing support for a larger set of the world's languages, adding new text layout features, and introducing support for colour fonts and MATH expression layout.

 

1.9 Web Video Coding

Std: 4 | Pt: 29 | Amd: E1 | Req: -

Short description:

Web Video Coding (WVC) aims to deliver a video coding standard based on royalty-free technology with performance similar to that of the AVC Baseline Profile.

 

1.10 Video Coding for Browsers

Std: 4 | Pt: 31 | Amd: E1 | Req: -

Short description:

Video Coding for Browsers (VCB) is a video coding standard based on technology intended to fall under Type-1 licensing. The standard is suitable for real-time Web video conferencing as well as video content consumption through Web browsers. Its compression quality for relevant application cases is similar to or better than that of the AVC Constrained Baseline Profile.

 

1.11 Contract Expression Language

Std: 21 | Pt: 20 | Amd: - | Req: -

Short description:

S

 

1.12 Media Contract Ontology

Std: 21 | Pt: 21 | Amd: - | Req: -

Short description:

S

 

1.13 Parser instantiation for BSD

Std: B | Pt: 4 | Amd: ? | Req: -

Short description:

This amendment is intended to fully realize the instantiation of the bitstream parser FU from the BSD. It will improve the description functionality of RVC-BSDL so that the bitstream syntax of all state-of-the-art MPEG codecs can be described generically.

 

1.14 Media Tool Library

Std: C | Pt: 4 | Amd: - | Req: -

Short description:

The Media Tool Library (MTL) is a collection of video and graphics coding tools, called Functional Units (FUs).

Each FU implements a specific processing block in the coding process and is specified by both a textual specification, which states its purpose, and a reference implementation expressed in RVC-CAL Actor Language (RVC-CAL). The textual description gives each FU a name, a short description of its functionality, the standard and the Profile it comes from, and the properties of its input and output data.

 

1.15 SAOC Dialog enhancement

Std: D | Pt: 3 | Amd: 3 | Req: -

Short description:

Extension to be used as a supplementary audio codec in the broadcasting environment to efficiently control the dialogue level in a backwards-compatible way.

 

1.16 Audio Dynamic Range Control

Std: D | Pt: 4 | Amd: ? | Req: -

Short description:

The Dynamic Range Control (DRC) standard specifies technology for coding and applying DRC and Program Level control to audio. It is designed to work with any MPEG audio codec, including MPEG-4 AAC, HE-AAC, MPEG-D USAC and MPEG-H 3D Audio.

 

1.17 Media Context and Control – Control Information

Std: V | Pt: 2 | Amd: - | Req: -

Short description:

The scope of MPEG-V Part 2 is defined as the capability descriptions of actuators/sensors in the real world, the user's sensory preference information, which characterize devices and users, and the sensor adaptation preference information, which characterizes sensors and users, so that appropriate information to control devices (actuators and sensors) can be generated. More particularly, the third edition of this standard introduces data types related to capabilities and preferences with respect to automobile sensors, 5D theatre effects and camera framework.

 

1.18 Media Context and Control – Sensory Information

Std: V | Pt: 3 | Amd: - | Req: -

Short description:

This part aims to enhance the experience of users consuming media resources by also stimulating senses other than vision or audition, e.g., olfaction, mechanoreception, equilibrioception, thermo-(re)ception, or proprioception. That is, in addition to the audio-visual content of, e.g., a movie, other senses are stimulated to give the user the sensation of being part of the particular media, which should result in a worthwhile, informative user experience. This motivates the annotation of media resources with metadata, as defined in this part of ISO/IEC 23005, that steers appropriate devices capable of stimulating these other senses. More particularly, the third edition of this standard introduces scent sensory information.

 

1.19 Media Context and Control – Virtual World Object Characteristics

Std: V | Pt: 4 | Amd: - | Req: -

Short description:

This part defines a base type for the attributes and characteristics of virtual world objects, which is shared by both avatars and generic virtual objects. The base type is composed of data types describing an identity, sound resources, scent resources, controlling, input events, and a behavior model. Both avatar metadata and virtual object metadata inherit the virtual world object base type and extend it with their own specific aspects.

 

1.20 Media Context and Control – Data Formats for Interaction Devices

Std: V | Pt: 5 | Amd: - | Req: 10990

Short description:

Specifies the syntax and semantics of the data formats for interaction devices, Device Commands and Sensed Information, required for providing interoperability in controlling and sensing interaction devices. The interaction devices include both actuators and sensors to provide a wide range of interaction capabilities in the real as well as the virtual world. More particularly, the third edition of this standard introduces data types related to automobile sensors, 5D theatre effects and various types of cameras.

 

1.21 Media Context and Control – Common Types and Tools

Std: V | Pt: 6 | Amd: - | Req: -

Short description:

Specifies the syntax and semantics of the data types and tools common to the tools defined in other parts of ISO/IEC 23005. Specifically, it covers basic data types that serve as building blocks in more than one tool of ISO/IEC 23005, color-related basic types that are used in light- and color-related tools to specify color-related characteristics of devices or commands, and time stamp types that can be used in device commands and sensed information to specify timing-related information. Several classification schemes that are used in more than one part of ISO/IEC 23005 are also defined in the annex of this part. Other tools to be developed shall be included in this part of ISO/IEC 23005 if they are to be used in common with more than one tool defined in different parts of ISO/IEC 23005. Most of the tools defined are not intended to be used alone, but as a part of, or in support of, tools defined in other parts of ISO/IEC 23005.

 

1.22 HEVC Range Extensions

Std: H | Pt: 2 | Amd: 1 | Req: -

Short description:

The purpose of this amendment is to provide the capability to support high-fidelity video signals in HEVC. In high-end consumer and professional environments, greater sample accuracy and alternative chroma sampling structures are used. This amendment extends the capability of ISO/IEC 23008-2 for those applications.

1.23 HEVC Multiview Extensions

Std: H | Pt: 2 | Amd: 2 | Req: -

Short description:

The purpose of this amendment of HEVC is to provide a simple capability to support stereoscopic and multiview video. Stereoscopic and multiview video are emerging in the market, and this extension provides simple and efficient video coding with no changes at lower decoder levels, so that re-purposing of existing devices would be possible. Monoscopic video can be extracted as the base view.

 

1.24 3D HEVC

Std: H | Pt: 2 | Amd: 4 | Req: -

Short description:

Encoding a rich representation of 3D video content by supplementing HEVC to encode additional views and depth map information jointly in an efficiently-coded manner.

 

1.25 HEVC Scalable Extensions

Std: H | Pt: 2 | Amd: ? | Req: -

Short description:

The purpose of this amendment of HEVC is to provide scalability at the bitstream level, such that a sub-bitstream allows reconstruction of a video with lower resolution or lower quality. Hierarchies of sub-bitstreams can be supported, where the lowest level (base layer) would also be extractable and decodable by a legacy device (e.g. conforming to the Main profile of HEVC).
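Bitstream-level scalability can be pictured as filtering coded units by layer. The sketch below is a simplified model, not the extraction process of the amendment: the `NalUnit` type and its `layer_id` field are hypothetical stand-ins for the layer identifier carried in each coded unit, and a real extractor has more to consider than a single comparison.

```python
from typing import List, NamedTuple

class NalUnit(NamedTuple):
    layer_id: int   # 0 = base layer; higher values = enhancement layers
    payload: bytes

def extract_sub_bitstream(units: List[NalUnit], max_layer_id: int) -> List[NalUnit]:
    """Keep only the units a decoder limited to `max_layer_id` would use."""
    return [u for u in units if u.layer_id <= max_layer_id]

stream = [
    NalUnit(0, b"base-0"), NalUnit(1, b"enh1-0"),
    NalUnit(0, b"base-1"), NalUnit(2, b"enh2-1"),
]
base_only = extract_sub_bitstream(stream, 0)   # what a legacy decoder would see
two_layers = extract_sub_bitstream(stream, 1)  # base plus first enhancement
```

Dropping units never requires re-encoding, which is what lets a legacy device decode the base layer on its own.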

 

1.26 Coding of screen content

Std: H | Pt: 2 | Amd: ? | Req: -

Short description:

For applications like screen sharing, wireless display and remote computing, efficient coding of the computer screen content is important. Since this content has statistical properties different to natural video, MPEG investigates technology focussing on this material.

 

1.27 3D Audio

Std: H | Pt: 3 | Amd: 1E | Req: 14458

Short description:

Audio

 

1.28 3D Audio Profiles

Std: H | Pt: 3 | Amd: A1 | Req: 14757

Short description:

Audio

 

1.29 HEVC Image Sequences

Std: H | Pt: 12 | Amd: ? | Req: -

Short description:

HEVC is very efficient in coding intra images. In order to support applications such as a camera recording a sequence of still images, MPEG will extend its file format to support sequences of images along with the metadata that is typically associated with single images.

 

1.30 Free Viewpoint Television

Std: ? | Pt: ? | Amd: ? | Req: -

Short description:

MPEG has been engaged in various aspects of 3D video standardization since 2001. MVC enabled the efficient coding of multiple camera views. 3DV (3D Video) enables viewing adaptation and display adaptation of multiview displays. Currently, super multiview 3D displays are emerging, anticipated as the next generation of auto-stereoscopic display, providing ultra-realistic 3D visualization and navigation at acceptable cost. Since these super multiview 3D displays require a huge number of multi-view images to be rendered in real-time, synthesized from a lower number of input camera views (for cost reasons), a new coding standard is essential to realize their services and products in the market.

 

1.31 Internet Video Coding

Std: ?? | Pt: ?? | Amd: - | Req: -

Short description:

Internet Video Coding (IVC) aims to develop a Type 1 video coding standard with a performance as good as possible under the given constraints.

 

1.32 Higher Dynamic Range and Wide Gamut Content Distribution

Std: ? | Pt: ? | Amd: ? | Req: -

Short description:

In the near future, displays will be able to show many more colors than today's TV sets. MPEG is investigating ways to enable transport of video with a wider color gamut and high dynamic range.

 

1.33 Processing and Sharing of Media under User Control

Std: ? | Pt: ? | Amd: ? | Req: -

Short description:

In the actual use of MPEG technologies, there are many contexts that require the processing of the media to be private. This exploration investigates whether MPEG can provide support for these application scenarios.

 

1.34 Genome Compression

Std: ? | Pt: ? | Amd: ? | Req: -

Short description:

R

 

2 Composition coding

2.1 Scene Description Based Collaborative Applications

Std: 4 | Pt: 11 | Amd: E2 | Req: -

Short description:

The Widgets Extensions amendment of 23007-1 contains a small set of extensions coming from early implementation experience. The protocol for widget migration has been defined to ensure the interoperability of widget managers sharing the same service protocol. The possibility to migrate a widget to multiple other widget managers has been added. A simple capabilities exchange mechanism has been added for a widget manager to be able to migrate the appropriate version of a widget to another widget manager.

 

2.2 MMT Composition Information

Std: H | Pt: 11 | Amd: ? | Req: -

Short description:

MMT defines a composition layer to enable the authoring and delivery of rich media services. The Composition Information (CI) is authored using HTML5 and thus exhibits all the capabilities and tools available for HTML5. In addition, MMT CI provides tools to support dynamic media scenes and their delivery over unicast channels, authoring of content for secondary screens, as well as separation of media dynamics from scene setup. This is achieved in a backward-compatible manner using a dedicated CI file in XML format.

 

3 Description coding

3.1 Compact Descriptors for Visual Search

Std: 7 | Pt: 13 | Amd: E1 | Req: -

Short description:

Relating the contents of images to databases requires describing the images and matching each description against those available in the database. This activity strives to create compact descriptors that allow efficient search of images in databases.

 

3.2 Compact Descriptors for Video Analysis

Std: ? | Pt: ? | Amd: ? | Req: -

Short description:

This activity aims to extend content description from images to video. Applications include video retrieval, computer vision, automotive, security and other applications where many video streams need to be evaluated. While traditional implementations compress many video streams and analyze the decoded video at a central site, this activity will enable the “Analyze-Then-Compress” (ATC) paradigm, where descriptors are extracted at the source, compressed, and evaluated at a central site.

 

3.3 User Description

Std: ? | Pt: ? | Amd: 1st Ed | Req: -

Short description:

The MPEG User Description (MPEG-UD) aims to provide interoperability among various personalized applications and services. Users can store all their information in the MPEG-UD. The MPEG-UD may be safely and securely managed by its users, e.g. by separating public data from private, encrypted data. Some data is static while other data is dynamic.

 

3.4 Multiple text encodings, extended classification metadata

Std: 7 | Pt: 5 | Amd: 5 | Req: 14334

Short description:

This amendment extends text and classification descriptors for descriptive metadata for audiovisual content. These extensions address existing data types and add some elements and attributes. The first improves internationalisation for text-based metadata by identifying the script(s) used, and enables carrying strings represented in multiple character sets in the same XML document, using encoding. The second extends classification metadata in order to: (a) improve interoperability with existing metadata formats used in professional media production, (b) increase the cardinality of some elements and (c) add attributes and a version element.

 

3.5 Green Metadata

Std: ? | Pt: ? | Amd: E1 | Req: -

Short description:

Provides metadata for energy-efficient media decoding and presentation as well as energy-efficient media encoding based on encoder resources or receiver feedback. This standard defines compact signaling to facilitate reduced energy consumption from the encoding, decoding and presentation of media content without any degradation in the Quality of Experience (QoE). When power levels are critically low, consumers may prefer to sacrifice their QoE for reduced energy consumption. Green MPEG will provide this capability by allowing energy consumption to be traded off against the QoE.

 

3.6 Storage of traceable media signatures

Std: ? | Pt: ? | Amd: ? | Req: -

Short description:

This adds support for a general framework for sample “variants” in the ISOBMFF. This would be used by a forensic “watermarking” system to modify the base sample, but is independent of the “watermarking” algorithm. Variants are sample data that may be used by a decoder and DRM system to ultimately output video or audio that is marked in a way that can be unique to individual decoders or decoder product models. The application of the variants during the decode process is under control of the DRM system (and ultimately the content provider).

 

4 Systems support

4.1 Audio Synchronization

Std: ? | Pt: ? | Amd: ? | Req: -

Short description:

Audio synchronization technology allows a “second screen” (e.g. smartphone) content presentation to synchronize with a related “first screen” (e.g. home theatre) content presentation. Synchronization of the presentations is achieved by computing a normative fingerprint from the first screen content and transmitting it in a compressed format as a payload in the second screen bitstream. From its microphone input, the second screen device can compute the fingerprint of the first screen content, compute the time offset between the computed fingerprint and the transmitted fingerprint, and make that offset available to the implementation so that the two presentations can be synchronized.
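The offset computation can be sketched in Python as follows. The normative fingerprint is not reproduced here; the sketch assumes, purely for illustration, that a fingerprint is a list of one integer per audio frame, and the function name `estimate_offset` and the match-counting score are hypothetical.

```python
def estimate_offset(transmitted, captured):
    """Return the frame index in `transmitted` where `captured` aligns best.

    Both arguments are lists of per-frame integers (an assumed fingerprint
    format). The score is the number of frames that match exactly.
    """
    best_offset, best_score = 0, -1
    for offset in range(len(transmitted) - len(captured) + 1):
        window = transmitted[offset:offset + len(captured)]
        score = sum(1 for a, b in zip(window, captured) if a == b)
        if score > best_score:
            best_offset, best_score = offset, score
    return best_offset

# Fingerprint carried in the second screen bitstream (whole programme).
transmitted = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3]
# Fingerprint computed from a short microphone capture.
captured = [5, 9, 2]
offset_frames = estimate_offset(transmitted, captured)
```

Multiplying the frame offset by the frame duration gives the time offset that the second screen device needs in order to align its presentation.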

 

4.2 Signalling of Transport profiles, signalling MVC stereo view association and MIME type registration

Std: 2 | Pt: 1 | Amd: - | Req: -

Short description:

MPEG transport did not specify a profile, as all the capabilities were signaled using the relevant bits in the headers. The recent development of streaming standards required signaling of a transport profile where some capabilities were limited. In addition, stereoscopic MVC specifications did not associate the views with left or right eye display. This amendment specifies signaling of 'transport profiles' and extends the MVC descriptor to signal view-to-eye association. The amendment also includes an Annex that lists the MIME types currently used in various applications.

 

4.3 Signalling and Transport of SAOC-DE in AAC

Std: 4 | Pt: 3 | Amd: 4 | Req: -

Short description:

Mechanism to signal and transport SAOC Dialog Enhancement (SAOC-DE) profile data in the context of MPEG-4 Audio.

 

4.4 Coding-independent codepoints

Std: B | Pt: 8 | Amd: ? | Req: -

Short description:

This part of ISO/IEC 23001 defines various code points and fields that document aspects of a video or audio stream that are independent of bit rate and compression. They describe the characteristics of the signal before it is compressed by any encoder suitable for compressing such an input signal, or after it is decompressed.

 

4.5 MMT Forward Error Correction Codes

Std: H | Pt: 10 | Amd: - | Req: -

Short description:

MMT defines several tools for error correction, such as Forward Error Correction (FEC) and retransmissions. A set of FEC codes that share a common FEC framework is defined in 23008-10. Additional codes may also be added. Some of the currently defined FEC codes have enhanced support for layered media.
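To show what forward error correction buys a receiver, here is a minimal sketch of single-parity XOR coding, the simplest FEC scheme. It is not one of the codes defined in 23008-10; the function names are hypothetical and the sketch assumes equal-length packets and at most one lost packet per block.

```python
def xor_parity(packets):
    """Return the byte-wise XOR of equal-length packets (the repair packet)."""
    parity = bytearray(len(packets[0]))
    for packet in packets:
        for i, byte in enumerate(packet):
            parity[i] ^= byte
    return bytes(parity)

def recover_missing(received, parity):
    """Rebuild the single lost packet, marked as None in `received`."""
    present = [p for p in received if p is not None]
    # XOR of all surviving packets and the parity equals the lost packet.
    return xor_parity(present + [parity])

source = [b"abcd", b"efgh", b"ijkl"]
repair = xor_parity(source)          # sent alongside the source packets
arrived = [b"abcd", None, b"ijkl"]   # the second packet was lost in transit
restored = recover_missing(arrived, repair)
```

The receiver repairs the loss without asking the sender for a retransmission, at the cost of one extra packet per block.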

 

4.6 MMT Cross Layer Interface

Std: H | Pt: 1 | Amd: 2 | Req: -

Short description:

MMT introduces the formats and procedures for Cross Layer Interfaces, enabling communication between the different protocol layers of an MMT entity, or between MMT entities, to exchange feedback about an ongoing MMT delivery session in a standardized way. This feedback information may be faster and more accurate than end-to-end feedback information.

 

4.7 Uniform signalling for timeline alignment

Std: ? | Pt: ? | Amd: - | Req: -

Short description:

To investigate the required tools at the systems level to allow a media presentation packaged and delivered in one format over one network type to be “augmented” by another media presentation, possibly using different packaging and/or delivery means.

 

5 IPMP

5.1 Common Encryption Format for ISO Base Media File Format

Std: B | Pt: 7 | Amd: 2 | Req: -

Short description:

A way to encrypt media (audio, video, etc.) in files of the ISO base media file format family. By using a common encryption format, a single media asset can be used by several services and devices using different digital rights management systems, and the implementation complexity that would result from having duplicate files and formats for the same content can be reduced or eliminated.

 

5.2 Support of Sparse Encryption

Std: B | Pt: 9 | Amd: 1 | Req: -

Short description:

S

 

6 Digital Item

7 Transport and File formats

7.1 Carriage of additional audio profiles & levels

Std: 2 | Pt: 1 | Amd: ? | Req: -

Short description:

This is to add support for a general framework to add sample “variants” to the ISOBMFF. This would be used by a forensic “watermarking” system to modify the base sample, but is independent of the “watermarking” algorithm. Variants are sample data that may be used by a decoder and DRM system to ultimately output video or audio that is marked in a way that can be unique to individual decoders or decoder product models. The application of the variants during the decode process is under control of the DRM system (and ultimately the content provider).

 

7.2 Delivery of Timeline for External Data

Std: 2 | Pt: 1 | Amd: 6 | Req: -

Short description:

This amendment defines tools to identify and synchronize external data associated with an MPEG program in MPEG-2 transport streams. The tools allow:

  • alignment of media timelines regardless of PCR discontinuities through a variety of possible time codes,
  • signaling of URLs of associated data and their types (mime types, ISOBMFF, MPEG-DASH, MMT),
  • announcement of upcoming associated data

 

7.3 Carriage of Layered HEVC in MPEG-2 TS

Std: 2 | Pt: 1 | Amd: 7 | Req: -

Short description:

This amendment will specify layered coding techniques, namely multiview and scalable HEVC coding. Layered media streams need to be supported by the transport formats in a way that different layers can be encapsulated and transported individually.

 

7.4 Transport of MVC depth video and HEVC low delay

Std: 2 | Pt: 1 | Amd: - | Req: -

Short description:

This amendment specifies transport of MVC depth video sub-bitstream as defined in Annex I of ISO/IEC 14496-10 including association with other MVC sub-bitstreams. The amendment also adds the signaling (using HEVC video descriptor) to indicate use of HEVC low-delay coding mode in each access unit where the STD buffer management is performed using the HEVC HRD parameters.

 

7.5 Carriage of Green Metadata

Std: 2 | Pt: 1 | Amd: ? | Req: -

Short description:

This amendment specifies a format for carriage of green metadata that enables display power reduction in MPEG-2 TS. The metadata consists of frame statistics and quality indicators. This metadata allows the client to attain a specified quality level by scaling frame-buffer pixels and to reduce power correspondingly by decreasing the display backlight or OLED voltage.
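The trade between pixel scaling and backlight level can be sketched as follows. This is an illustrative model only: it assumes 8-bit pixel values, treats perceived brightness as pixel value times backlight level (ignoring gamma), and uses the frame's peak pixel value as a stand-in for a frame statistic carried in the metadata. The function name `scale_for_backlight` is hypothetical.

```python
def scale_for_backlight(pixels, peak):
    """Brighten pixels and dim the backlight by the same factor.

    `peak` is the largest pixel value in the frame (assumed to arrive as
    metadata). Under the linear model used here, pixel * backlight is
    unchanged, so the picture looks the same at a lower backlight level.
    """
    if not 0 < peak <= 255:
        raise ValueError("peak must be in 1..255")
    gain = 255.0 / peak
    scaled = [min(255, round(p * gain)) for p in pixels]
    backlight = peak / 255.0  # fraction of the full backlight level
    return scaled, backlight

# A dark frame whose brightest pixel is 51 out of 255.
scaled, backlight = scale_for_backlight([0, 10, 51], peak=51)
```

For this dark frame the backlight can drop to one fifth of its full level while the scaled pixels compensate.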

 

7.6 Carriage of 3D Audio

Std: 2 | Pt: 1 | Amd: ? | Req: -

Short description:

This amendment defines the stream types, descriptors and buffer model needed to carry an MPEG-H 3D audio bitstream in MPEG-2 TS. Two stream types will be assigned to distinguish the main stream from auxiliary streams. Descriptors will provide information on user-selectable and/or modifiable audio objects and on which object contains either supplementary or main audio. A T-STD extension will allow an encoded audio scene to be split into several elementary streams, each carrying one or more encoded channel signals. A single audio decoder decodes all elementary streams into one audio presentation.

 

7.7 Enhanced audio support and other improvements

 

Std

Pt

Amd

Req

Short description

4

12

2

 

Enhancement of the codec-independent support for audio in the file format, adding the ability to declare downmixing and dynamic range control of audio, and describe the loudness of the program, before or after downmix or dynamic range control. It also supports the codec-independent declaration of channel layout, and identifying the carriage of DRC coefficients both in the audio stream, and in a separate track.

 

 

7.8 Font streams and other improvements to file format

 

Std

Pt

Amd

Req

Short description

4

12

3

 

This amendment modifies the description and format of some existing fields of the decoderSpecificInfo and also provides a definition of a font data box for the ISO Base Media File Format.

 

7.9 Ordered combination of separate tracks

 

Std

Pt

Amd

Req

Short description

4

12

?

 

Tools describing in which order media tracks may be combined for decoding purposes, using implicit aggregation rules

 

7.10 Carriage of MVC+D in ISO Base Media File Format

 

Std

Pt

Amd

Req

Short description

4

15

 

 

This amendment specifies the storage of video bitstreams consisting of multiple views and the associated depth, encoded based on Annex I of ISO/IEC 14496-10. The design is based on the MVC file format, which is specified in Clause 7 of ISO/IEC 14496-15, in a backwards-compatible manner. In the design, storage of the texture and depth of a particular view in either separate tracks or the same track is supported. The design also includes the signalling of various indications, such as the presence of texture and/or depth for each view, as well as whether the texture or depth component or both of a view is required for the presentation of another view.

 

7.11 Enhanced carriage of HEVC

Std

Pt

Amd

Req

Short description

4

15

?

 

This amendment specifies the storage of video bitstreams consisting of multiple views and the associated depth, encoded based on Annex I of ISO/IEC 14496-10. The design is based on the MVC file format, which is specified in Clause 7 of ISO/IEC 14496-15, in a backwards-compatible manner. In the design, storage of the texture and depth of a particular view in either separate tracks or the same track is supported. The design also includes the signalling of various indications, such as the presence of texture and/or depth for each view, as well as whether the texture or depth component or both of a view is required for the presentation of another view. The amendment also adds the signaling (using HEVC video descriptor) to indicate use of HEVC low-delay coding mode in each access unit where the STD buffer management is performed using the HEVC HRD parameters

 

7.12 Carriage of Layered HEVC

 

Std

Pt

Amd

Req

Short description

4

15

4

14576

S

 

7.13 Enhanced audio support

 

Std

Pt

Amd

Req

Short description

4

12

4

14324

This amendment adds explicit support for the newly developed Dynamic Range Control tools, and for loudness measures, in the ISO base media file format, applicable to audio streams stored in the file format.

 

7.14 Timed Metadata Metrics of Media in the ISO Base Media File Format

 

Std

Pt

Amd

Req

Short description

B

10

E1

 

Specifies a storage format for commonly used timed metadata metrics of media, such as the quality-related PSNR, SSIM, and others, for carriage in metadata tracks of the ISO Base Media File Format.
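To make the notion of a timed quality metric concrete, the sketch below computes PSNR per frame using the standard definition, 10·log10(MAX² / MSE), and pairs each value with a presentation time. The helper names `psnr` and `timed_psnr` and the list-of-tuples layout are assumptions for illustration only; they do not reflect the storage format defined by the specification.

```python
import math

def psnr(reference, distorted, max_value=255):
    """Peak signal-to-noise ratio in dB between two equal-length sample sequences."""
    if len(reference) != len(distorted) or not reference:
        raise ValueError("inputs must be non-empty and equal in length")
    # Mean squared error over all samples.
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10 * math.log10(max_value ** 2 / mse)

def timed_psnr(frames, frame_duration):
    """Return (presentation_time, psnr) pairs for a list of (reference, distorted) frames."""
    return [(index * frame_duration, psnr(ref, dist))
            for index, (ref, dist) in enumerate(frames)]
```

Each resulting pair is the kind of value that a metadata track could carry alongside the media samples it describes.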

 

7.15 Sample Variants in ISOBMFF

 

Std

Pt

Amd

Req

Short description

B

12

E1

14598

S

 

7.16 Additional technologies for MPEG Media Transport

 

Std

Pt

Amd

Req

Short description

H

1

1

 

Additional tools for MMT, such as Asset Delivery Characteristics, which describe the Quality of Service (QoS) required for delivery of Assets, and Automatic Repeat reQuest (ARQ) messages.

 

7.17 Image File Format

 

Std

Pt

Amd

Req

Short description

H

12

E1

 

Support for

  1. sequences, timed or untimed, with or without audio etc.
  2. single still images, the simple case, maybe based on JPX

 

7.18 MMT Implementation Guidelines

 

Std

Pt

Amd

Req

Short description

H

12

E1

 

The MMT Implementation Guidelines describe the usage of MMT for different media delivery scenarios. They describe the different functions that MMT provides and show, using examples, how these can be deployed separately or together to realize a media delivery service.

 

7.19 Extended Profiles and time synchronization

 

Std

Pt

Amd

Req

Short description

DA

1

1

 

This amendment will add support for UTC timing synchronization and in-band event synchronization.

 

7.20 Spatial Relationship Description, Generalized URL parameters and other extensions

 

Std

Pt

Amd

Req

Short description

DA

1

2

 

This amendment to ISO/IEC 23009-1 adds the ability for MPD authors to express:

-       Spatial relationships between representations in the MPD;

-       Flexible parameter insertions in URLs used to query media segments;

-       Role @values compatible with the kind values used in the W3C HTML5 recommendation;

-       Different signaling of client authentication and content authorization methods.

 

 

7.21 MPEG-DASH Implementation Guidelines

 

Std

Pt

Amd

Req

Short description

DA

3

E1

 

This technical report contains a set of guidelines for the design and deployment of streaming media delivery systems using the ISO/IEC 23009 (MPEG-DASH) standard. This includes guidelines for content generation, guidelines for client implementation, and examples of deployment scenarios.

 

7.22 Server and Network Assisted DASH

 

Std

Pt

Amd

Req

Short description

DA

 

 

 

S

 

8 Multimedia architecture

8.1 MPEG-M API

 

Std

Pt

Amd

Req

Short description

M

2

E3

 

The 3rd Edition of the MPEG-M API introduces a set of interfaces facilitating the usage of MPEG-V related technologies. In particular, the API makes it possible to access MPEG-V sensor data from a local or remote resource, to query any effect description, and to modify and packetize it in an MPEG-V compliant format. The MPEG-V engine described in this version of the standard is called the Sensory Effect Processing (SEP) Engine. It is responsible for processing the sensory effect metadata contained in 4D media and for rendering the sensory effects in synchronization with the actual audio-visual content. The SEP engine can interact with the existing Media Framework Engine.

 

8.2 MPEG-V Architecture

 

Std

Pt

Amd

Req

Short description

V

1

E5

 

MPEG-V (Media context and control), published in ISO/IEC 23005, provides an architecture and specifies associated information representations to enable the interoperability between virtual worlds, e.g., digital content providers of a virtual world, (serious) gaming, simulation, and with the real world, e.g., sensors, actuators, vision and rendering, robotics. MPEG-V is applicable in various business models/domains for which audiovisual contents can be associated with sensorial effects that need to be rendered on appropriate actuators and/or benefit from well-defined interaction with an associated virtual world.

MPEG-V Part 1 specifies the Architecture of MPEG-V by specifying the components, APIs and use cases. In the 3rd Edition of the document, use cases related to automobile sensors and makeup avatars are added.

 

9 Application formats

9.1 Augmented Reality AF

 

Std

Pt

Amd

Req

Short description

A

13

E2

 

The Augmented Reality Application Format (ARAF) is a collection of a subset of the ISO/IEC 14496-11 (MPEG-4 part 11) Scene Description and Application Engine standard, combined with other relevant MPEG standards (e.g. ISO/IEC 23005 - MPEG-V), designed to enable the consumption of real-time 2D/3D hybrid content. The second edition focuses in particular on tools such as 3D video, 3D audio, image and video descriptors for recognition and tracking, and server-side processing.

9.2 Mixed and Augmented Reality Reference Model

 

Std

Pt

Amd

Req

Short description

A

14

E1

 

The MAR (Mixed and Augmented Reality) reference model aims at defining the domain of mixed/augmented reality, the main concepts, various terms and their definitions, and an overall system architecture analyzed from various viewpoints, as well as the principles on which the document will be built. The MAR reference model specifies a set of minimum functions and performance levels that should be supported by a MAR system. The reference model contains a list of possible use cases and their validation with respect to the architecture it proposes.

 

9.3 Multimedia Preservation

 

Std

Pt

Amd

Req

Short description

A

15

E1

14338

The objective of the Multimedia Preservation Description Information (MPDI) framework is to provide a standardized description of multimedia content that enables users to plan, execute, and evaluate preservation operations in order to achieve the objectives of digital preservation.

 

9.4 Multisensory Effects Application Format

 

Std

Pt

Amd

Req

Short description

A

?

?

 

The objective of this exploration is to group together a set of MPEG technologies for addressing applications such as 4D Media. 4D Media is a new type of media service provided by combining traditional media with its sensory effects. In 4D Media, traditional media data could be represented by conventional audio or video data, and the accompanying sensory effect metadata could be represented using the MPEG-V standard. 4D Media can provide sensory effects, giving users a realistic experience and a sense of immersion.

 

9.5 Publish/Subscribe Application Format (PSAF)

 

Std

Pt

Amd

Req

Short description

A

?

?

 

Publish/Subscribe (PubSub) is an established communication paradigm where senders do not communicate information directly to intended receivers but rely instead on a service that mediates the relationship between senders and receivers. While generic PubSub specifications exist, there are some specific features that are typical of a multimedia application that can be easily supported by a media-friendly PubSub format based on MPEG technology.
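The mediated relationship described above can be sketched in a few lines of Python. This is a minimal in-memory illustration of the generic PubSub paradigm only; the `Broker` class and its method names are hypothetical and say nothing about the message formats that PSAF would define.

```python
from collections import defaultdict

class Broker:
    """Mediating service: publishers and subscribers only ever talk to the broker."""

    def __init__(self):
        # Maps a topic name to the list of callbacks registered for it.
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, callback):
        """Register a callable to be invoked for every message on `topic`."""
        self._subscribers[topic].append(callback)

    def publish(self, topic, message):
        """Deliver `message` to all subscribers of `topic`; return how many received it."""
        callbacks = self._subscribers.get(topic, [])
        for callback in callbacks:
            callback(message)
        return len(callbacks)
```

A sender calling `publish` never learns who the receivers are, which is the decoupling the paradigm relies on.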

 

9.6 Adaptive Screen Content Sharing Application Format

 

Std

Pt

Amd

Req

Short description

A

?

?

 

The basic idea for ASCS MAF is to treat a screen as a composition of its screen components (e.g., windows, background images, and icons), to model a screen as a manifest of the screen and its components, to share and transport the manifest and its components via adequate protocols, and to update this manifest from time to time according to updates to the initial screen.
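One possible way to picture this manifest-based model is the Python sketch below. All class and field names (`ScreenComponent`, `ScreenManifest`, `apply_update`, the version counter) are assumptions made for illustration; the actual manifest structure and transport protocols are left to the application format.

```python
from dataclasses import dataclass, field

@dataclass
class ScreenComponent:
    component_id: str
    kind: str          # e.g. "window", "background", "icon"
    x: int
    y: int
    width: int
    height: int

@dataclass
class ScreenManifest:
    version: int = 0
    components: dict = field(default_factory=dict)  # component_id -> ScreenComponent

    def apply_update(self, changed=(), removed=()):
        """Apply an incremental update and return the new manifest version."""
        for component in changed:
            # Add a new component or replace an existing one with the same id.
            self.components[component.component_id] = component
        for component_id in removed:
            self.components.pop(component_id, None)
        self.version += 1
        return self.version
```

In this sketch only the changed or removed components need to be sent when the screen is updated, rather than the whole screen.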

 

10 Reference implementation

10.1 New levels for AAC profiles, uniDRC support and AAC block length Reference Software

 

Std

Pt

Amd

Req

Short description

4

5

37

 

 

 

10.2 MVC plus depth extension of AVC Reference Software

 

Std

Pt

Amd

Req

Short description

4

5

33

 

Provides a reference implementation of the MVC-plus-depth extension of AVC

10.3 Multi-resolution Frame Compatible Stereo Coding extension of AVC Reference Software

 

Std

Pt

Amd

Req

Short description

4

5

34

 

Provides a reference implementation of the MFC extension of AVC

 

10.4 3D extension of AVC Reference Software

 

Std

Pt

Amd

Req

Short description

4

5

35

 

This amendment will provide example software for the combined texture and depth coding extensions for MPEG-4 AVC (MPEG-4 part 10).

 

10.5 Pattern based 3D mesh compression Reference Software

 

Std

Pt

Amd

Req

Short description

4

5

36

 

This standard introduces the reference software implementation for the Pattern based 3D mesh compression as defined in MPEG-4 AFX.

 

10.6 Video Coding for Browsers Reference Software

 

Std

Pt

Amd

Req

Short description

4

5

38

14674

xxx

 

10.7 CEL and MCO Reference Software

 

Std

Pt

Amd

Req

Short description

21

8

1

 

xxx

 

10.8 MPEG-7 Visual Reference Software

 

Std

Pt

Amd

Req

Short description

7

6

2E

 

MPEG-7 Reference Software: Update of MPEG-7 Reference Software to make it compatible with current compilers and external libraries and ensure conformance. The 2nd edition of ISO/IEC 15938-6 will address the visual part of the MPEG-7 Reference Software

 

10.9 CDVS Reference Software

 

Std

Pt

Amd

Req

Short description

7

13

1

 

xxx

 

10.10 ARAF reference software

 

Std

Pt

Amd

Req

Short description

A

14

1

 

ARAF Reference Software provides an implementation of the ARAF PROTOs.

 

10.11 Media Tool Library Reference Software

 

Std

Pt

Amd

Req

Short description

C

4

1

 

This work provides VTL conformance testing and RVC simulation model (RSM).

VTL conformance testing comprises input/output patterns that can be used for testing FU-level conformance. The FU conformance testing is compliant with the VTL standard (ISO/IEC 23002-4:2010). RSM is the reference implementation of the RVC framework defined in ISO/IEC 23001-4:2010 and ISO/IEC 23002-4:2010.

 

10.12 SAOC and SAOC Dialogue Enhancement Conformance

 

Std

Pt

Amd

Req

Short description

D

1

?

 

xxx

 

10.13 MPEG-DASH Reference Software

 

Std

Pt

Amd

Req

Short description

D

2

E2

 

 

 

10.14 MPEG-V – Reference Software

 

Std

Pt

Amd

Req

Short description

V

5

 

 

Provides the reference implementation of the XML and binary version of MPEG-V types

 

10.15 MMT Reference Software

 

Std

Pt

Amd

Req

Short description

H

4

E1

 

Reference software for MMT version 1 (23008-1)

 

10.16 HEVC Reference Software

 

Std

Pt

Amd

Req

Short description

H

5

E1

 

Reference software for HEVC version 1 (23008-2)

 

10.17 3D Audio Reference Software

 

Std

Pt

Amd

Req

Short description

H

6

E1

 

Reference software for 3D Audio (23008-3)

 

11 Conformance

11.1 New levels for AAC profiles and uniDRC support

 

Std

Pt

Amd

Req

Short description

4

5

37

14447

PDAM 5 specifies where and in which format the MPEG-D DRC metadata is carried in AAC. It also contains the specification of the SAOC DE profile and level indication.

 

11.2 Multi-resolution Frame Compatible Stereo Coding extension of AVC Conformance

 

Std

Pt

Amd

Req

Short description

4

4

42

13744

Conformance bitstreams for the MFC extension of AVC (14496-10:2012/Amd.5)

 

11.3 3D-AVC Conformance

 

Std

Pt

Amd

Req

Short description

4

4

43

 

Conformance bitstreams for 3D AVC

 

11.4 Pattern based 3D mesh compression Conformance

 

Std

Pt

Amd

Req

Short description

4

27

6

 

Provides the conformance data set for Pattern-based 3D mesh compression as defined in MPEG-4 AFX

 

11.5 Video Coding for Browsers Conformance

 

Std

Pt

Amd

Req

Short description

4

31

E1

 

This amendment will provide a specification and associated test set for testing encoder and decoder conformance to the Video Coding for Browsers specification in ISO/IEC 14496-31

 

11.6 CDVS Conformance

 

Std

Pt

Amd

Req

Short description

7

13

1

 

xxx

 

11.7 CEL and MCO Conformance

 

Std

Pt

Amd

Req

Short description

21

14

1

 

xxx

 

11.8 ARAF Conformance

 

Std

Pt

Amd

Req

Short description

A

14

1

 

ARAF Conformance provides examples validating the ARAF PROTOs. The examples are encapsulated in MPEG-4 files, which can be played by an ARAF player.

 

11.9 Media Tool Library Conformance

 

Std

Pt

Amd

Req

Short description

C

4

1

 

This work provides VTL conformance testing and RVC simulation model (RSM).

VTL conformance testing comprises input/output patterns that can be used for testing FU-level conformance. The FU conformance testing is compliant with the VTL standard (ISO/IEC 23002-4:2010). RSM is the reference implementation of the RVC framework defined in ISO/IEC 23001-4:2010 and ISO/IEC 23002-4:2010.

11.10 SAOC and SAOC Dialogue Enhancement Conformance

 

Std

Pt

Amd

Req

Short description

D

1

?

 

xxx

 

11.11 MPEG-V – Conformance

 

Std

Pt

Amd

Req

Short description

V

4

E4

 

Provides the conformance of the XML and binary version of MPEG-V types.

 

11.12 MMT Conformance

 

Std

Pt

Amd

Req

Short description

H

4

E1

 

Conformance for MMT version 1 (23008-1)

 

11.13 HEVC Conformance

 

Std

Pt

Amd

Req

Short description

H

5

E1

 

Conformance for HEVC version 1 (23008-2)

 

11.14 3D Audio Conformance

 

Std

Pt

Amd

Req

Short description

H

6

E1

 

Conformance for 3D Audio (23008-3)

12 Maintenance

12.1 Systems coding standards

Std

Pt

Cor

Req

Short description

 

 

 

 

Collection of defect reports and development of corrigenda in the systems coding area

 

12.2 Video coding standards

Std

Pt

Cor

Req

Short description

 

 

 

 

Collection of defect reports and development of corrigenda in the video coding area

 

12.3 Audio coding standards

 

Std

Pt

Cor

Req

Short description

 

 

 

 

Collection of defect reports and development of corrigenda in the audio coding area

 

12.4 3DG coding standards

 

Std

Pt

Cor

Req

Short description

4

16

 

 

Collection of defect reports and development of corrigenda in the 3DG coding area

 

12.5 Systems description coding standards

 

Std

Pt

Cor

Req

Short description

7

1

 

 

Collection of defect reports and development of corrigenda in the systems description coding area

 

12.6 Visual description coding standards

 

Std

Pt

Cor

Req

Short description

7

3

 

 

Collection of defect reports and development of corrigenda in the visual description coding area

 

12.7 Audio description coding standards

 

Std

Pt

Cor

Req

Short description

7

4

 

 

Collection of defect reports and development of corrigenda in the audio description coding area

 

12.8 MPEG-21 standards

 

Std

Pt

Cor

Req

Short description

21

 

 

 

Collection of defect reports and development of corrigenda for MPEG-21 standards

 

12.9 MPEG-A standards

 

Std

Pt

Cor

Req

Short description

A

 

 

 

Collection of defect reports and development of corrigenda for multimedia application standards
