INTERNATIONAL
ORGANISATION FOR STANDARDISATION
ORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC1/SC29/WG11
CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC1/SC29/WG11 N4518
Pattaya, December 2001
Source: Requirements
Author: Rob Koenen
Title: From MPEG-1 to MPEG-21: Creating an Interoperable Multimedia Infrastructure
This overview first appeared in EE Times [1] as a part of a special section on MPEG [2]. The EE Times version of the article is here [3]
The debate over MPEG-4’s future is heating up. Its specifications have been available for some time, the MPEG-4 Industry Forum (M4IF) just welcomed its 100th member and the Internet Streaming Media Alliance (ISMA) released its open specification for Internet streaming based on MPEG-4 this month. The MPEG-4 community anxiously awaits news from patent holders about MPEG-4 licensing, slated to start in January. MPEG-4 is the first open, comprehensive multimedia content representation standard that will support many multimedia industries, like MPEG-2 supports the DVD and television industries. The object-based MPEG-4 standard is both state-of-the art and future-proof; it can easily incorporate improvements in technology if and when they materialize. In fact, a joint ITU-T / MPEG team is adding a new video coding part over the next year, capturing the cutting edge of video coding technology. This part will also be a standard of the International Telecommunication Union (ITU-T).
Meantime, the Moving Picture Experts Group has not sat still after getting MPEG-4 ready for prime time. Recently, it finalized the first version of the International MPEG‑7 Standard for Content Description, to be published by ISO within the next few months. MPEG-7 will complement MPEG-4, not replace it. MPEG-4 defines how to represent content; MPEG-7 specifies how to describe it. And on the horizon there is yet another ISO MPEG standard, MPEG-21, which aims to provide a truly interoperable multimedia framework. The essence of all MPEG efforts is interoperability – interoperability for the consumer. Interoperability means that consumers can be sure to be able to use the content and not be bugged by incompatible formats, codecs, metadata, and so forth.
MPEG-1 and MPEG-2 provide interoperable ways of representing audiovisual content, commonly used on digital media and on the air. MPEG-4 extends this to many more application areas through features like its extended bitrate range, its scalability, its error resilience, its seamless integration of different types of ‘objects’ in the same scene, its interfaces to digital rights management systems and its powerful ways to build interactivity into content. MPEG-7 defines an interoperable framework for content descriptions way beyond the traditional ‘metadata’. MPEG-7 has descriptive elements that range from very ‘low-level’ signal features like colors, shapes and sound characteristics, to high level structural information about content collections. MPEG-7 is also unique in its tools for structuring information about content. MPEG-7 and MPEG-4 form a great couple, especially when MPEG-4 objects are used. With MPEG-7, it is now possible to exchange information about multimedia content in interoperable ways, making it easier to find content and identify just what you wanted to use. MPEG-7 information will be added to broadcasts; personal video recorders and search engines can use it, and it greatly facilitates managing multimedia content in often large content repositories. Audiovisual archives are currently hard to search from outside the organizations that own them, because they all employ their own metadata schemes. MPEG-7 will lift that barrier.
All this interoperability sounds promising, but what is achieved may be almost completely undone by efforts to protect the digital assets. As digital multimedia spreads to many different platforms, transmission speeds increase and storage costs fall, digital rights management (DRM) becomes a necessity to protect the value of content. In its current form, DRM could unfortunately go against the very goal of interoperability, as it locks up the ‘standardized content’ using non-standardized protection mechanisms. This development is actually not even recent: many proprietary conditional access (CA) systems make standard MPEG-2 TV content inaccessible to people that happen to own the wrong set top box — even when they can receive the signal and are willing to pay the associated content fees.
Very early on, MPEG understood that more interoperability in DRM is crucial to an open multimedia infrastructure. While recently a very popular topic in almost every digital forum, already five years ago, representatives from rights owners, technology providers and CA/DRM providers sat down together to discuss the issue in the context of MPEG-4. This gave us the ‘hooks’: a set of standard interfaces to proprietary Intellectual Property Management and Protection (IPMP) systems, deeply embedded in MPEG-4 Systems. These were a step in the right direction: if you want to play content you ‘only’ need to plug in the right IPMP system, and where to obtain it can be signaled in the bitstream.
But it was not enough. A portable music player cannot download an IPMP system – interoperability lost. In the summer of 2000 MPEG started to work on more interoperable IPMP for MPEG-4. (Note that IPMP is MPEG-speak for digital rights management.) This work is now at ‘Committee Draft stage (the first public draft) and will be finalized in May 2002. Interoperability in DRM is a very hard question. It requires standardized trust. Content owners must for instance be able to trust all the players that consume the content. This type of trust is very difficult to standardize; the necessary trust infrastructure is not readily available. But, building on its 5-year experience with often difficult IPMP discussions, MPEG is perhaps the only forum where the issue can be technically addressed.
This brings us to the MPEG-21 Multimedia Framework. To achieve true end-to-end interoperability, more is needed than the interoperable IPMP terminal architecture mentioned above. According to its Technical Report (an almost final version is here), MPEG-21’s goal is to describe a ‘big picture’ of how different elements to build an infrastructure for the delivery and consumption of multimedia content – existing or under development – relate to each other. In setting the vision and starting the work, MPEG‑21 has drawn much new blood to MPEG, including representatives from major music labels, the film industry and technology providers.
The MPEG-21 world consists of Users that interact with Digital Items. A Digital Item can be anything from an elemental piece of content (a single picture, a sound track) to a complete collection of audiovisual works. A User can be anyone who deals with a Digital Item, from producers to vendors to end-users. Interestingly, all Users are ‘equal’ in MPEG-21, in the sense that they all have their rights and interests in Digital Items, and they all need to be able to express those. For example: usage information is valuable content in itself; an end-user will want control over its utilization. A driving force behind MPEG-21 is the notion that the digital revolution gives every consumer the chance to play new roles in the multimedia food chain. There are 6 billion potential MPEG‑21 Users out there: producers, packagers, resellers, distributors, ...
MPEG-21 seeks to use existing standards where possible, to facilitate their integration and to fill in gaps. It does so together with appropriate other standards fora. Highly laudable and rather abstract, but MPEG is currently drafting a number of very interesting, concrete and commercially relevant parts of the MPEG-21 standard. Counting the MPEG-21 Technical Report as part number one, the second part of MPEG-21 will be ready in summer 2002. This is the Digital Item Declaration, a concise and powerful XML-based schema for declaring Digital Items. Arguably more ambitious is MPEG-21’s third part: the Digital Item Identification and Description. This work solves the problem of uniquely identifying digital content in a global way, and giving a resolution mechanism along with the unique identification. Imagine you found a piece of content — got it from a friend, stumbled across it on the Web, received it on a CD — and you want to ‘consume’ it. The content is protected, but the Digital Item Identification tells you where to go to find information about its rights.
The rights information is coded using the two further MPEG-21 parts, the Rights Expression Language, REL (part 5) and the Rights Data Dictionary, RDD (part 6). These two parts together allow the expression of rights in an interchangeable form, using a standardized syntax (REL) and standardized terms (RDD). The Call for Proposals for these parts is out; proposals are due end November; the standards will be ready early 2003. Probably the Rights Expression Language will be based on XML, but equally likely it will also have a compact, binary representation to be used under bandwidth-constrained, real-time conditions. Between parts 3 and 5 above, the work on more interoperable IPMP in MPEG-4 was recently added to MPEG-21 as its fourth element, because it applies to MPEG-7, -2 and -1 just as well.
The 7th framework element will be a unified description of environments in which content is being used. This covers networks, terminals and access conditions. The goal is to achieve ‘Universal Multimedia Access’, where content can adapt itself seamlessly to dynamic consumption circumstances.
MPEG-21 is developed using a staggered approach, the various parts following each other in time. Future MPEG-21 work items will likely include: Content Representation (how the media resources are represented beyond the existing MPEG standards) Content Handling and Usage (interfaces for managing content), and Event Reporting.
MPEG-4 is now proving its viability in the market as an open standard for multimedia. The ecosystem is coming to life: players, servers, hardware and software, testing systems, IP cores, and authoring tools are all being readied. It will mean a major step towards more interoperability in multimedia. MPEG-7 will help manage the growing abundance of content, and MPEG-21 will make trusted interaction with content much more transparent, creating a level playing field for all participants in the multimedia food chain. Users will only stand to benefit.
[1] www.eetimes.com
[2] Junko Yoshida (ed.), MPEG Standards,
In Focus, EE Times, 12 November 2001 http://www.eetimes.com/story/getHeadlines?category=2582&the_date=
11/12/01&title=MPEG+Standards&header=in_focus/system_design/header.html
&footer=in_focus/system_design/footer.html
Rob Koenen is Senior Director of Technology Initiatives at InterTrust Technologies Corporation. He chairs MPEG’s Requirements Group and is the President of the MPEG-4 Industry Forum.