INTERNATIONAL ORGANISATION FOR STANDARDISATION
ORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC1/SC29/WG11
CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC1/SC29/WG11 N11967
March 2011, Geneva, Switzerland
Source | Systems
Status | Approved
Title | Advanced User Interaction: 1-pager
Author | Seong Yong Lim, Jihun Cha, Electronics and Telecommunications Research Institute (ETRI), Korea
MPEG-U Advanced User Interaction Interface (AUI)
Providing Tools for Interacting via Advanced UI devices
User interface technology has progressed rapidly in recent years. Although the dominant user interaction devices are still the mouse and keyboard for PCs and the remote controller for TVs, there is growing evidence of the adoption of advanced sensing technologies. For instance, major game device manufacturers have released motion-based game titles that interact with the player via a motion sensor. Meanwhile, MPEG has developed several scene description technologies, such as Binary Format for Scenes (BIFS) and Lightweight Application Scene Representation (LASeR). These technologies can be widely used by industries and applications on fixed and mobile devices to represent a scene composed of video, audio, 2D graphics objects, and animations. However, current MPEG standards mostly focus on basic interaction devices such as pointing and keying devices. Consequently, the need for common data formats for the Advanced User Interaction (AUI) devices mentioned above has been highlighted, both to improve the capabilities of interactive rich media services and to enable their new deployment.
This part of ISO/IEC 23007 specifies advanced user interaction interfaces to support various advanced user interaction devices. The AUI interface is part of the bridge between scene descriptions and system resources. A scene description is a self-contained living entity composed of video, audio, 2D graphics objects, and animations. Through the AUI interfaces, or other existing interfaces such as DOM events, a scene description accesses the system resources it needs to interact with users. In general, a scene is composed by a third party and deployed remotely.
Advanced user interaction devices, such as motion sensors and multi-touch interfaces, generate physical sensed information from the user's environment. Through a recognition process, a set of physical information can be converted into a pattern with semantics, which is more useful to a scene description. For instance, feature points drawn by a user's finger can be recognized as a circle, specified by its center position and a radius value. Therefore, this part provides a set of data formats that define geometric patterns, symbolic patterns, touch patterns, posture patterns, and their composite patterns.
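
As an informal illustration of the recognition step described above, the following Python sketch converts a list of finger-drawn feature points into a circle pattern given by a center and a radius. It is not taken from ISO/IEC 23007: the names CirclePattern and recognize_circle, the tolerance parameter, and the centroid-based estimate are all assumptions made for this example, and the estimate is only reasonable when the points are spread fairly evenly around a full circle.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple

Point = Tuple[float, float]


@dataclass
class CirclePattern:
    """Hypothetical container for a recognized circle (not the normative format)."""
    center_x: float
    center_y: float
    radius: float


def recognize_circle(points: List[Point], tolerance: float = 0.1) -> Optional[CirclePattern]:
    """Return a CirclePattern if the points look like a circle, else None.

    Assumption: the center is estimated as the centroid of the points and the
    radius as the mean distance to it. This is a simplification, not a full
    least-squares circle fit.
    """
    if len(points) < 3:
        return None

    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)

    distances = [math.hypot(x - cx, y - cy) for x, y in points]
    radius = sum(distances) / len(distances)
    if radius == 0:
        return None

    # Reject the stroke if any point strays too far from the estimated radius.
    max_deviation = max(abs(d - radius) for d in distances)
    if max_deviation > tolerance * radius:
        return None

    return CirclePattern(cx, cy, radius)


# Twelve sample points evenly spaced on a circle centered at (2, 3) with radius 5.
circle_points = [
    (2 + 5 * math.cos(2 * math.pi * k / 12), 3 + 5 * math.sin(2 * math.pi * k / 12))
    for k in range(12)
]
circle_result = recognize_circle(circle_points)

# Points along a straight line should not be recognized as a circle.
line_result = recognize_circle([(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)])
```

The point of the sketch is the reduction in data: a scene description that receives the pattern handles three numbers with a clear meaning rather than the raw list of sensed points.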