Based on the use cases, the following formats, codecs, and packaging formats are of relevance for AR conversational:
-
General
-
2D Video Formats and video compression codecs
-
Audio Formats and audio compression codecs supporting mono, stereo, and/or spatial audio
-
In addition, for downlink
-
Immersive media 3D Formats such as static and dynamic point clouds or meshes
-
Decoding tools for such formats
-
Composed Scene Graph and Scene Description
-
In addition, for uplink
-
Immersive media 2D Video Formats with depth
-
Immersive media 3D Formats such as static and dynamic point clouds or meshes
-
Encoding tools for such formats
-
Streaming of sensor information (e.g., gyroscope, accelerometer) as well as pose information
Table 6.5.7-1 shows the list of potential instantiations and how they may be composed from each building block described in
clause 6.5.1.
The list of potential standardization area that has been collected is provided in the following:
-
Immersive media format and profile with integration into relevant 5G architecture
-
Scene description format, functionality, and profile as an entry point of immersive media
-
Scene description update mechanism
-
Relevant subset of media codecs for different media types and formats
-
CMAF encapsulation of immersive media for 5G media streaming
-
Media payload format to be mapped into RTP streams
-
Capability exchange mechanism and relevant signalling (e.g., SDP)
-
Protocol stack and content delivery protocol for various architecture options as identified in Table 6.5.3-1
-
Functionalities to support split rendering and network-based media processing allocation with 5G edge/MRF
-
Required QoS and QoE for AR/MR conversational service