Content for TR 26.998 Word version: 18.1.0

0… 4… 4.2… 4.2.2… 4.2.2.2 4.2.2.3 4.2.2.4 4.2.3… 4.3… 4.4… 4.5… 4.6… 4.6.4… 4.6.5… 4.6.8… 5 6… 6.2… 6.2.4… 6.2.4.2 6.2.5… 6.3… 6.3.4… 6.3.4.2 6.3.5… 6.4… 6.4.4 6.4.5… 6.5… 6.5.4 6.5.5 6.5.6… 6.6… 6.6.4 6.6.5… 7… 8… 8.9 9 A… A.2 A.3… A.4 A.5 A.6 A.7…

Based on the use cases, the following formats, codecs, and packaging formats are of relevance for AR conversational:

General
- 2D Video Formats and video compression codecs
- Audio Formats and audio compression codecs supporting mono, stereo, and/or spatial audio
In addition, for downlink
- Immersive media 3D Formats such as static and dynamic point clouds or meshes
- Decoding tools for such formats
- Composed Scene Graph and Scene Description
In addition, for uplink
- Immersive media 2D Video Formats with depth
- Immersive media 3D Formats such as static and dynamic point clouds or meshes
- Encoding tools for such formats
- Streaming of sensor information (e.g., gyroscope, accelerometer) as well as pose information

Table 6.5.7-1 shows the list of potential instantiations and how they may be composed from each building block described in clause 6.5.1.

Building Block	Instantiation#1: MTSI extension	Instantiation#2: DCMTSI extension
Call setup and control	Conventional MTSI	Conventional MTSI with Data Channel
Media Formats	as specified in clause 6.5.6	as specified in clause 6.5.6
Delivery	RTP/UDP/IP, SCTP/DTLS/UDP/IP	RTP/UDP/IP, SCTP/DTLS/UDP/IP
5G system integration	Need policy exchange for AR-(DC)MTSI client (P-CSCF and PCF)	Need policy exchange for EDGAR-DCMTSI client (P-CSCF and PCF)

The list of potential standardization area that has been collected is provided in the following:

Immersive media format and profile with integration into relevant 5G architecture
Scene description format, functionality, and profile as an entry point of immersive media
Scene description update mechanism
Relevant subset of media codecs for different media types and formats
CMAF encapsulation of immersive media for 5G media streaming
Media payload format to be mapped into RTP streams
Capability exchange mechanism and relevant signalling (e.g., SDP)
Protocol stack and content delivery protocol for various architecture options as identified in Table 6.5.3-1
Functionalities to support split rendering and network-based media processing allocation with 5G edge/MRF
Required QoS and QoE for AR/MR conversational service