This clause describes the enhancements to the IMS architecture supporting data channel services to additionally support Avatar communication services.
For Avatar communication over IMS data channel, the list of Avatar ID(s) is downloaded to the UE by following options:
Pre-configured in the UE: The Avatar ID List is provisioned or downloaded to the UE before a data channel for avatar call is setup. The Avatar Representations may be downloaded together. The UE and the BAR may interact by means out of the scope of 3GPP.
Through bootstrap data channel: The Avatar ID List is fetched by the DC AS from the BAR, and transferred from the DC AS to the DCSF, and downloaded to UE through bootstrap data channel.
Through application data channel: The Avatar ID List is fetched by the DC AS from the BAR, and downloaded to the UE through application data channel.
Avatar ID List contains the list of Avatar IDs that are identifying the Avatar Representation and/or the associated information. The associated information may include the information for the user selection in the avatar calls (e.g. the metadata associated with the Avatar such as thumbnails, URLs, the associated data channel applications).
For downloading of Avatar ID List through bootstrap data channel, the DC AS URL(s) may be pre-configured in the DCSF, or fetched by DCSF. When the downloading of Avatar ID List is requested through bootstrap data channel, the DCSF requests DC AS to fetch the Avatar ID List from the BAR for the authorized user of the UE.
The BAR and/or the DC AS may be deployed either in the trusted domain or untrusted domain of operator's network.
After selection of Avatar ID to use in the avatar call by the user of the UE, the Avatar Representation identified with the selected Avatar IDs may be provided from the BAR to MF, and it may be transferred to the local UE (if not locally available) and/or to the remote UE according to the rendering mode via application data channel if needed.
The avatar media is rendered by processing the Avatar Representation with the animation data.
The avatar media rendering process may be performed by the sending UE, the receiving UE, or the network according to the rendering mode.
The animation data may be generated by the sending UE, the receiving UE, or the network (e.g. from audio, video, and/or metadata such as UE sensors collected data).
To support Avatar communication, the data channel architecture and functions described in clause AC.2 is enhanced as below:
Base Avatar Repository (BAR):
The BAR stores Avatar Representation, its Avatar ID, and the associated information as described in clause AC.11.1. One or more Avatar Representation(s), is stored for a user, and each Avatar Representation is identified with Avatar ID.
The BAR may provide Avatar ID List to the DC Application Server, and the DC Application Server may further provide to DCSF.
The BAR provides Avatar Representation and the associated information to the MF directly or after URL redirection from the DC AS.
DCSF:
If the Avatar ID List is downloaded through bootstrap data channel, the DCSF may fetch the Avatar ID List from the DC AS, and send to MF via MDC1.
MF:
The MF supports the UE downloading the Avatar ID List and associated information through bootstrap data channel or application data channel.
The MF supports the UE downloading the Avatar Representation through bootstrap data channel or application data channel.
The MF may retrieve the Avatar Representation and the associated information from BAR via MDC2.
The MF may support Avatar media processing and transcoding (e.g. ASR/TTS).
DC AS:
The DC AS may provide the DC AS URL(s) to DCSF via DC3/N33 or DC4 for downloading of Avatar ID List through bootstrap data channel.
The DC AS retrieves the Avatar ID List from BAR(s) via MDC2.
The DC AS sends the Avatar ID List either to DCSF via MDC3 for downloading through bootstrap data channel, or to MF via MDC2 for downloading through application data channel.
The DC AS may retrieve the Avatar Representation and the associated information from BAR and sends to MF via MDC2.
The DC AS supports Avatar media negotiation and provides the instruction for Avatar media processing.
Figure AC.11.3.1-1 depicts a typical call flow procedure of avatar ID list download procedure through bootstrap data channel. The main steps in the call flow are as follows:
Sending UE centric: When the UE initiates an avatar communication with the peer UE, the UE that owns the avatar representation should download the avatar representation from BAR, if the Avatar representation is not locally available. The sending UE performs avatar animation and sends the animated video stream to the peer UE.
Receiving UE centric: When the UE initiates an avatar communication with the peer UE, the UE that owns the avatar representation should sends the avatar ID to the peer UE. The peer UE downloads the avatar representation from BAR based on the received avatar ID, performs the avatar animation, and displays the animated video stream.
Figure AC.11.3.2.1-1 depicts a typical call flow procedure of sending UE centric IMS Avatar communication. The main steps in the call flow are as follows:
IMS session and bootstrap data channel have been established. UE1 downloads avatar communication application. The avatar ID list is downloaded as described in clause AC.11.3.1. Then UE1 chooses an avatar ID and runs the avatar communication application. If the avatar representation is locally available on the UE1, then step 2-6 can be skipped.
UE1 downloads Avatar representation from DC AS using avatar ID and UE1's identity (e.g., IMSI or MSISDN) through the established application data channel.
The DC AS should authenticate and authorize the UE1 request and download Avatar representation using avatar ID from the BAR, the BAR responses with avatar representation data.
Figure AC.11.3.2.2-1 depicts a typical call flow procedure of receiving UE centric IMS Avatar communication. The main steps in the call flow are as follows:
IMS session and bootstrap data channel have been established. UE1 downloads avatar communication application. The avatar ID list is downloaded as described in clause AC.11.3.1. Then UE1 chooses an avatar ID and runs the avatar communication application.
UE1 performs avatar animation negotiation with the DC AS and UE2. The negotiation includes usage of the UE1's avatar ID and the animation data types (e.g. text and/or facial expression) supported by UE1.
The DC AS should authenticate and authorize the UE2 request and download avatar representation using UE1's avatar ID from the BAR, the BAR responses with UE1's avatar representation data. The DC AS should authorize the UE2 download request based on the negotiation result from step 4.
UE1 may send metadata (e.g. facial expression) to UE2 through the established application data channel or audio/video through RTP channel between UE1 and UE2.
IMS session and bootstrap data channel have been established. UE1 downloads avatar communication application. The avatar ID list is downloaded as described in clause AC.11.3.1. Then UE1 chooses an avatar ID and runs the avatar communication application.
If the negotiation result is successful in step 4, the UE1 initiates new P2A2P application data channels, which are used for avatar data transmission between the UE1/UE2 and the network. During the P2A2P application data channel establishment procedure, the DCSF will retrieve the avatar media processing specification, and instruct the MF via IMS AS how to establish the data channel and corresponding avatar media processing specification.
The UE1/UE2 may send avatar metadata (e.g. facial expression, FOV) through the P2A2P ADC established in step 5 and/or audio/video through established RTP channels in step 6.
MF performs avatar animation based on the avatar metadata received from application data channel and/or audio/video. MF should support different media processing according the avatar type, e.g., animation with no rendering for 2D avatar, animation and rendering for 3D avatar.
MF sends the animated video stream to UE2, and sends the animated video stream to UE1 if required.
The above procedures also apply for the transition from audio/video communication between two users to avatar communication. In an ongoing audio/video communication, one of UEs decides to switch the communication to avatar mode, it will interact with the IMS Core and peer UE to establish application data channel for the avatar communication. Then two UEs will exchange avatar media; and the original video media between two UEs may be stopped.