This scenario is as follows in
Figure 4.1-20, referring to a type of
"5G EDGe-Dependent AR (EDGAR) UE" in
TR 26.998, with the main characteristic that the binaural rendering process is shared between the Cloud/Edge and the 5G UE. The 5G UE connects to Cloud/Edge through an embedded 5G modem, the 5G UE and End Device (e.g. type 1 or 2) connect through WiFi or 5G sidelink, maybe through Bluetooth for audio. End Device sends Pose Information to 5G UE and Cloud/Edge if needed, and the Cloud/Edge and 5G UE provide the capabilities of decoding and rendering together. The 5G UE decodes and post-renders ISAR formatted audio to head-tracked binaural audio end re-encodes it with a stereo encoder. The 5G UE also relays pose information to the Cloud/Edge. The End Device decodes and outputs binaural audio received from 5G UE in a pass-through manner without post-rendering modifications.
In Variation A, as depicted in
Figure 4.1-21, audio playback is done through built-in loudspeakers of the End Device.
In Variation B, as depicted in
Figure 4.1-22, a pair of earbuds/headphones is used to playback the binaural audio instead of the built-in speakers used in Variation A. The pose estimation function is still performed by the End Device. Variation B is expected to be more prevalent than Variation C described below due to possibly better pose estimation capability by the End Device.
In Variation C, as depicted in
Figure 4.1-23, TWS Earbuds/Headphones perform stereo decoding of the binaural audio and binaural audio playback. In addition, they perform pose estimation and provide pose information to the End Device or directly to the 5G UE. The End Device may be used to relay pose information and coded audio between TWS Earbuds/Headphones and the 5G UE.