
Content for  TS 26.118  Word version:  18.0.0


0  Introduction

The present document provides technologies for interoperable Virtual Reality services, with a focus on streaming and consumption.
Virtual Reality (VR) is the ability to be virtually present in a space created by rendering natural and/or synthetic images and sound that are correlated with the movements of the immersed user, allowing the user to interact with that world.
Suitable media formats for providing immersive experiences are specified to enable Virtual Reality Services in the context of 3GPP bearer and user services.

1  Scope

The present document defines interoperable formats for Virtual Reality for streaming services. Specifically, the present document defines operation points, media profiles and presentation profiles for Virtual Reality. The present document builds on the findings and conclusions in TR 26.918.

2  References

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.
  • References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
  • For a specific reference, subsequent revisions do not apply.
  • For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.
[1]  TR 21.905: "Vocabulary for 3GPP Specifications".
[2]  TR 26.918: "Virtual Reality (VR) media services over 3GPP".
[3]  Recommendation ITU-R BT.709-6 (06/2015): "Parameter values for the HDTV standards for production and international programme exchange".
[4]  Recommendation ITU-R BT.2020-2 (10/2015): "Parameter values for ultra-high definition television systems for production and international programme exchange".
[5]  Recommendation ITU-T H.264 (04/2017): "Advanced video coding for generic audiovisual services" | ISO/IEC 14496-10:2014: "Information technology - Coding of audio-visual objects - Part 10: Advanced Video Coding".
[6]  Recommendation ITU-T H.265 (02/2018): "High efficiency video coding" | ISO/IEC 23008-2:2018: "High Efficiency Coding and Media Delivery in Heterogeneous Environments - Part 2: High Efficiency Video Coding".
[7]  Void.
[8]  TS 26.247: "Transparent end-to-end Packet-switched Streaming Service (PSS); Progressive Download and Dynamic Adaptive Streaming over HTTP (3GP-DASH)".
[9]  ISO/IEC 14496-15: "Information technology - Coding of audio-visual objects - Part 15: Carriage of network abstraction layer (NAL) unit structured video in ISO base media file format".
[10]  ISO/IEC 23001-8: "Information technology - MPEG systems technologies - Part 8: Coding-independent code points".
[11]  Recommendation ITU-R BT.2100-1: "Image parameter values for high dynamic range television for use in production and international programme exchange".
[12]  TS 26.116: "Television (TV) over 3GPP services; Video profiles".
[13]  ISO/IEC 23090-2: "Coded representation of immersive media - Part 2: Omnidirectional media format".
[14]  ISO/IEC DIS 23091-2: "Information technology - Coding-independent code points - Part 2: Video".
[15]  TS 26.260: "Objective test methodologies for the evaluation of immersive audio systems".
[16]  TS 26.259: "Subjective test methodologies for the evaluation of immersive audio systems".
[17]  ISO/IEC 14496-12: "Information technology - Coding of audio-visual objects - Part 12: ISO base media file format".
[18]  ISO/IEC 23009-1: "Information technology - Dynamic adaptive streaming over HTTP (DASH) - Part 1: Media presentation description and segment formats".
[19]  ISO/IEC 23008-3:2015: "Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio", ISO/IEC 23008-3:2015/Amd 2:2016: "MPEG-H 3D Audio File Format Support", ISO/IEC 23008-3:2015/Amd 3:2017: "MPEG-H 3D Audio Phase 2", ISO/IEC 23008-3:2015/Amd 5: "Audio metadata enhancements".
[20]  RFC 6381: "The 'Codecs' and 'Profiles' Parameters for "Bucket" Media Types", R. Gellens, D. Singer, P. Frojdh, August 2011.
[21]  AES69-2015: "AES standard for file exchange - Spatial acoustic data file format", 2015.

3  Definitions, symbols and abbreviations

3.1  Definitions

For the purposes of the present document, the terms and definitions given in TR 21.905 and the following apply. A term defined in the present document takes precedence over the definition of the same term, if any, in TR 21.905.
bitstream: a bitstream that conforms to a video encoding format and a certain Operation Point.
field of view: the extent of the visible area, expressed as vertical and horizontal angles in degrees in the 3GPP 3DOF reference system.
operation point: a collection of discrete combinations of different content formats including spatial and temporal resolutions, colour mapping, transfer functions, rendering metadata and the encoding format.
pose: position derived by the head tracking sensor, expressed as (azimuth; elevation; tilt angle).
receiver: a receiver that can decode and render any bitstream that conforms to a certain Operation Point.
viewport: the part of the 3DOF content to render based on the pose and the field of view.
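
As an informal illustration of how the pose, field of view and viewport definitions relate, the following Python sketch models a pose and a field of view as simple records and tests whether a viewing direction falls inside the resulting viewport. The names `Pose`, `FieldOfView`, `wrap_degrees` and `in_viewport` are hypothetical and are not defined by the present document. The sketch also assumes a deliberate simplification: tilt is ignored and the viewport is treated as an azimuth/elevation window centred on the pose, which is only a rough approximation of a rendered viewport.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pose:
    """Head pose as (azimuth; elevation; tilt angle), all in degrees."""
    azimuth: float
    elevation: float
    tilt: float = 0.0


@dataclass(frozen=True)
class FieldOfView:
    """Extent of the visible area as horizontal and vertical angles, in degrees."""
    horizontal: float
    vertical: float


def wrap_degrees(angle: float) -> float:
    """Wrap an angle into the half-open range [-180, 180)."""
    return (angle + 180.0) % 360.0 - 180.0


def in_viewport(pose: Pose, fov: FieldOfView,
                azimuth: float, elevation: float) -> bool:
    """Rough test of whether a direction lies inside the viewport.

    Assumption (not from the specification): tilt is ignored and the
    viewport is an azimuth/elevation window centred on the pose.
    """
    d_az = wrap_degrees(azimuth - pose.azimuth)  # handles the +/-180 seam
    d_el = elevation - pose.elevation
    return abs(d_az) <= fov.horizontal / 2 and abs(d_el) <= fov.vertical / 2
```

For example, with `Pose(azimuth=170.0, elevation=0.0)` and `FieldOfView(horizontal=90.0, vertical=60.0)`, the direction (azimuth -170, elevation 10) is reported as inside the viewport because the azimuth difference is wrapped across the ±180 degree seam, whereas (azimuth 0, elevation 0) is reported as outside.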

3.2  Symbols

For the purposes of the present document, the following symbols apply:
α  yaw of the 3GPP 3DOF coordinate system
β  pitch of the 3GPP 3DOF coordinate system
γ  roll of the 3GPP 3DOF coordinate system
ϕ  azimuth of the 3GPP 3DOF coordinate system
θ  elevation of the 3GPP 3DOF coordinate system
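
To make the azimuth (ϕ) and elevation (θ) symbols more concrete, the sketch below converts an azimuth/elevation pair in degrees into a unit direction vector. The axis convention used here (x pointing forward, y lateral, z up, with azimuth measured in the x-y plane and elevation measured towards z) is an assumption for illustration only; this clause does not define the axes, so the normative coordinate system should be checked before reusing the formula. The function name `direction_vector` is hypothetical.

```python
import math


def direction_vector(azimuth_deg: float, elevation_deg: float) -> tuple[float, float, float]:
    """Convert (azimuth, elevation) in degrees to a unit vector (x, y, z).

    Assumed convention (not stated in this clause): x forward, y lateral,
    z up; azimuth rotates in the x-y plane, elevation rises towards z.
    """
    phi = math.radians(azimuth_deg)      # azimuth
    theta = math.radians(elevation_deg)  # elevation
    x = math.cos(theta) * math.cos(phi)
    y = math.cos(theta) * math.sin(phi)
    z = math.sin(theta)
    return (x, y, z)
```

Under this assumed convention, an azimuth and elevation of zero map to the forward axis, and an elevation of 90 degrees maps to the up axis regardless of azimuth.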

3.3  Abbreviations

For the purposes of the present document, the abbreviations given in TR 21.905 and the following apply. An abbreviation defined in the present document takes precedence over the definition of the same abbreviation, if any, in TR 21.905.
3DOF  3 Degrees of Freedom
ACN  Ambisonics Channel Number
API  Application Programming Interface
AVC  Advanced Video Coding
BMFF  Base Media File Format
BRIR  Binaural Room Impulse Response
CMP  Cube-Map Projection
CIBR  Common Informative Binaural Renderer
DASH  Dynamic Adaptive Streaming over HTTP
DRC  Dynamic Range Control
EOTF  Electro-Optical Transfer Function
ERP  EquiRectangular Projection
ESD  Equivalent Spatial Domain
FFT  Fast Fourier Transform
FIR  Finite Impulse Response
FOA  First Order Ambisonics
FOV  Field Of View
GPU  Graphics Processing Unit
HDR  High Dynamic Range
HDTV  High Definition TeleVision
HEVC  High Efficiency Video Coding
HMD  Head Mounted Display
HOA  High Order Ambisonics
HRD  Hypothetical Reference Decoder
HRIR  Head-Related Impulse Responses
HRTF  Head-Related Transfer Function
HTTP  HyperText Transfer Protocol
IFFT  Inverse FFT
IRFFT  Inverse RFFT
MAE  MPEG-H Audio Metadata information
MCC  Metrics Collection and Computation
MHAS  MPEG-H Audio Stream
MIME  Multipurpose Internet Mail Extensions
MPD  Media Presentation Description
MPEG  Moving Picture Experts Group
NAL  Network Abstraction Layer
OMAF  Omnidirectional MediA Format
PCM  Pulse Code Modulation
RAP  Random Access Point
RFFT  Real FFT
RWP  Region-Wise Packing
SDR  Standard Dynamic Range
SEI  Supplemental Enhancement Information
SN3D  Schmidt semi-normalisation
SOFA  Spatially Oriented Format for Acoustics
SPS  Sequence Parameter Set
SRQR  Spherical Region-wise Quality Ranking
VCL  Video Coding Layer
VST  Virtual Studio Technology
VUI  Video Usability Information
VR  Virtual Reality