Content for TR 26.935 Word version: 18.0.0

4.1 Introduction 4.2 Tests over DCH radio channels 4.3 Tests over HSDPA/EUL radio channels
...

1 Scope p. 8

The present document provides information on the performances of default speech codecs in packet switched conversational multimedia applications. The codecs under test are AMR-NB (Adaptive Multi-Rate Narrowband) and AMR-WB (Adaptive Multi-Rate Wideband). In addition, several ITU-T codecs (G.723.1, G.729, G.722 and G.711) are included in the testing. Experimental test results from the speech quality testing are reported to illustrate the behaviour of these codecs.

The results give information of the performance of PS conversational multimedia applications under various operating and transmission conditions (e.g., considering radio transmission errors, IP packet losses, end-to-end delays, and several types of background noise). The performance results can be used e.g. as guidance for network planning and to appropriately adjust the radio network parameters.

2 References p. 8

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
For a specific reference, subsequent revisions do not apply.
For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document

[1]

ITU-T Recommendation P.800: "Methods for Subjective Determination of Transmission Quality".

[2]

ITU-T Recommendation P.831: "Subjective performance evaluation of network echo cancellers".

[3]

ITU-T Recommendation G.711: "Pulse code modulation (PCM) of voice frequencies".

[4]

ITU-T Recommendation G.729: "Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP)".

[5]

ITU-T Recommendation G.723.1: "Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s".

[6]

ITU-T Recommendation G.722: "7 kHz audio-coding within 64 kbit/s".

[7]

RFC 1889: "RTP: A Transport Protocol for Real-Time Applications".

[8]

RFC 3267: "Real-Time Transport Protocol (RTP) Payload Format and File Storage Format for the Adaptive Multi-Rate (AMR) Adaptive Multi-Rate Wideband (AMR-WB) Audio Codecs".

[9]

TS 34.121: "Terminal Conformance Specification, Radio Transmission and Reception (FDD)" (downlink).

[10]

TS 25.141: " Base Station (BS) conformance testing (FDD)" (uplink).

[11]

TR 25.853: "Delay budget within the access stratum".

[12]

TS 26.235: "Packet switched conversational multimedia applications; Default codecs".

[13]

TS 26.071: "AMR speech Codec; General description".

[14]

TS 26.171: "AMR speech codec, wideband; General description".

[15]

TS 25.322: "Radio Link Control (RLC) protocol specification".

[16]

RFC 3095: "RObust Header Compression (ROHC): Framework and four profiles: RTP, UDP, ESP, and uncompressed".

[17]

TS 34.108: "Common test environments for User Equipment (UE) conformance testing".

[18]

ETSI TR 101 112: "Universal Mobile Telecommunications System (UMTS); Selection procedures for the choice of radio transmission technologies of the UMTS" (UMTS 30.03 v3.1.0).

[19]

TS 26.114: : "IP Multimedia Subsystem (IMS); Multimedia Telephony; Media handling and interaction"

[20]

ITU-T Recommendation P.805 (P.CONV): "Subjective evaluation of conversational quality"

3 Abbreviations p. 9

3.1 Abbreviations p. 9

For the purposes of the present document, the following abbreviations apply:

AMR-NB (or AMR)

Adaptive Multi-Rate Narrowband Speech Codec

AMR-WB

Adaptive Multi-Rate Wideband Speech Codec

ANOVA

Analysis of Variance

ASY

ASYmmetric conditions

BLER

Block Error Rate

CDF

Cumulative Distribution Function

CMR

Codec Mode Request

COND

Test CONDitions

Core Network

Conversational Quality

CRC

Cyclic Redundancy Check

DCH

Dedicated Channel

Downlink

DMOS

Degradation Mean Opinion Score

DPCH

Dedicated Physical Channel

DTCH

Dedicated Traffic Channel

Eb/No

Ratio of energy per modulating bit to the noise spectral density

EID

Error Insertion Device

FER

Frame Erasure Rate, Frame Error Rate

GAL

Global Analysis Laboratory

Global Quality (of the conversation)

High Mobility

High Traffic

HSDPA/EUL

High Speed Downlink Packet Access/Enhanced UpLink

InterAction (with your partner)

Internet Protocol

ITU-T

International Telecommunication Union - Telecommunications Standardization Sector

JBM

Jitter Buffer Management

LAB

Listening LABoratory

Low Mobility

LowTraffic

MAC

Medium access control

MANOVA

Multivariate Analysis of Variance

Log-MAP

Logarithmic Maximum A Posteriori

MOS

Mean Opinion Score

Narrowband

PerCeption of impairments (also: Personal Computer)

PDCP

Packet Data Convergence Protocol

PDU

Protocol Data Unit

Sound Pressure Level (in Pascal)

Packet Loss

plc

Packet Loss Concealment

Radio Conditions

Packet Switched

Radio Bearer

RAB

Radio Access Bearer

RCV

Receive

RLC

Radio Link Control

ROHC

Robust Header Compression

RRM

Radio Resource Management

RTCP

Real-Time Control Protocol

RTP

Real-time Transport Protocol

SYM

SYMmetric conditions

TB size

Transport Block size

Transport Format

ToC

Table of Content

TrCH

Transmission Channel

TTI

Transmission Time Interval

UDP

User Datagram Protocol

User Equipment

Uplink

Unacknowledged Mode

UMD

Unacknowledged Mode Data

difficulty UnderStanding (your partner)

VOIP

Voice over IP

Voice Quality (of your partner)

Wideband

XMIT

Transmit

4 General Overview p. 10

4.1 Introduction p. 10

The performance of default speech codecs (AMR-NB and AMR-WB) for packet switched conversational multimedia [12, 19] was characterised over DCH channels and over HSDPA/EUL radio channels.

The testing over DCH channels was carried out from October 2003 until February 2004. Further subjective testing was carried out from June until October 2007 in order to characterize the performance over HSDPA/EUL radio channels. The main purpose of the latter testing was to evaluate and verify adequate performance of the AMR-NB and AMR-WB speech codecs used as defined in IMS Multimedia Telephony TS 26.114 with a specific focus on jitter buffer management.

4.2 Tests over DCH radio channels p. 10

The tests over DCH channels were separated into two phases: Phase 1 considered the default speech codecs AMR-NB [13] and AMR-WB [14] in various operating conditions. Phase 2 considered also several other codecs including ITU-T codecs G.723.1 [5], G.729 [4], G.722 [6] and G.711 [3].

In Phase 1, France Telecom R&D acted as host laboratory. The subjective testing laboratories were ARCON for the North American English language, France Telecom R&D for the French language and NTT-AT for the Japanese language. Phase 1 tests consisted of 24 test conditions both for the AMR codec (modes 6.7 and 12.2 kbit/s) and the AMR-WB codec (modes 12.65 and 15.85 kbit/s) with error conditions covering both IP packet loss of 0% and 3% and radio conditions with 10-2, 10-3 and 5 10-4 BLER (Block Error Rate). End-to-end delays of 300 and 500 ms were covered. Robust Header Compression (ROHC), an optional UMTS functionality, was included for some test cases for AMR-WB. Three types of background noise were used: car, street and cafeteria.

In Phase 2, France Telecom R&D acted as host and listening laboratory. Two languages were used (French and Arabic). The following codecs were tested: AMR-NB (modes 6.7 and 12.2 kbit/s), AMR-WB (modes 12.65 and 15.85 kbit/s), ITU-T G.723.1 (mode 6.4 kbit/s), ITU-T G.729 (mode 8 kbit/s), ITU-T G.722 (mode 64 kbit/s) and ITU-T G.711 (64 kbit/s). Transmission error conditions covered IP packet loss of 0% and 3%.

Siemens provided the real time air interface simulator for the Phase 1. France Telecom provided the IP core network simulator and terminal simulator used in both experiments Phase 1 and Phase 2. IPv6 was employed in the testing. (IPv6 is fully simulated over the radio interface. The CN simulator employs IPv4 but since the only impact is a marginal difference in the end-to-end delay - of the order of ~16 ìs - the use of a particular IP-version in CN part has no impact on the performance results.)

These tests were the first ever conversational tests conducted in any standardization body. Performance evaluation consisted of assessment of 5 aspects: 1) voice quality, 2) difficulty of understanding words, 3) quality of interaction, 4) degree of impairments, and 5) global communication quality. A 5-category rating scale was used for each aspect.

Dynastat performed the global analysis for Phases 1 and 2. The results are contained in Clause 7.

4.3 Tests over HSDPA/EUL radio channels p. 11

These listening-only tests characterized the performance of AMR-NB and AMR-WB speech codecs over HSDPA/EUL channels when conducting buffer adaptation to the network delay variations using a simple jitter buffer management (JBM) algorithm. The tests focused on the effect of channel errors and channel jitter to speech quality instead of the impact of overall end-to-end delay in speech conversation. The end-to-end delay impact was considered separately by conducting a delay analysis on the whole processed test material.

The subjective listening-only tests were conducted in Finnish and Swedish languages at Nokia and Ericsson, respectively. The tests consisted of eight different channel conditions in clean speech and in background noise conditions. AMR-NB was tested in 12.2 and 5.9 kbit/s modes, and AMR-WB at 12.65 kbit/s. The outstanding issue was to evaluate the performance of adaptive JBM operation in HSDPA/EUL channel conditions. The applied adaptive jitter buffer was a simple implementation conducting buffer adaptation mainly during discontinuous transmission, i.e. speech pauses, and not using any time scaling operation. A non-implementable fixed jitter buffer with the full a priori knowledge on the channel characteristics was used as a reference. Although the average end-to-end delays of both adaptive and fixed jitter buffers were the same, the number and locations of jitter buffer induced frame losses were different depending on the channel conditions.

The results are contained in Clause 8.A.

A program of Conversation Tests was organized to evaluate the performance of AMR-NB and AMR-WB for UMTS over HSDPA/EUL. Three test labs were contracted to conduct the conversation tests and deliver raw voting data to Dynastat, the Global Analysis Lab (GAL), for processing and statistical analysis.

Three conversation tests were conducted in each of three test labs. The test labs were FTRD, testing in the French language, BIT, testing in the Chinese language, and Dynastat, testing in North American English. Each of the three conversation tests involved a different speech codec:

Exp.1 - AMR operating at 5.9k bps
Exp.2 - AMR operating at 12.2k bps
Exp.3 - AMR-WB operating at 12.65k bps

The experiments were conducted according to specifications contained in the ITU-T Recommendation for Conversation Testing, P.805 [20]. Alcatel-Lucent provided the network impairment simulation test-bed. The raw voting data for each test lab and each Experiment was delivered to the GAL. The GAL conducted statistical analyses on the raw voting data and the results of those analyses are contained in Clause 8.B.