TS 22.076
Noise Suppression for the AMR codec –
Stage 1

3GPP‑Page ETSI‑search CONTENT_↓

V18.0.0 (PDF) 2024/03 12 p.

V17.0.0 2022/03 12 p.

V16.0.0 2020/06 12 p.

V15.0.0 2018/06 12 p.

V14.0.0 2017/03 12 p.

V13.0.0 2015/12 12 p.

V12.0.0 2014/09 12 p.

V11.0.0 2012/09 12 p.

V10.0.0 2011/04 12 p.

V9.0.0 2009/12 12 p.

V8.0.0 2008/12 12 p.

V7.0.0 2007/06 12 p.

V6.0.0 2005/01 12 p.

V5.0.0 2002/06 12 p.

V4.0.1 2001/10 12 p.

GSM Rel-99 v8.0.1 2001/08 11 p.

Rapporteur:: Mr. Usai, Paolino
ETSI

Content for TS 22.076 Word version: 18.0.0

1 Scope p. 5

The present document specifies the stage 1 description for the Noise Suppression feature for the AMR codec which enhances the input speech signal corrupted by acoustic noise. In analogy with ITU-T Recommendations I.130 [1], Stage 1 is an overall service description, from the service subscriber's and user's standpoints, that views the network as a single entity which provides services to the user.

2 References p. 5

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

References are either specific (identified by date of publication, edition number, version number, etc.) or non specific.
For a specific reference, subsequent revisions do not apply.
For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.

[1]

ITU-T Recommendations I.130 (1988): "Method for the characterization of telecommunication services supported by an ISDN and network capabilities of an ISDN".

[2]

GSM 01.04 (ETR 350): "Digital cellular telecommunications system (Phase 2+); Abbreviations and acronyms".

[3]

GSM 03.50: "Digital cellular telecommunications system (Phase 2+); Transmission planning aspects of the speech service in the GSM Public Land Mobile Network (PLMN) system".

3 Definitions and abbreviations p. 5

GSM 01.04 [2] (ETR 350) provides a list of abbreviations and acronyms used in GSM specifications.

4 Description p. 5

Noise Suppression for the AMR codec is an optional feature designed to enhance speech quality in a range of environments where there is significant (acoustic) background noise. The noise suppression function is a preprocessing module that is used to improve the signal to noise ratio of a speech signal prior to voice coding. In so doing it may use functions and/or data from the AMR speech encoding function. It shall be possible to implement AMR Noise Suppression in the mobile station (operating on the uplink speech signal). The possibility to implement AMR Noise Suppression in the network (operating on the downlink speech signal) is for further study. The noise suppression specification shall be comprised of bit exact fixed point C code. Test vectors shall be defined to verify operation.

The AMR Speech decoder C-code should not be altered by the Noise Suppression.

It shall be possible for the network to disable the operation of the example noise suppression algorithm defined by this feature, whether that operation is operational in the network, the mobile station, or both locations.

4.1 Applicability of Noise Suppression to Basic Services. p. 5

This feature shall be applicable (as an option) to all speech calls where the narrowband AMR codec is utilised. Operation of noise suppression for wideband AMR is for further study.

4.2 Support in Mobile Stations (MS) p. 6

Support of the Noise Suppression feature shall require modifications to future mobile stations. Provision of the feature in AMR-capable mobile stations is a manufacturer dependent option.

Use of the feature in the network during a call should not place any requirements on its use within the MS. Similarly, use of the feature by the MS during a call shall not place any requirements on its use in the network.

The network shall be able to enable or disable this example optional noise suppression function both at call set-up and in call [Signalling between network and mobile to allow this control is under study in SMG2 WPA].

4.3 Support in the Network p. 6

Provision of the feature in the network should be an option.

Use of the feature in the network during a call should not place any requirements on its use within the MS. Similarly, use of the feature by the MS during a call should not place any requirements on its use in the network.

The network should be able to enable or disable this example optional noise suppression function both at call set-up and in call.

4.4 Parameters to be indicated and negotiated p. 6

[TBD]

4.5 Provision of Service p. 6

4.5.1 Location Independence p. 6

The Noise suppression feature shall be location independent.

4.5.2 Provision of service within and between networks p. 6

Provision of the feature is the same whether or not the call is wholly contained within a network or between networks.

4.5.3 Subscription and Billing Information p. 6

This feature shall not be provisioned on a per-subscriber basis and no record of the application of Noise Suppression is necessary for billing purposes.

4.6 Quality of Service (QoS) p. 6

4.6.1 Impact on Speech Quality p. 6

The following performance requirements are stated under the assumption that the noise suppressor is tested as an integral part of the AMR speech codec with the speech codec operating at the following rates [TBD]. The performance requirements must be met for all these stated speech codec rates.

4.6.1.1 Initial Convergence Time p. 6

The initial convergence time shall be a maximum of T seconds with T equal to 2s. The definition of this time interval shall be understood strictly in accordance with its means of use in subjective listening experiments. Its use shall be defined by a process whereby the first T seconds of each sample processed through the AMR speech codec with and without noise suppression active, is deleted before presentation to listeners. It is assumed that this process does not reduce intelligibility, or introduce clipping or similar effects into the resultant speech plus noise material.

To test the subjective effect of initial convergence, there will be a subset of subjective testing defined where this initial period of T seconds is not removed from the processed samples. These tests should be representative of the full range of noise conditions.

4.6.1.2 No Degradation in Clean Speech p. 7

The noise suppression function must not have a statistically significant distorting effect on clean speech, in comparison with the performance of the AMR codec without noise suppression applied. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.

This requirement also applies when VAD/DTX is active.

4.6.1.3 No Artefacts in Residual Noise p. 7

The noise suppression function must not introduce any subjectively objectionable artefacts in the residual noise. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.

This requirement also applies when VAD/DTX is active.

4.6.1.4 No Speech Clipping and no Reduction in Intelligibility p. 7

The noise suppression function should introduce no subjectively objectionable degradation such as clipping or distortion in the speech, and no reduction in intelligibility. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.

This requirement also applies when VAD/DTX is active.

4.6.1.5 Quality Impact compared to AMR p. 7

The AMR speech codec with noise suppression activated must produce an output in noisy speech which is preferred amongst test listeners with statistical significance, compared to the case where noise suppression is not used. This requirement also applies to the case where tandeming of the standardised example noise suppression function occurs for mobile to mobile calls, in which case the reference condition is the tandem condition without noise suppression activated.

This requirement also applies when VAD/DTX is active.

4.6.2 Impact on Speech Path Delay p. 7

The one way algorithmic delay due to the activation of AMR noise suppression shall be no more than 7 ms in excess of the delay inserted by the AMR speech codec.

In handsfree case, this delay is part of the 39ms delay specified in GSM 03.50 [3].

4.7 Impact on Complexity p. 7

Table 4.1 defines complexity limits for AMR noise suppression.

Table 4.1: Complexity Requirements for AMR Noise Suppression

Quantity	Complexity Limit (Upper Bound)
Number of weighted operations per second	5 WMOPS
Scratch pad RAM	Re-use AMR speech encoder scratch pad RAM (or in the case of implementation which does not reside in the same device as the speech encoder, the available scratch pad RAM should be the same as that defined for the AMR speech encoder)
Static RAM	1,5 kwords
Data ROM	1 kword
Program ROM	2000 basic ETSI operations

4.8 Impact on Channel Activity p. 8

The AMR speech codec with noise suppression activated should not significantly increase channel activity when used in conjunction with DTX.

Channel activity increase will be measured thanks to the Voice Activity factor (VAF), defined as follows.

Let x be the VAF measured by the AMR VAD as an averaged value on all clean speech signals.

Let y be the VAF measured by the AMR VAD without AMR NS active as an averaged value on all clean speech + noise signals (where the applicable clean speech signal is the speech signal used in the measure of x).

Let w be the VAF measured by the AMR VAD with AMR NS active as an averaged value on all clean speech +noise signals (where the applicable clean speech signal is the speech signal used in the measure of x). w is required to be less than the maximum of y and x. Any case where w is greater than y should be further investigated.

For real word signals, w is required not to be significantly greater than y. Any case where w is greater than y should be further investigated.

These requirements shall apply to all standardized AMR VADs. (w,x,y) are determined using all VADs, and the requirements are checked relatively to each AMR VAD independently.

5 Interaction with supplementary services p. 8

5.1 General p. 8

This clause defines the interactions between GSM supplementary services and the Noise Suppression Feature.

The application of Noise Suppression shall not interfere with the provision or invocation of any supplementary services.

5.2 Explicit Call Transfer (ECT) p. 8

No adverse interaction. If the new party is a mobile station with support for the Noise Suppression feature, the noise suppression feature shall be invoked.

5.3 Call wait/Call hold. p. 8

No interaction.

5.4 Multiparty p. 8

No interaction.

5.5 Service Announcements p. 9

No interaction.

6 Interaction with Alternate and Followed by services p. 9

There shall be no impact on data transmission due the Noise Suppression Feature.

7 Interaction with other speech services p. 9

There is no requirement for Noise Suppression in ASCI services.

8 Interaction with DTMF and other signalling tones p. 9

DTMF and other signalling tones transmission performance during the application of Noise Suppression shall be no worse than the case where Noise Suppression is turned off.

9 Interaction with Lawful Intercept p. 9

In the case where lawful intercept is required in a call where Noise Suppression is activated, the Noise Suppression shall not cause any degradation in the speech quality received by the A and B parties.

10 Interaction with TFO p. 9

No interaction.

$ Change Request History p. 10

TS 22.076 Noise Suppression for the AMR codec – Stage 1

Content for TS 22.076 Word version: 18.0.0

1 Scope p. 5

2 References p. 5

3 Definitions and abbreviations p. 5

4 Description p. 5

4.1 Applicability of Noise Suppression to Basic Services. p. 5

4.2 Support in Mobile Stations (MS) p. 6

4.3 Support in the Network p. 6

4.4 Parameters to be indicated and negotiated p. 6

4.5 Provision of Service p. 6

4.5.1 Location Independence p. 6

4.5.2 Provision of service within and between networks p. 6

4.5.3 Subscription and Billing Information p. 6

4.6 Quality of Service (QoS) p. 6

4.6.1 Impact on Speech Quality p. 6

4.6.1.1 Initial Convergence Time p. 6

4.6.1.2 No Degradation in Clean Speech p. 7

4.6.1.3 No Artefacts in Residual Noise p. 7

4.6.1.4 No Speech Clipping and no Reduction in Intelligibility p. 7

4.6.1.5 Quality Impact compared to AMR p. 7

4.6.2 Impact on Speech Path Delay p. 7

4.7 Impact on Complexity p. 7

4.8 Impact on Channel Activity p. 8

5 Interaction with supplementary services p. 8

5.1 General p. 8

5.2 Explicit Call Transfer (ECT) p. 8

5.3 Call wait/Call hold. p. 8

5.4 Multiparty p. 8

5.5 Service Announcements p. 9

6 Interaction with Alternate and Followed by services p. 9

7 Interaction with other speech services p. 9

8 Interaction with DTMF and other signalling tones p. 9

9 Interaction with Lawful Intercept p. 9

10 Interaction with TFO p. 9

$ Change Request History p. 10

TS 22.076
Noise Suppression for the AMR codec –
Stage 1