Coding Group Bits 7..4 | Use of bits 3..0 |
---|---|
0000 |
Language using the GSM 7 bit default alphabet
Bits 3..0 indicate the language:
0000 German
0001 English
0010 Italian
0011 French
0100 Spanish
0101 Dutch
0110 Swedish
0111 Danish
1000 Portuguese
1001 Finnish
1010 Norwegian
1011 Greek
1100 Turkish
1101 Hungarian
1110 Polish
1111 Language unspecified |
0001 |
0000
GSM 7 bit default alphabet; message preceded by language indication.
The first 3 characters of the message are a two-character representation of the language encoded according to ISO 639 [12], followed by a CR character. The CR character is then followed by 90 characters of text (NOTE 1).
0001
UCS2; message preceded by language indication
The message starts with a two GSM 7-bit default alphabet character representation of the language encoded according to ISO 639 [12]. This is padded to the octet boundary with two bits set to 0 and then followed by 40 characters of UCS2-encoded message (NOTE 1).
An MS not supporting UCS2 coding will present the two character language identifier followed by improperly interpreted user data.
0010..1111
Reserved
|
0010 |
0000 Czech
0001 Hebrew (NOTE 2)
0010 Arabic (NOTE 2)
0011 Russian (NOTE 2)
0100 Icelandic
0101..1111
Reserved for other languages using the GSM 7 bit default alphabet, with unspecified handling at the MS
|
0011 |
0000..1111
Reserved for other languages using the GSM 7 bit default alphabet, with unspecified handling at the MS
|
01xx |
General Data Coding indication
Bits 5..0 indicate the following:
Bit 5, if set to 0, indicates the text is uncompressed
Bit 5, if set to 1, indicates the text is compressed using the compression algorithm defined in TS 23.042
Bit 4, if set to 0, indicates that bits 1 to 0 are reserved and have no message class meaning
Bit 4, if set to 1, indicates that bits 1 to 0 have a message class meaning:
Bit 1 Bit 0 Message Class:
0 0 Class 0
0 1 Class 1 Default meaning: ME-specific.
1 0 Class 2 (U)SIM specific message.
1 1 Class 3 Default meaning: TE-specific (see TS 27.005)
Bits 3 and 2 indicate the character set being used, as follows:
Bit 3 Bit 2 Character set:
0 0 GSM 7 bit default alphabet
0 1 8 bit data
1 0 UCS2 (16 bit) [10]
1 1 Reserved |
1000 | Reserved coding groups |
1001 |
Message with User Data Header (UDH) structure:
Bit 1 Bit 0 Message Class:
0 0 Class 0
0 1 Class 1 Default meaning: ME-specific.
1 0 Class 2 (U)SIM specific message.
1 1 Class 3 Default meaning: TE-specific (see TS 27.005)
Bits 3 and 2 indicate the alphabet being used, as follows:
Bit 3 Bit 2 Alphabet:
0 0 GSM 7 bit default alphabet
0 1 8 bit data
1 0 UCS2 (16 bit) [10]
1 1 Reserved |
1010..1100 | Reserved coding groups |
1101 | I1 protocol message defined in TS 24.294 |
1110 | Defined by the WAP Forum [15] |
1111 |
Data coding / message handling
Bit 3 is reserved, set to 0.
Bit 2 Message coding:
0 GSM 7 bit default alphabet
1 8 bit data
Bit 1 Bit 0 Message Class:
0 0 No message class.
0 1 Class 1 user defined.
1 0 Class 2 user defined.
1 1 Class 3 default meaning: TE specific (see TS 27.005) |
NOTE 1:
The language indication shall appear at the start of each Message Information Page (see TS 23.041) and the language indication on each Message Information Page shall be for the same language.
NOTE 2:
Message text in Hebrew, Arabic and Russian cannot be encoded in the GSM 7-bit default alphabet. For these languages UCS2 encoding shall be used.
|
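
The coding groups above can be read as a decision tree on the upper four bits of the data coding scheme octet. The following is a minimal Python sketch of that tree, not a reference implementation. The names `CbsDcs`, `decode_cbs_dcs`, the `kind` labels and the lookup lists are hypothetical and chosen only for illustration. For Hebrew, Arabic and Russian the sketch leaves the character set unset, since NOTE 2 says these languages cannot use the GSM 7 bit default alphabet.

```python
from typing import NamedTuple, Optional

GSM7 = "GSM 7 bit default alphabet"
CHARSETS = [GSM7, "8 bit data", "UCS2", "reserved"]  # indexed by bits 3..2
LANGS_0000 = ["German", "English", "Italian", "French", "Spanish", "Dutch",
              "Swedish", "Danish", "Portuguese", "Finnish", "Norwegian",
              "Greek", "Turkish", "Hungarian", "Polish", None]  # 1111: unspecified
LANGS_0010 = ["Czech", "Hebrew", "Arabic", "Russian", "Icelandic"]


class CbsDcs(NamedTuple):
    """Illustrative result type; field names are assumptions, not spec terms."""
    kind: str
    charset: Optional[str] = None
    language: Optional[str] = None
    message_class: Optional[int] = None
    compressed: bool = False
    language_prefix: bool = False  # message text starts with a language code


def decode_cbs_dcs(dcs: int) -> CbsDcs:
    """Classify one data coding scheme octet according to the table above."""
    if not 0 <= dcs <= 0xFF:
        raise ValueError("DCS must be a single octet")
    group, low = dcs >> 4, dcs & 0x0F
    charset = CHARSETS[(dcs >> 2) & 0b11]  # bits 3..2
    msg_class = dcs & 0b11                 # bits 1..0

    if group == 0b0000:
        return CbsDcs("language", GSM7, LANGS_0000[low])
    if group == 0b0001:
        if low == 0b0000:
            return CbsDcs("language", GSM7, language_prefix=True)
        if low == 0b0001:
            return CbsDcs("language", "UCS2", language_prefix=True)
        return CbsDcs("reserved")
    if group == 0b0010:
        language = LANGS_0010[low] if low < len(LANGS_0010) else None
        # NOTE 2: these three cannot be carried in the GSM 7 bit alphabet.
        needs_ucs2 = language in ("Hebrew", "Arabic", "Russian")
        return CbsDcs("language", None if needs_ucs2 else GSM7, language)
    if group == 0b0011:
        return CbsDcs("language", GSM7)
    if group >> 2 == 0b01:  # 01xx: general data coding indication
        has_class = bool(dcs & 0x10)  # bit 4
        return CbsDcs("general", charset,
                      message_class=msg_class if has_class else None,
                      compressed=bool(dcs & 0x20))  # bit 5
    if group == 0b1001:
        return CbsDcs("udh", charset, message_class=msg_class)
    if group == 0b1101:
        return CbsDcs("i1")
    if group == 0b1110:
        return CbsDcs("wap")
    if group == 0b1111:
        # Bit 2 selects the coding; class bits 00 mean "no message class".
        return CbsDcs("handling", "8 bit data" if dcs & 0x04 else GSM7,
                      message_class=msg_class or None)
    return CbsDcs("reserved")  # 1000 and 1010..1100
```

For example, `decode_cbs_dcs(0x59)` falls in the 01xx group and reports uncompressed UCS2 text with message class 1, while `decode_cbs_dcs(0x01)` reports English in the GSM 7 bit default alphabet.
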
VALUE (hex) | MEANING |
---|---|
00 | Concatenated short messages, 8-bit reference number |
01 | Special SMS Message Indication |
06 | SMSC Control Parameters |
08 | Concatenated short message, 16-bit reference number |
20 | RFC 822 E-Mail Header |
23 | Enhanced Voice Mail Information |
70-7F | (U)SIM Toolkit Security Headers |
80-9F | SME to SME specific use |
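
As an illustration of how these identifier values are used, the sketch below walks a User Data Header and extracts the concatenation fields for identifiers 00 and 08. The header layout it relies on is not stated in the table above and is an assumption here: one length octet (UDHL), followed by elements that each consist of an identifier octet, a length octet and that many data octets. The 16-bit reference number is assumed to be stored most significant octet first. The function names `parse_udh` and `concat_info` are hypothetical.

```python
from typing import List, Optional, Tuple


def parse_udh(user_data: bytes) -> List[Tuple[int, bytes]]:
    """Split a User Data Header into (identifier, data) pairs.

    Assumed layout: UDHL, then repeated [identifier, length, data...].
    """
    if not user_data:
        raise ValueError("empty user data")
    udhl = user_data[0]
    header = user_data[1:1 + udhl]
    if len(header) != udhl:
        raise ValueError("user data shorter than declared header length")

    elements = []
    pos = 0
    while pos < len(header):
        if pos + 2 > len(header):
            raise ValueError("truncated information element")
        iei, iedl = header[pos], header[pos + 1]
        data = header[pos + 2:pos + 2 + iedl]
        if len(data) != iedl:
            raise ValueError("information element data is truncated")
        elements.append((iei, data))
        pos += 2 + iedl
    return elements


def concat_info(iei: int, data: bytes) -> Optional[Tuple[int, int, int]]:
    """Return (reference, total parts, part number) for identifiers 00 and 08."""
    if iei == 0x00 and len(data) == 3:
        # 8-bit reference number
        return data[0], data[1], data[2]
    if iei == 0x08 and len(data) == 4:
        # 16-bit reference number, assumed big-endian
        return int.from_bytes(data[:2], "big"), data[2], data[3]
    return None


# Part 1 of 2 with 8-bit reference number 0xCC, followed by two text octets.
elements = parse_udh(bytes.fromhex("050003CC0201") + b"hi")
for iei, data in elements:
    print(f"{iei:02X}", concat_info(iei, data))
```

Elements with identifiers the caller does not recognise are still returned by `parse_udh`, so they can be skipped or logged instead of aborting the whole message.
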