GB/T 8565.2-1988 Information processing - Coded character set for text communication - Part 2: Graphic character set
Some standard content:
National Standard of the People's Republic of China
Information Processing
Coded charcter sets for text communication
Part 2
Graphic character sets
1 Subject and scope of application
UDC 681. 3: 003
GB8565.2—88
This standard specifies the graphic character list and control function list for text communication and their coded representation. It is applicable to text communication in which graphic characters and control functions are represented by binary codes. It is used for: a) public communication networks, b) private communication networks, c) interchangeable media such as magnetic tapes and disks. It is applicable to text communication at the coding interface. It is not necessary to use the coded character set specified in this standard between the coding interface and the input/output device of the terminal equipment.
Although text communication is composed of characters and pictures, this standard only applies to text composed of characters. At present, this standard consists of the following three parts: Part I General Principles, Part II Graphic Character Set for Text Communication, and Part III Control Functions for Page Image Format. Part II and Part III can be used in conjunction with each other or separately, but they are always used in conjunction with Part I.
2 Referenced standards
GB 1988
GB 2311
GB2312
GB 5261Www.bzxZ.net
GB 7589
GB7590
GB 8045
Seven-bit coded character set for information processing interchange Extension method for seven-bit coded character set for information processing interchange Chinese character coded character set Basic set Supplementary control functions for text and symbol forming equipment Chinese character coded character set for information interchange Second supplementary set Chinese character coded character set for information interchange Fourth supplementary set Ancient Chinese seven-bit and eight-bit abbreviated graphic character set for information processing interchange GB 8565.1
GB 8665.3
Coded character set for information processing text communication Part 1 General provisions Part 3 Control functions for page-based imaging formats Coded character set for information processing text communication National Bureau of Standards 188-01 - 06 approved
19 issue- 07 - 01 implementation
8 terminology
adopt the terminology definitions in the first part
4 code structure
GB8665.2-88
This standard adopts the structure of the seven-bit code and the structure of the eight-bit code. 4.1 seven-bit code structure
The structure of the seven-bit code is shown in Figure 1:
94 or%
graphic character line
Figure 1 seven-bit code structure
The seven-bit code occupies columns 0 to 7, which is divided into control character set and graphic character set. It consists of the following parts:
surrounding, 32 control character areas in columns 0 and 1: b. Columns 2 to 7 can be either the space character of byte group 2/0, the erase character of byte group 7/15, or the graphic character set area consisting of 94 graphic characters of byte group 2/1 ~7/14.
It can also be the graphic character set area consisting of 96 graphic characters of byte group 2/0 ~ 7/15. GB 85 65.2 —88
4.2 Structure of eight-bit code
The eight-bit code of this standard is formed by adding one bit to each byte group of the seven-bit code, and it consists of 256 eight-bit byte groups. When the eighth bit is "0", 128 byte groups are specified, and when the eighth bit is "1", another 128 byte groups are specified. The structure of the eight-bit code is shown in Figure 2.
The eight-bit code is used in columns 00 to 15. These characters can be divided into the following parts: 32 control character areas in columns 00 and 01: space character in bit group 02/00:
graphic character set area consisting of 94 graphic characters in bit group 02/01~07/14; erase character in bit group 07/15;
32 control character areas in columns 08 and 09, graphic character set area consisting of 94 graphic characters in bit group 10/01~15/14 in columns 10~15 or graphic character set area consisting of 96 graphic characters in bit group 10/00~15/15. 0001
Figure Yu Waiting
Figure 2 Structure of eight-bit code
Graphic character
5 Coding
GB 86.2—88
Based on the structure of seven-bit code and eight-bit code, this standard specifies single-byte graphic character set and double-section graphic character set. The escape sequence ESC2/0F or ESC02/00F is used in the seven-bit code environment and the eight-bit code environment respectively, and the use form of the graphic character set is suitable. Note: GB2311 specifies in detail the use form of the termination character in the escape sequence and the corresponding graphic character set. 5.1 Single-byte graphic character set
5.1.1 The graphic characters of GB1988 are used as the basic set of single-byte graphic characters. Their bit group representation is 2/1~7/14 in the seven-bit code and 02/01~07/14 in the eight-bit code.
Table 1.2 Foreign phonetic letters and minority phonetic letters are used as auxiliary sets of single-byte graphic characters. In the seven-bit code, the bit group representation of the graphic character is 2/1~7/14 (or 2/0~7/15), indicating that the escape sequence of the graphic character set is ESC1F. When I is 2/9, 2/10 and 2/11, it indicates the G1, G2 and G3 sets consisting of 94 graphic characters respectively. When I is 2/13, 2/14 and 2/15, it indicates the G1, G2 and G3 sets consisting of 96 graphic characters respectively. In the eight-bit code, the bit group of the graphic character is represented by 10/00~15/15. The escape sequence indicating the graphic character set is ESC IF. When I is 02/09, 02/10 and 02/11, it indicates the G1, G2 and G3 sets consisting of 94 graphic characters respectively. When 1 is 02/13, 02/14 and 02/15, it indicates the G1, G2 and G3 sets consisting of 96 graphic characters respectively. The single-byte graphic character set code table is shown in Tables 1 to 6. Note: ① The value of the termination character F is determined by the registration machine. ② The meaning and regulations of the graphic characters in the graphic character set can be found in the corresponding national or international standards. The single-byte graphic character set code table will be supplemented in the future. 5.1. Regarding the explanation of the Latin letter and non-letter graphic character set code table, these "non-stepping" diacritical marks represented by the auxiliary set bit groups 4/1~4/8, 4/10, 4/11 and 4/13~4/15 (12/01~12/08, 12/10, 12/11 and 12/13~12/15 in Table 4) can only be used in combination with a spacing character or a basic Latin letter. The "non-stepping" underscore character represented by the auxiliary set bit group 4/12 (12/12 in Table 4) can be combined with any graphic character in the graphic character set code table, including letters with accents, diacritical marks or spacing characters. b. Bit groups 4/0, 4/9, 5/6 to 5/11 and 6/5 (12/00, 12/09, 13/06 to 13/11 and 14/15 in Table 4) of the auxiliary set are reserved for future standardization. For example, additional graphic characters may be assigned. Bit groups 4/0 and 4/9 (12/00 and 12/09 in Table 4) are reserved for possible additional "non-grid" characters. Note ① In some interactive visual data systems, the character represented by bit group 5/15 of the basic set can be used as a delimiter, and its graphic representation may be different from the lower butterfly "diazole.
② In some systems using GB1988, the character represented by bit group 7/14 of the basic set can be represented by the upper horizontal bar "-graphic character. 5.2 Two-byte graphic character set
5.2.1 In the seven-bit code, any graphic character is represented by two bytes. Each byte is represented by the seven-bit code in GB1988 and GB2311. Of the two bytes, the high-order byte is the first byte and the low-order byte is the second byte. In the seven-bit code, the bit groups of the graphic characters are 2/1 to 7/14, indicating that the escape sequence of the graphic character set is ESC2/4, 4/1 or ESC2/41F. When I is 2/82/9, 2/10 and 2/11, it indicates the G0.G1.G2 and G3 sets composed of 94×94 graphic characters respectively. When I is 2/13, 2/14 and 2/15, it indicates the G1, G2 and G3 sets composed of 96×96 graphic characters respectively. 5.2.2. In the eight-bit code, any graphic character is represented by two bytes. Each section is represented by the eight-bit code in GB2311. Of the two bytes, the high-order byte is the first byte and the low-order byte is the second byte. In the eight-bit code environment, the escape sequence composed of escape characters is used to expand the code, and this expansion method is the same as in the seven-bit code environment. The code table of the double-byte graphic character set is shown in Table 7
5. Dynamically redefining graphic character set
To meet the needs of different businesses, the dynamically redefining graphic character set can be used to supplement single-byte and double-byte graphic characters. These characters GB8565.2-88
can be alphabetic characters, specific symbols or pixel symbols of pictures. Once the dynamically redefining graphic character set (DRCS) is specified, it can be regarded as a component of the graphic character master table, and the table can be specified as G0, G1, G2 and G3 sets using appropriate ESC sequences. The escape sequence for specifying a single-byte or double-byte dynamically redefined graphic character set is ESC12/0F or ESC2/4I2/0F, respectively. Where I is 2/8, 2/9, 2/10 and 2/11, it specifies the GD, G1, G2 and G3 sets consisting of 94 or 94×94 graphic characters, respectively. When I is 2/13, 2/14 and 2/15, it specifies the G1, G2 and G3 sets consisting of 96 or 96×96 graphic characters, respectively. Note that the value of the common stop character F is determined by the registration authority. The purpose of specifying a graphic character subset is to facilitate communication with a certain type of equipment that can only present text with a limited number of graphic characters less than the standard each time it communicates. This standard specifies the use of control function identification graphic character subsets (IGS) to specify (see GB8565.3 for details) single-byte graphic character subsets and double-byte graphic character subsets. The standard subset of graphic characters (see Appendix A) is related to the national language and the services used.
GB 85 65.288
Table 1 Nationally Common Code Table of Basic Set of Graphic Characters for Text Communication0
Note: In the seven-bit code, it is represented by bit group 2/1~7/14. In the eight-bit code, it is represented by bit group 02/01~07/14. ..comGB 85 65.2—
Table 2 Internationally Common Reference Code Table of Basic Set of Graphic Characters for Text Communicationorbzbt
Character: In the t-value code. It is represented by bit group 2:1-7/14. In the eight-bit code, it is represented by the bit group 02/01 to 07/14: GB 8, 2
Table 3 Code table of the auxiliary set of Latin letters and non-letter graphic characters for text communication bzn
Note: In the seven-bit code, it is represented by the bit group 2/1~7/14. In the eight-bit code, it is represented by the bit group 10/01-15/14. ..comtbb
GB 8565.2—88
Table 4 Basic set and auxiliary set code table of Latin letters and non-letter graphic characters for text communication 000102030405060708091011121314Q
Note: In the eight-bit code, it is represented by the bit group 02/01~07/14 and 10/01~15/14, C
GB 85 65. 2 -8
Table 5 Mongolian seven-bit code table for text communication abtbabi
Note: In the seven-bit code, it is represented by the bit group 2/1~7/14. In the eight-bit code, it is represented by the bit group 02/0107/14. ..com Table 6 Mongolian 8-bit code table for text communication GB8565.2-88
0405106f07108
Note: In the 8-bit code, the bit groups 02/01~07/14 and 10/01~15/14 are used to represent it. GB8565.2-88
Appendix A
Standard subset of graphic characters
(Supplement)
This standard specifies the standard subset of single-byte graphic characters and the standard subset of double-byte graphic characters: A1 Standard subset of single-byte graphic characters
Standard subset of single-byte graphic characters are:
a: Graphic character subset corresponding to five-bit code in seven-bit code environment (see Table A1)) b, Latin alphabet and non-alphabet graphic character subset for intelligent user telegraph (see Table A2) c. Auxiliary subsets of non-Latin letters and non-letter graphic characters for visual data (see Table A3) A2 Standard subset of double-byte graphic characters
A2.1 Double-byte Chinese character communication subset (see Table A4). A2.1.1 Number of characters collected in the Chinese character communication subset
The Chinese character communication subset collects Chinese characters, months, dates and times, a total of 705 graphic characters, including: 520 Chinese characters collected in the second auxiliary set, 92 Chinese characters collected in the fourth auxiliary set; 69 characters used to represent concave dates and time; 23 variant characters eliminated in the "First Batch of Variant Studies Arrangement Table" are selected, including names, place names, audio characters, font sizes and radicals, and 1 restored Chinese character "啦". A2.1.2 Coded representation
Any graphic symbol in the Chinese character communication subset is represented by two bytes. The first byte of the two bytes is the first byte, and the second byte is the second byte.
A2.1.8 Graphic character sets in a seven-bit code environment are indicated by the four-character escape sequence ESC2/41F, which indicates G1, G2 and G3 nests respectively when the escape sequence is 2/9, 2/10 and 2/11.
Note: The value of the termination character F is determined by the registration authority. A2.1.4 Method of indicating the position of graphic characters in the code table The code table is divided into 94 areas, each with 94 bits. The areas are numbered from 1 to 94, and the first byte identifies the bit number, which is also from 1 to 94, and the second byte identifies it. The position of any graphic character in the code table is identified by the area number and bit number in which it is located. The graphic character codes of areas 90 to 94 and area 13 conform to the graphic character codes in GB7589. The graphic characters in area 14 conform to the graphic character codes in GB7590. Area 15 contains 69 dates and times used in telegraph communication services, 23 variant characters for names of people, places, font sizes and character roots, and 1 restored character "喇". A2.1.5 Comparison table of Chinese character communication subset exchange codes and telegraph codes (see Table A5)comtbb
GB 8565.2-88
Table 4 Basic set and auxiliary set code table of Latin letters and non-alphabetic graphic characters for text communication 000102030405060708091011121314Q
Note: In the eight-bit code, it is represented by the bit group 02/01~07/14 and 10/01~15/14, C
GB 85 65. 2-8
Table 5 Mongolian seven-bit code table for text communication abtbabi
Note: In the seven-bit code, it is represented by the bit group 2/1~7/14. In the eight-bit code, it is represented by the bit group 02/0107/14. ..com Table 6 Mongolian 8-bit code table for text communication GB8565.2-88
0405106f07108
Note: In the 8-bit code, the bit groups 02/01~07/14 and 10/01~15/14 are used to represent it. GB8565.2-88
Appendix A
Standard subset of graphic characters
(Supplement)
This standard specifies the standard subset of single-byte graphic characters and the standard subset of double-byte graphic characters: A1 Standard subset of single-byte graphic characters
Standard subset of single-byte graphic characters are:
a: Graphic character subset corresponding to five-bit code in seven-bit code environment (see Table A1)) b, Latin alphabet and non-alphabet graphic character subset for intelligent user telegraph (see Table A2) c. Auxiliary subsets of non-Latin letters and non-letter graphic characters for visual data (see Table A3) A2 Standard subset of double-byte graphic characters
A2.1 Double-byte Chinese character communication subset (see Table A4). A2.1.1 Number of characters collected in the Chinese character communication subset
The Chinese character communication subset collects Chinese characters, months, dates and times, a total of 705 graphic characters, including: 520 Chinese characters collected in the second auxiliary set, 92 Chinese characters collected in the fourth auxiliary set; 69 characters used to represent concave dates and time; 23 variant characters eliminated in the "First Batch of Variant Studies Arrangement Table" are selected, including names, place names, audio characters, font sizes and radicals, and 1 restored Chinese character "啦". A2.1.2 Coded representation
Any graphic symbol in the Chinese character communication subset is represented by two bytes. The first byte of the two bytes is the first byte, and the second byte is the second byte.
A2.1.8 Graphic character sets in a seven-bit code environment are indicated by the four-character escape sequence ESC2/41F, which indicates G1, G2 and G.3 nests when the escape sequence is 2/9, 2/10 and 2/11 respectively.
Note: The value of the termination character F is determined by the registration authority. A2.1.4 Method of indicating the position of graphic characters in the code table The code table is divided into 94 areas, each with 94 bits. The areas are numbered from 1 to 94, and the first byte identifies the bit number, which is also from 1 to 94, and the second byte identifies it. The position of any graphic character in the code table is identified by the area number and bit number in which it is located. The graphic character codes of areas 90 to 94 and area 13 conform to the graphic character codes in GB7589. The graphic characters in area 14 conform to the graphic character codes in GB7590. Area 15 contains 69 dates and times used in telegraph communication services, 23 variant characters for names of people, places, font sizes and character roots, and 1 restored character "喇". A2.1.5 Comparison table of Chinese character communication subset exchange codes and telegraph codes (see Table A5)comtbb
GB 8565.2-88
Table 4 Basic set and auxiliary set code table of Latin letters and non-alphabetic graphic characters for text communication 000102030405060708091011121314Q
Note: In the eight-bit code, it is represented by the bit group 02/01~07/14 and 10/01~15/14, C
GB 85 65. 2-8
Table 5 Mongolian seven-bit code table for text communication abtbabi
Note: In the seven-bit code, it is represented by the bit group 2/1~7/14. In the eight-bit code, it is represented by the bit group 02/0107/14. ..com Table 6 Mongolian 8-bit code table for text communication GB8565.2-88
0405106f07108
Note: In the 8-bit code, the bit groups 02/01~07/14 and 10/01~15/14 are used to represent it. GB8565.2-88
Appendix A
Standard subset of graphic characters
(Supplement)
This standard specifies the standard subset of single-byte graphic characters and the standard subset of double-byte graphic characters: A1 Standard subset of single-byte graphic characters
Standard subset of single-byte graphic characters are:
a: Graphic character subset corresponding to five-bit code in seven-bit code environment (see Table A1)) b, Latin alphabet and non-alphabet graphic character subset for intelligent user telegraph (see Table A2) c. Auxiliary subsets of non-Latin letters and non-letter graphic characters for visual data (see Table A3) A2 Standard subset of double-byte graphic characters
A2.1 Double-byte Chinese character communication subset (see Table A4). A2.1.1 Number of characters collected in the Chinese character communication subset
The Chinese character communication subset collects Chinese characters, months, dates and times, a total of 705 graphic characters, including: 520 Chinese characters collected in the second auxiliary set, 92 Chinese characters collected in the fourth auxiliary set; 69 characters used to represent concave dates and time; 23 variant characters eliminated in the "First Batch of Variant Studies Arrangement Table" are selected, including names, place names, audio characters, font sizes and radicals, and 1 restored Chinese character "啦". A2.1.2 Coded representation
Any graphic symbol in the Chinese character communication subset is represented by two bytes. The first byte of the two bytes is the first byte, and the second byte is the second byte.
A2.1.8 Graphic character sets in a seven-bit code environment are indicated by the four-character escape sequence ESC2/41F, which indicates G1, G2 and G.3 nests when the escape sequence is 2/9, 2/10 and 2/11 respectively.
Note: The value of the termination character F is determined by the registration authority. A2.1.4 Method of indicating the position of graphic characters in the code table The code table is divided into 94 areas, each with 94 bits. The areas are numbered from 1 to 94, and the first byte identifies the bit number, which is also from 1 to 94, and the second byte identifies it. The position of any graphic character in the code table is identified by the area number and bit number in which it is located. The graphic character codes of areas 90 to 94 and area 13 conform to the graphic character codes in GB7589. The graphic characters in area 14 conform to the graphic character codes in GB7590. Area 15 contains 69 dates and times used in telegraph communication services, 23 variant characters for names of people, places, font sizes and character roots, and 1 restored character "喇". A2.1.5 Comparison table of Chinese character communication subset exchange codes and telegraph codes (see Table A5)
Tip: This standard content only shows part of the intercepted content of the complete standard. If you need the complete standard, please go to the top to download the complete standard document for free.