Standard ICS number: Information technology, office machinery and equipment >> 35.040 Character sets and information coding
Standard classification number: Electronic components and information technology >> Information processing technology >> L71 Coding, character sets, character recognition
Associated standards
Publication information
Publisher: China Standard Press
Publication date: 1981-10-01
Other information
Release date: 1980-03-09
Review date: 2004-10-14
Drafting unit: North China Institute of Computing Technology
Technical committee in charge: National Information Technology Standardization Technical Committee
Proposing unit: The Fourth Ministry of Machinery Industry of the People's Republic of China
Issuing department: State Administration of Standards
This standard specifies the basic graphic characters and their binary encoding for Chinese character information exchange. It applies to information exchange between general Chinese character processing systems, Chinese character communication systems and other systems. GB 2312-1980, Chinese coded character set for information exchange: Basic set.
Partial content of the standard:
National Standard of the People's Republic of China
Chinese coded character set for information exchange: Basic set
GB 2312-80

1. Scope of application
This standard specifies the basic graphic characters and their binary encoding for Chinese character information exchange. It applies to information exchange between general Chinese character processing systems, Chinese character communication systems and other systems.

2. Main relevant standards
2.1 GB 1988-80 "Seven-bit coded character set for information processing and exchange"
2.2 GB 2311-80 "Extension methods for the seven-bit coded character set for information processing and exchange"

3. Terms and definitions
3.1 Bit combination: an ordered set of binary bits representing a character.
3.2 Code, coded character set: a set of unambiguous rules that establishes a character set and a one-to-one correspondence between the characters of that set and their bit combinations.
3.3 Byte: a string of binary bits processed as a unit.
3.4 Graphic character: a character other than a control character. It usually has a visual form that can be rendered by handwriting, printing or display.
3.5 Escape sequence: a bit string used to extend the code. It consists of two or more bit combinations, the first of which is the escape character (ESC).
3.6 Designate: to indicate, by a prescribed method, one specific character set among a number of pre-established character sets.
3.7 Invoke: to make a designated coded character set available for use.
3.8 Represent: (a) to use the prescribed bit combinations with the meaning of the characters in the designated and invoked character set; (b) to use an escape sequence with the meaning of its supplementary control function.

4. Graphic characters
4.1 Graphic character code table
The graphic character code table is shown in Table 1.
4.2 Blank positions
Any position in the code table that is not filled with a graphic character is reserved for further standardization.
Issued by the State Administration of Standards. Implemented 1981-05-01. Proposed by the Fourth Ministry of Machinery Industry of the People's Republic of China. Drafted by the North China Institute of Computing Technology.

4.3 Types and quantities of graphic characters
This standard includes general symbols, serial numbers, numerals, Latin letters, Japanese kana, Greek letters, Russian letters, Hanyu Pinyin symbols, Chinese phonetic (zhuyin) letters and Chinese characters, 7445 graphic characters in total.
4.3.1 General symbols: 202. These include the space, punctuation marks, operators, unit symbols, tabulation (box-drawing) symbols and others. See Appendix 1 for their names.
4.3.2 Serial numbers: 60. There are 20 from 1. to 20., 20 from (1) to (20), 10 from ① to ⑩, and 10 from (一) to (十).
4.3.3 Numerals: 22. There are 10 digits from 0 to 9 and 12 Roman numerals from Ⅰ to Ⅻ.
4.3.4 Latin letters: 52. There are 26 uppercase letters A to Z and 26 lowercase letters a to z.
4.3.5 Japanese kana: 169. There are 83 hiragana and 86 katakana; both counts include the voiced and semi-voiced kana and the small kana used for geminate (sokuon) and contracted (yoon) sounds.
4.3.6 Greek letters: 48. There are 24 uppercase letters Α to Ω and 24 lowercase letters α to ω.
4.3.7 Russian letters: 66. There are 33 uppercase letters А to Я and 33 lowercase letters а to я.
4.3.8 Hanyu Pinyin symbols: 26 (including letters with tone marks and other symbols).
4.3.9 Chinese phonetic (zhuyin) letters: 37.
4.3.10 Chinese characters: 6763, divided into two levels.
Level 1 contains 3755 Chinese characters and level 2 contains 3008. See Appendix 2 for the classification and arrangement of Chinese characters.

5. Encoding
5.1 Encoding representation
This standard represents every graphic character with two bytes, each byte using the seven-bit coding of GB 1988-80 and GB 2311-80. The leading byte of the pair is called the first byte and the trailing byte the second byte.
5.2 Designation and invocation
In a seven-bit coded character set, this set is designated by the three-character escape sequence ESC 2/4 4/1 and invoked by the shift-in character (SI).
5.3 Representing the position of a graphic character in the code table
The code table is divided into 94 areas, each with 94 positions. Area numbers run from 1 to 94 and are identified by the first byte; position numbers also run from 1 to 94 and are identified by the second byte. The location of any graphic character in the code table is given by its area number and position number, joined by a hyphen.
Example: the Chinese character "啊" is represented by 16-01.
Note: area numbers and position numbers correspond to the column/row numbers of GB 1988-80 as follows:
  Column/row number in GB 1988-80: 2/1 to 7/14
  Area number: 1 to 94
  Position number: 1 to 94
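The correspondence in the note to 5.3 and the escape sequence in 5.2 can be sketched in a few lines of Python. This is a minimal illustration, not part of the standard: the function names (`column_row`, `parse_location`, `to_seven_bit`) are hypothetical, and the sketch assumes that column/row notation c/r denotes the byte value 16 × c + r, as is usual for seven-bit code tables.

```python
from typing import Tuple

ESC = 0x1B  # escape character
SI = 0x0F   # shift-in character, used for invocation in 5.2


def column_row(column: int, row: int) -> int:
    """Turn column/row notation (for example 2/4) into a 7-bit byte value."""
    if not (0 <= column <= 7 and 0 <= row <= 15):
        raise ValueError("column must be 0..7 and row must be 0..15")
    return column * 16 + row


def parse_location(text: str) -> Tuple[int, int]:
    """Parse an 'area-position' string such as '16-01'."""
    area_text, position_text = text.split("-")
    area, position = int(area_text), int(position_text)
    if not (1 <= area <= 94 and 1 <= position <= 94):
        raise ValueError("area and position must both be in 1..94")
    return area, position


def to_seven_bit(area: int, position: int) -> bytes:
    """Map area/position 1..94 onto the byte range 2/1..7/14 (0x21..0x7E)."""
    if not (1 <= area <= 94 and 1 <= position <= 94):
        raise ValueError("area and position must both be in 1..94")
    return bytes([0x20 + area, 0x20 + position])


# Three-character designation sequence from 5.2: ESC 2/4 4/1.
DESIGNATE_GB2312 = bytes([ESC, column_row(2, 4), column_row(4, 1)])

if __name__ == "__main__":
    pair = to_seven_bit(*parse_location("16-01"))
    print(pair.hex())              # two 7-bit bytes for location 16-01
    print(DESIGNATE_GB2312.hex())  # bytes of ESC 2/4 4/1
```

For location 16-01 the function returns the bytes 0x30 0x21. Python's built-in `gb2312` codec uses the common eight-bit form of the same table, in which the high bit of each byte is set, so `bytes(b | 0x80 for b in pair).decode("gb2312")` can be used to look up the character at a given location.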
Appendix 1
Names of general symbols (Attached Table 1)

Location  Character  Name
1-01  (SP)  Space
1-02  、  Enumeration comma (dunhao)
1-03  。  Full stop
1-04  ·  Middle dot (interval point)
1-05  ˉ  Tone mark (first tone, yinping)
1-06  ˇ  Tone mark (third tone, shangsheng)
1-07  ¨  Diaeresis
1-08  〃  Ditto mark
1-09  々  Repetition mark
1-10  —  Dash
1-11  ～  Range mark
1-12  ‖  Double vertical bar
1-13  …  Three-dot ellipsis
1-14  ‘  Left single quotation mark
1-15  ’  Right single quotation mark
1-16  “  Left double quotation mark
1-17  ”  Right double quotation mark
1-18  〔  Left hexagonal bracket
1-19  〕  Right hexagonal bracket
1-20  〈  Left single book-title mark
1-21  〉  Right single book-title mark
1-22  《  Left double book-title mark
1-23  》  Right double book-title mark
1-24  「  Left corner quotation mark
1-25  」  Right corner quotation mark
1-26  『  Left double corner quotation mark
1-27  』  Right double corner quotation mark
1-28  〖  Left hollow square bracket
1-29  〗  Right hollow square bracket
1-30  【  Left solid square bracket
1-31  】  Right solid square bracket
1-32  ±  Plus-minus sign
1-33  ×  Multiplication sign
1-34  ÷  Division sign
1-35  ∶  Ratio sign
1-36  ∧  Logical multiplication, "and" sign
1-37  ∨  Logical addition, "or" sign
1-38  ∑  Summation
1-39  ∏  Product
1-40  ∪  Union
1-41  ∩  Intersection
1-42  ∈  Belongs to
1-43  ∷  Definition sign
1-44  √  Root sign
1-45  ⊥  Perpendicular to
1-46  ∥  Parallel to
1-47  ∠  Angle
1-48  ⌒  Arc
1-49  ⊙  Circle
1-50  ∫  Integral
1-51  ∮  Closed (ring) integral
1-52  ≡  Identical to
1-53  ≌  Congruent to
1-54  ≈  Approximately equal to
1-55  ∽  Similar to
1-56  ∝  Proportional to
1-57  ≠  Not equal to
1-58  ≮  Not less than
1-59  ≯  Not greater than
1-60  ≤  Less than or equal to
1-61  ≥  Greater than or equal to
1-62  ∞  Infinity
1-63  ∵  Because
1-64  ∴  Therefore
1-65  ♂  Male sign
1-66  ♀  Female sign
1-67  °  Degree
1-68  ′  Minute
1-69  ″  Second
1-70  ℃  Degree Celsius
1-71  ＄  Dollar sign
1-72  ¤  International currency sign
1-73  ￠  Cent sign
1-74  ￡  Pound sign
1-75  ‰  Per mille sign
1-76  §  Section sign
1-77  №  Number sign (numero)
1-78  ☆  Hollow star
1-79  ★  Solid star
1-80  ○  Hollow circle
1-81  ●  Solid circle
1-82  ◎  Double hollow circle
1-83  ◇  Rhombus
1-84  ◆  Solid rhombus
1-85  □  Square
1-86  ■  Solid square
1-87  △  Triangle
1-88  ▲  Solid triangle
1-89  ※  Reference mark (rice-character mark)
1-90  →  Right arrow
1-91  ←  Left arrow
1-92  ↑  Up arrow
1-93  ↓  Down arrow
1-94  〓  Lead-type blank mark
3-01  ！  Exclamation mark
3-02  ＂  Double quotation mark
3-03  ＃  Number sign
3-04  ￥  Currency sign (yuan)
3-05  ％  Percent sign
3-06  ＆  "And", coordination sign
3-07  ＇  Right single quotation mark, tone mark
3-08  （  Left parenthesis
3-09  ）  Right parenthesis
3-10  ＊  Asterisk
3-11  ＋  Positive sign, plus sign
3-12  ，  Comma
3-13  －  Negative sign, minus sign
3-14  ．  Full stop, decimal point
3-15  ／  Slash, division sign
3-26  ：  Colon
3-27  ；  Semicolon
3-28  ＜  Less-than sign
3-29  ＝  Equals sign
3-30  ＞  Greater-than sign
3-31  ？  Question mark
3-32  ＠  Commercial at (unit price)
3-59  ［  Left square bracket
3-60  ＼  Backslash
3-61  ］  Right square bracket
3-62  ＾  Up arrow, tone mark
3-63  ＿  Underline
3-64  ｀  Left single quotation mark, tone mark
3-91  ｛  Left curly bracket
3-92  ｜  Vertical line
3-93  ｝  Right curly bracket
3-94  ￣  Overline
9-04 to 9-79  ─ to ╋  Tabulation (box-drawing) symbols; their names, written in an x/y line notation, are too garbled in this extract to reproduce.

Appendix 2
Classification and arrangement of Chinese characters

1. Classification of Chinese characters
This standard contains 6763 Chinese characters in two levels. The 3755 level-1 characters are placed in areas 16 to 55; the 3008 level-2 characters are placed in areas 56 to 87. See Table 3.

2. Arrangement of Chinese characters
Level-1 characters are arranged in Hanyu Pinyin alphabetical order. Characters with the same reading are ordered by the shape of their first stroke, in the sequence horizontal (横), vertical (直), left-falling (撇), dot (点), turning (折, that is 乙 and related stroke shapes); characters with the same first stroke are ordered by the second stroke, and so on.
Level-2 characters are arranged by radical. The radicals of this standard follow those of existing dictionaries, with slight changes. Radicals, and the characters under each radical, are ordered by stroke count; characters with the same stroke count are ordered by stroke shape in the same sequence of horizontal, vertical, left-falling, dot and turning. Characters with the same first stroke are ordered by the second stroke, and so on.
This arrangement method and these radicals apply only to this standard.
Character readings follow a draft reference list published in 1963 (its title is illegible in this extract). For characters not covered there, reference was made to a revised draft published by the Script Reform Press in 1966 (title also illegible), the Xinhua Zidian (《新华字典》, 1979 revised and reset edition), the Modern Chinese Dictionary (《现代汉语词典》, compiled by the dictionary editing office of the Institute of Linguistics, Chinese Academy of Social Sciences, published 1979) and the Cihai (《辞海》, Shanghai Lexicographical Publishing House, published 1979).
Character forms follow the table of Chinese character forms (1964) issued by the Ministry of Culture of the People's Republic of China and the Committee for Chinese Script Reform. Characters not included in that table follow the Xinhua Zidian, the Modern Chinese Dictionary and the Cihai.

Attached Table 2: graphic character code table, areas 1 to 9 (chart not reproduced).
Area 1: general symbols. Area 2: serial numbers and Roman numerals. Area 3: digits, Latin letters and symbols. Area 4: hiragana. Area 5: katakana. Area 6: Greek letters. Area 7: Russian letters. Area 8: Hanyu Pinyin symbols and zhuyin letters. Area 9: tabulation symbols.
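The character counts in 4.3 and the area ranges in Appendix 2 can be cross-checked with simple arithmetic. The sketch below is only an illustration under stated assumptions: the dictionary keys and function names are hypothetical labels chosen for this example, while the numbers are taken from the text of the standard as quoted above.

```python
from typing import Optional

# Counts per character type, as listed in 4.3.1 to 4.3.10.
COUNTS = {
    "general symbols": 202,
    "serial numbers": 60,
    "numerals": 22,
    "Latin letters": 52,
    "Japanese kana": 169,
    "Greek letters": 48,
    "Russian letters": 66,
    "Hanyu Pinyin symbols": 26,
    "zhuyin letters": 37,
    "Chinese characters": 6763,
}

POSITIONS_PER_AREA = 94
LEVEL_AREAS = {1: range(16, 56), 2: range(56, 88)}  # areas 16-55 and 56-87
LEVEL_COUNTS = {1: 3755, 2: 3008}


def total_graphic_characters() -> int:
    """Sum of all character types listed in 4.3."""
    return sum(COUNTS.values())


def hanzi_level(area: int) -> Optional[int]:
    """Return 1 or 2 for a Chinese-character area, or None for any other area."""
    for level, areas in LEVEL_AREAS.items():
        if area in areas:
            return level
    return None


def unused_positions(level: int) -> int:
    """Positions in a level's areas that are not occupied by a character."""
    capacity = len(LEVEL_AREAS[level]) * POSITIONS_PER_AREA
    return capacity - LEVEL_COUNTS[level]


if __name__ == "__main__":
    print(total_graphic_characters())                # compare with the total in 4.3
    print(LEVEL_COUNTS[1] + LEVEL_COUNTS[2])         # compare with 4.3.10
    print(unused_positions(1), unused_positions(2))  # blank positions per level
```

The totals agree with the text: the ten types sum to 7445, the two levels sum to 6763, the 32 areas of level 2 are exactly full (32 × 94 = 3008), and the 40 areas of level 1 leave 5 positions blank (40 × 94 = 3760 against 3755 characters).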