Drafting unit:China Standards Research Center, China Institute of Scientific and Technical Information, Encyclopedia Publishing House
Focal point unit:China Standards Research Center
Proposing unit:National Technical Committee on Terminology Standardization and National Technical Committee on Information and Documentation Standardization
Publishing department:State Administration of Quality and Technical Supervision
This standard specifies the three-letter codes for 431 languages (including collective languages). Each language has its corresponding code for terminology (T) and code for bibliography (B). Except for 23 languages, the T code and B code of other languages are the same. This standard is applicable to terminology, lexicography, linguistics, bibliography, etc., and also to any information processing and work field that needs to express languages in code form, including occasions for information exchange (especially computer systems) in library, information service and publishing work. GB/T 4880.2-2000 Language Name Code Part 2: 3-letter Code GB/T4880.2-2000 Standard Download Decompression Password: www.bzxz.net
This standard specifies the 3-letter codes for 431 languages (including collective languages). Each language has its corresponding code for terminology (T) and code for bibliography (B). Except for 23 languages, the T code and B code are different, while the T code and B code of other languages are the same.
This standard is applicable to terminology, lexicography, linguistics, bibliography, etc. It is also applicable to any information processing and work field that needs to express languages in code form, including occasions for information exchange (especially computer systems) in library, information service and publishing work.
Some standard content:
ICS 01. 140. 20 National Standard of the People's Republic of China GB/T4880.2—2000 eqv ISsO 639-2:1998 Codes for the representation of names of languages-Part 2:Alpha-3code Published on October 17, 2000 Implemented on May 1, 2001 Published by the State Administration of Quality and Technical Supervision GB/T4880.2—2000 ISOForeword Quoted 3 Definitions… Language codes 4.1 Format of language codes 4.2 Registration of new language codes 4.3 Application of language codes 4.4 Application of country and region codes 5 Structure of language code tables Table 1 Three-letter language codes sorted by the Chinese name (Pinyin) of the languageTable 2 Three-letter language codes sorted by the English name of the languageTable 3 Three-letter language codes sorted by the bibliographic code (GB/T4880).2/B)Table 4 Three-letter language codes sorted by the terminological code (GB/T This standard is equivalent to GB/T 4880.2-2000. This standard adopts GB/T 639-2:1998 "Language Name Codes-Part 2:3 Parent Codes". All language codes are completely consistent with this international standard. Taking into account the wide application of language codes in terminology (such as terminology database) and bibliography (such as document cataloging), especially the fact that 23 codes used in terminology are different from those used in bibliography. Therefore, the code group used in terminology (T) and the code group used in bibliography (B) of this international standard are retained. For the convenience of domestic users, the Chinese name of each language is added, and the 3-letter language code table sorted by the Chinese name (pinyin) of the language replaces the 3-letter language code table sorted by the French name of the language. Appendix A of this standard is the appendix of Tier. This standard is jointly proposed by the National Technical Committee for the Promotion of Terminology and the National Technical Committee for the Promotion of Information and Documentation. This standard is under the jurisdiction of the China Standards Research Center. This standard was drafted by the China Standards Research Center, the China Institute of Scientific and Technical Information, and the Encyclopedia Publishing House. This standard is interpreted by the National Technical Committee for the Promotion of Terminology and the National Technical Committee for the Promotion of Information and Documentation. The main drafters of this standard are: Li Li, Ding Xinli, Liang Zhanping, Shen Xialan, Zhang Ai, Xiao Xiajing, Xu Junrong, and Tian Ye. GB/T4880.2—2000 ISO before The International Organization for Standardization (ISO) is a worldwide alliance of national standardization bodies (IS(R) member bodies). The formulation of international standards is carried out through ISO's technical committees. Member bodies interested in a professional field for which a technical committee has been established have the right to participate in the technical committee. Official and non-official international organizations associated with ISO also participate in this work. ISUO works closely with the International Electrotechnical Commission (IEC) in the field of electrotechnical standardization. Draft international standards adopted by the technical committee are circulated to the member bodies for voting. Publication as an international standard requires approval by at least 75% of the member bodies voting. ISO639-2 International Standard was jointly developed by ISO/TC37/SC2 Technical Committee on Terminology (Principles and Coordination), Subcommittee 2 "Vocabulary Planning" and ISO/TC46/SC4 "Information and Documentation Technical Committee, Subcommittee 4 - "Application of Computers in Information and Documentation". ISO639 consists of 2 parts under the general title \ Language Name Code: Part 1: 2-letter code (revised [SO639: 1988) Part 2: 3-letter code Appendix A is the standard appendix. It is part of the ISO 639-2 standard. Introduction ISO639 provides two language code groups to represent language names, one is a 2-letter code group (ISO639-1) and the other is a 3-letter code group (ISO639 2). ISO639-1 is mainly used for terminology, lexicography and linguistics. ISO 639-2 includes not only all languages in ISO 639-1, but also languages outside ISO 639-1, and also includes a collection of languages for special purpose coding when more detailed coding is required. The languages listed in ISO 639-1 are a subset of the languages in ISO 639-2. Each language code in the 2-letter code group has a corresponding code in the 3-letter code group, but not every language code in the 3-letter code group has a corresponding code in the 2-letter code group. Both code groups are open-ended and have been used for terminology, lexicography, information and literature (e.g. for libraries, information services and publishing houses) and linguistics. This standard also includes guidelines for the establishment of language codes and some methods for their application. This standard provides two sets of three-letter codes for representing language names, one for terminology and the other for bibliographical applications. The codes for all but 23 languages are the same in both sets: these 23 languages have different codes in the terminology and bibliographical code sets because the rules for standardizing the language codes are different (see 4.1). These language codes are primarily used to indicate language when exchanging information (especially computer systems) in library, information services and publishing work. These codes have become widely used in library systems and in applications where terminologists and lexicologists need to indicate language by code. 1 Scope National Standard of the People's Republic of China Codes for the representation of names of languages-Part 2: Alpha-3 code GB/T 4880.22000 eqv IS0 639-2. 1998 This standard specifies the 3-letter codes for 431 languages (including collective languages). Each language has its corresponding code for terminology (T) and code for dictionaries (B). Except for 23 languages, the T codes and B codes of other languages are the same. This standard is applicable to terminology, lexicography, linguistics, dictionaries, etc. It is also applicable to any information processing and work field that requires the representation of languages in the form of codes, including the field for information exchange (especially computer systems) in library, information service and publishing work. 2 Referenced Standards The clauses contained in the following standards constitute the clauses recommended by this standard through reference in this standard. When this standard is published, the versions shown are valid. All standards will be revised. The parties using this standard should explore the possibility of using the latest versions of the following standards. GB/I2659-2000 Codes for names of countries and regions in the world (ncq1S03166:1997) GB/T 4880-1991 Codes for names of languages (eqVIS0639:1988) 3 Definitions This standard adopts the following definitions. 3.1 Language code languagecodewwW.bzxz.Net Uses a combination of characters representing a language and a group of languages. 3.2 Collective language code colicctive language code A language code used to represent a group of languages. 4 Language code 4.1 Form of language code A language code consists of 3 lowercase letters, without using diacritical marks and umlauts. These codes are not abbreviations of languages, but rather the identification of a language. The language code source is the language name. It includes two code groups, one for monthly recording (GB/T4880.2/B) and the other for terminology (GB/T488G.2/T). The rules for selecting the language code form in the B code group are: ...· Countries using the language have priority; Codes already used in national or international directory databases; : The native language form or English form of the language. The rules for selecting the language code form in the T code group are: Approved by the State Administration of Quality and Technical Supervision on October 17, 2000 and implemented on May 1, 2001 The native language form of the language; - Countries using the language have priority. GB/T 4880.22000 There are 23 languages with different codes in these two code groups. It is the development trend of language codes to base them on the native form of the language as much as possible, unless the country using the language requires the use of another language code. The catalog code group or terminology code group must be used as a whole, and the two parties exchanging information must make a clear choice before the exchange. When used for cataloging, users should use the GB/T4880.2/B code group, and when used for terminology, the GR/T4880.2/T code group should be used. In order to ensure continuity and stability, the code can only be changed when it is necessary. If changed, the original code flow should not be assigned to other languages within 5 years. In order to ensure the continuity of the system of many application language codes, if the name of the language or its abbreviation changes, the code in GB/T4880.2/B should not be changed. When GB/T1880.2 is adapted to languages using other alphabets (such as Cyrillic alphabet), the rules of this standard should be followed to standardize the language code. 4.1.1 Collective language codes Collective language codes may be used when there are a relatively small number of related documents or when it is desirable to write, record or create a relatively small number of related documents. The words "languages" or "(other)" appearing in the following tables are part of the language name and may be used to indicate that a language code is a nested language code. Collective language codes should not be used when a separate language code exists for a particular language. If a collective language code is necessary, it should be as specific as possible. 4.1.2 Special cases When several languages are used and it is not necessary to specify all language codes, the language code "tnul" (for multilingual) should be used. When one or more languages must be specified and these languages cannot be identified, the language code "und" (for undetermined language) should be used. 4.1.3 Scripts and dialects Although a language is written in one script, a single language code should normally be used. Additional standards may be established for information about the script or writing system of a language. Dialects of a language are usually indicated by the language code of the language. If a collective language code is assigned to a language, its dialects are also assigned the same collective language code. If a language has a separate language code, its dialects are included in that code, rather than in the collective language code to which they belong. However, in a few cases, a language and its dialects each have their own separate language code. 4.1.4 Local application codes Codes from "qaa\ to \qtz\ are reserved for local languages, including the treatment of local dialects. These codes can only be used locally and cannot be exchanged internationally. 4.1.5 Ancient languages Modern languages that are not given separate language codes are assigned codes based on the codes of the main languages to which they belong, not the codes of the modern languages from which these modern languages evolved. For example, old Frisian is assigned the language code \gem\ of the Orocoronal languages (others), not the language code \fry of the modern language Frisian. To prevent misunderstandings, help can be obtained from the registration authority. 4.2 Registration of new language codes The registration authority for three-letter language codes is the Library of Congress (Library of Congress Washington, DC 20540-4102 USA Network Development and MARC States Office>). The 2-letter language code is registered by the International Terminology Information Center (Infoterm, Simmeringcr Hauptstrassc 24, A-lll0 Vicnna, Austria)1) and approved by the ISO Council. 4.3 Application of language codes GB/T 4880.2—2000 The language code can be used in the following specific examples. Each example gives an example of how to use the language code. 4.3.1 Indicate the language in which the document is written or recorded. Example: UNIMARC format. Field 1CI publication language. A booklet is published in Danish (language code \dan" for Danish), Field 101:0# Sadan 4.3.2 Indicates the language used when creating bibliographic records (sort records, catalog records, and other records). Example: UNIMARC format, field 100 Cataloging language, symbol positions 22-24... - The book is written in Dutch (IJutch), but the catalog records are written in English (language code \eng" for English). Field 100 positions 22-24: eng 4.3.3 Indicate the language that conference delegates can use to speak (you can also choose another standard, such as GB/T 4880: 1991) Example: The codes "eng", "fre", and "rus" on the list of delegates published by an ISO conference indicate that conference delegates can speak in English (eng), French (fre) and Russian (rtus). 4.3.4 Indicate the original language of the document Example. United Nations document: ST/DCS/Rev. 2 cng fre document is bilingual: English (eg) and French (fre), 4.4 Application of country and region codes The country and region codes in GB/T 2659 can be used in combination with language codes to indicate the region where a term, phrase or language is used, Example 1: n spool of thread (eng US) -a bobbin of cotton (eng GB) Example 2: Information (zho CN) Information (zho Tw) Note: The Chinese (zho) term "information" is used in Mainland China (CN); the Chinese (zho) term "information" is used in Taiwan (TW). 5 Structure of the language code table The language code is divided into 4 tables: Table 1 3-letter language codes sorted by the Chinese name (pinyin) of the language Table 2 3-letter language codes sorted by the English name of the language Table 3 3-letter language codes sorted by the diary code (GB/T4880.2/B) Table 4 3-letter language codes sorted by the terminology code (GB/T4880.2/T) 2) UNIMARC (UJNIversal Marchine Readable Cetaloging) format is used to exchange diary data. 3 Chinese name Abkhaz Adamme Albany" Algonquian languages Altaic languages (other) Afar Afrikaner Afrikani Akkadian Akan Arabic Aramaic Akonpaho Arapek Araucan A Liuzhong Language Amharic Apache Languages Acholi Assamese Athapascan Languages Azerbaijani Awaddi Avar Avestan Ephik Egyptian (Japanese) Ekchuk Elamite Ewe Ewondo Nymara GB/T4880.2—2000 Table 1 Three-letter language codes in order of their Chinese name (Pinyin) English name Abkhazian Adangme Albaniana Algonquian languages Altaic (Other) Afrikaans Afrihili Akkadian Arabic Aramaic Aranaho Arawak Araucanian Amharic Apache languages Assamese (Assami) Athapascan languages Azerbaijani Awadhi Avaric Avestan Egyptian(Ancient) Ekajuk Elamite Ewonda Aymara 3) For the reasons of different codes, see 1.1. GB/T 4880.2/B Self-recorded code GB/T4880.2/T Technical code Irish Chinese name Irish, Middle Ages (before 900) Irish, Middle Ages (900-1200) Essanian Australian languages Ojibwa Oxsey (after 1500) Oryan Olodien Osage Ossetian Austranesian languages (others) O Tomi Speech Babushin Languages (Other) Badong Pali Bamelik Languages Bassa Bashkir Basque\ Batak (Indonesian) Baya Belarusian Berber Languages Bambara Banda Bantu Languages (Other) Pangasinan Pampanga Bulgarian North American Indian Languages (Other) Beja GB/T 4880.2—2000 Table 1 (continued) English name Irish.Old(to 900) Irish,Middle(900-1200) Estonian Austrslien languagrs Ojibwa Orritan (pust 1500) Ossetie Austronesian(Other) Otomian languages Papuan(Other) Balinese Bamileke languuges ||tt | |tt||North American indian (Other) GB/T 4880.2/B Monthly Record Code GB/T 4880.2/T Term Code Bemba Bihari Bikol Bini Ru language name Bislama Balochi Icelandic" Polish Baltic Languages Jing Pohnpei Speaks Persian" Persian, Old (c. 600--400 B.C.) Pahlavi Bhojpuri Bhutanese Bugi Bulagi Bunayat Breton Chagatai Chamorro Korean Chushi Dakota Dravidha Dayak Tai languages (other) Tanyou Bihari Bislama Baluchi GB/T 4880.2—2000 Table 1 (continued) English name Icelandie3) Pulish(Polski) Baltic(Other) Pohnpeian Persian3 Persian.Old (ca. 600-400 BC ) | |tt | (Other) Tai(Othcr) Danish 8 GB/T 4880.2/T G3/T 4880. 2/B Catalogue code Term code German: Chinese name German, Old Highland (ca. 750-1050) German. Middle Highland (ca. 1050-1500) Diveti Dyoula Tida Dinka Dushaola Dogra Dogdanbu Multilingual+ Ndebeao, Northern||tt| |Enderle, Southern Ndunga Nzima Faroese French\ French: Old (842-c. 1400) French: Middle (c. 1400-1600) Fanti Philippine Languages (Other) Phoenician Fijian Finno-Ugric Languages (Other) Finnish 1See 4.1.2 for an explanation of this code. GB/T 4880. 2--- 2000 表「(续) 英语名称 German German,Old High (ca.750-1050) Gelan,Middle High (cH, 1050-1500) I Divehi Dogrib Multiple languagest) Russian Ndebele.North Ndebele,South Ndanga Faroese Frenchs) French,Old(842-ca.140u) French,Middle (c.1400-1600) Sanskrit Philippine(Other) Phenicien Fijian Finno-Ugrian(Other) Finnish|| tt||GB/T 4880.2/B GB/T 4880.2/T Ear recording code Terminology code Chinese name Frisian Friulian Fula Gaelic (Scottish) Ganda Kongo Caucasian Languages (other) Khmer |Yulundaluo Gothic Glebogic Greenlandic Pugian| Gonde| |tt||Gujarati Syriac Guarani Interlingua A Interlingua E Kazakh| |tt||Haida Hanjiang\ Sino-Tibetan languages (other) Hausa Dutch" Dutch ·Medieval (c. 1050-1350) ·Hittite Frisian GB/T 4880.2--2000 Table 1 (continued) English name Friulian Gaelic(Scots) Caucasian(Other) Gorantala Gothie Kalaallisut Geargiana Gujarati Syriac Guarani Interlingua (Intcrnatignal Auxiliary) Language Association) Interlingue Kazakh Haiela Chinese3 Sino-Tibetan(Other) Duich: Dutch .Middle (ca. 1050-1350) Hercro Hittite GB/T4880.2/BGB/T4880.2/T Directory Code Term code Tip: This standard content only shows part of the intercepted content of the complete standard. If you need the complete standard, please go to the top to download the complete standard document for free.