title>Terminology work-Computer applications-Vocabulary - GB/T 17532-1998 - Chinese standardNet - bzxz.net
Home > GB > Terminology work-Computer applications-Vocabulary
Terminology work-Computer applications-Vocabulary

Basic Information

Standard ID: GB/T 17532-1998

Standard Name:Terminology work-Computer applications-Vocabulary

Chinese Name: 术语工作 计算机应用 词汇

Standard category:National Standard (GB)

state:Abolished

Date of Release1998-01-01

Date of Implementation:1999-06-01

Date of Expiration:2005-12-01

standard classification number

Standard ICS number:General, Terminology, Standardization, Documentation>>Vocabulary>>01.040.01 General, Terminology, Standardization, Documentation (Vocabulary)

Standard Classification Number:General>>Basic Standards>>A22 Terms and Symbols

associated standards

alternative situation:Replaced by GB/T 17532-2005

Procurement status:=ISO/DIS 1087-2-2-1996

Publication information

publishing house:China Standard Press

ISBN:155066.1-15943

Publication date:2004-04-15

other information

Release date:1998-11-05

Review date:2004-10-14

Drafting unit:State Language and Writing Commission

Focal point unit:Institute of Language and Writing Application, State Language and Writing Committee

Publishing department:State Administration of Quality and Technical Supervision

Introduction to standards:

This standard specifies the terms used in language and information processing in terminology work and terminology compilation. This standard applies to the research, development, maintenance and management of terminology databases, and can also be used as a reference in other work involving terminology data processing. GB/T 17532-1998 Vocabulary of computer applications in terminology work GB/T17532-1998 Standard download decompression password: www.bzxz.net
This standard specifies the terms used in language and information processing in terminology work and terminology compilation. This standard applies to the research, development, maintenance and management of terminology databases, and can also be used as a reference in other work involving terminology data processing.


Some standard content:

GB/T17532—1998
This standard adopts ISO/DIS1087-2-21996 "Terminology Vocabulary Part 2 Computer Application". Based on the actual situation in my country and the characteristics of Chinese language and Chinese characters, this standard has made the following changes when adopting ISO/DIS1087-2-2:1996:
1. According to the naming convention of my country's terminology standards, the standard name is changed to "Terminology Computer Application Vocabulary". 2. The contents of some term notes are directly introduced into the definition, and the contents in the definition are changed to notes. 3. Some Chinese examples are added.
4. Eight terms related to Chinese characters are introduced from GB/T12200.2--1994 "Chinese Information Processing Vocabulary Part 02: Chinese Language and Chinese Characters".
5. Appendix B Bibliography is deleted.
This standard is one of the series of national standards for terminology databases. The supporting standards that have been issued are: GB/T13725--1992 General principles and methods for establishing terminology databases Magnetic tape format for recording and exchanging terminology and dictionary entries GB/T13726--1992
GB/T15387.2-1994 Guide to terminology database development GB/T15387.1--1994 Guide to document compilation for terminology database development GB/T15625--1995 Guide to technical evaluation of terminology databases GB/T16785--1997 Coordination of concepts and terms in terminology work GB/T15786--1997 Data categories for computer applications in terminology work This standard is proposed by the National Technical Committee for Terminology Standardization. This standard is under the jurisdiction of the Language and Character Application Research Institute of the National Language and Character Working Committee. This standard was drafted by the National Language and Character Working Committee, China Institute of Standardization and Information Classification and Coding, Encyclopedia of China Publishing House, Institute of Computational Linguistics of Peking University, China Institute of Scientific and Technical Information, and other units. This standard is interpreted by the Computer-Assisted Terminology Subcommittee of the National Technical Committee for Terminology Standardization. Appendix A and Appendix B of this standard are informative appendices. The main drafters of this standard are: Feng Zhiwei, Li Li, Wang Yuli, Yu Shiwen, Zhou Zhiyou, Song Zhengliang, Li Zhu, etc. 516
GB/T17532—1998
ISO Foreword
The International Organization for Standardization (ISO) is a worldwide alliance of national standardization bodies (ISO member bodies). The formulation of international standards is usually carried out through ISO's technical committees. Member bodies interested in a professional field for which a technical committee has been established have the right to participate in the committee. Official and unofficial international organizations associated with ISO also participate in this work. ISO works closely with the International Electrotechnical Commission (IEC) in the field of electrotechnical standardization. The draft international standards adopted by the committee must be circulated to the member bodies for voting, and at least 75% of the member bodies must vote to approve them when they are published as international standards.
International Standard ISO 1087-2 was prepared by Technical Committee ISO/TC37/SC3 Terminology (Principles and Coordination), Subcommittee 3, "Subcommittee on Computer-Assisted Terminology". As a technical revision of Chapter 7 of ISO 1087:1990, this standard annuls and replaces this part of the standard. The technical revision of the main chapters of ISO 1087:1990 will be developed into ISO 1087-1. Annexes A and B are informative annexes.
0 Introduction
National Standard of the People's Republic of China
Terminology work--Computer applications—VocabularyGB/T 17532--1998
eqv IS0/DIs 1087-2-2:1996
The items of this standard are given in sequence, and each part has a general title. Under normal circumstances, the following items should in principle be defined by the previous items.
The format of the items shall comply with the provisions of GB/T1.6--1997. The order of the contents of the items is as follows:
Item number
Preferred term (bold)
Abbreviated form (bold)
Permitted term
Rejected term: "(rejected)" is indicated in brackets. Professional fields are indicated in angle brackets>
Definition referenced terms are marked in bold and followed by the item number in brackets. Example
In addition to the item number, preferred terms and definitions appear only in appropriate places. 1 Scope
This standard specifies the terms used in terminology work and terminology compilation. Terms related to language and information processing. This standard applies to the research, development, maintenance and management of terminology databases, and can also be used as a reference in other work involving terminology data processing.
2 Cited standards
The provisions contained in the following standards constitute the provisions of this standard through reference in this standard. When this standard is published, the versions shown are valid. All standards will be revised, and parties using this standard should explore the possibility of using the latest versions of the following standards. GB/T12200.2-1994 Chinese Information Processing Vocabulary Part 02: Chinese and Chinese Characters GB/T12991—1991 Information Processing System Database Language SQL (idtISO/IEC9075:1989) GB/T15237-1994 Basic Vocabulary of Terminology (neqISO1087:1990) GB/T 5271.8—1993
Vocabulary Part 08: Control, integrity and security (eqvIS02382-8:1986)Data processing
ISO/IEC2382-1:1993Information technology
ISO 2382-4:1987
Information processing systems
ISO 2382-6:1987
Information processing systems
ISO/IEC 2382-9:1994
ISO/IEC 2382-23:1994
Information technology
Information technology
Approved by the State Administration of Quality and Technical Supervision on November 5, 1998 518
Part 1: Basic terms
Part 4: Organization of data
Part 6: Preparation and processing of data
Part 9: Data communication
Part 23: Text processing
Implementation on June 1, 1999
3 General concepts
3.1 Information
GB/T 17532—1998
Information processing Knowledge about an object (such as facts, concepts, events, ideas, processes, etc.) has a specific meaning in a certain context.
1 This clause is quoted from ISO/IEC 2382-1.
2 In other application areas, the definition of information is different. 3.2 Data data
Formal representation of information (3.1) for communication, interpretation and processing. Note: This clause is adapted from ISO/IFC2382-1.
3.3 Data processing data processing
Systematic operations on data (3.2). Examples: arithmetic or logical operations on data, merging (9.5) or sorting (9.4) of data, assembly or compilation of programs, and operations on text (3.6), such as text editing (12.3), sorting, merging, storage, retrieval, display (9.6), printing, etc. Note: This term cannot be used as a synonym for information processing (3.4). 3.4 Information processing information processing Systematic operations on information (3.1), which includes data processing (3.3). Note:
1 This clause is adapted from ISO/IEC2382-1.
2 This term cannot be used as a synonym for data processing (3.3). 3.5 Language processing languageprocessing Systematic operations on language, including data processing (3.3). 3.6 Text text
Structured data (3.2) composed of characters (6.1), symbols, words, phrases, sentences, paragraphs, tables or other sequences of symbols used to express meaning. Its interpretation is mainly based on the knowledge of a natural or artificial language by the person reading the text. Note: This clause is quoted from IS0/IEC2382-1.
Text corpus text corpus
Corpus corpus
<Natural language processing Machine-readable text prepared, encoded or stored according to predetermined rules (see 7.4) or an organized collection of parts of a text.
Note: A text corpus can be defined differently by field, volume or era, e.g., a specific journal or mathematical text from 1986 onwards. A text corpus can be used as raw material for further linguistic analysis or terminology work. 3.8 Type type
<Natural language processing unit of language that represents a definite class in a text (3.6). Note: Such a unit of language is usually an uninterrupted string of characters that is not separated by delimiters (3.10). 3.9 Token
<Natural language processing unit of language (3.8) in a text. Example: In English, if all forms of good are defined as types, then good, better and best are all examples of the word form good.
3.10 Delimiter
Separator
One or more characters (6.1) used to indicate the beginning or end of a string (6.13). 519
1 This clause is quoted from ISO2382-4.
2 Blank or punctuation marks are often used as delimiters. 3.11 Word form
Any morphological and syntactic variant of a given word.
Example: In English indicate: indicates visitor: visitor's.
GB/T 17532 -1998
1 In inflectional languages, word forms are often inflectional forms, for example, gogoes in English; and go, went. gone. 2 This definition does not include orthographic variants. 3.12 Word form paradigm
<Natural language processing) A cluster of word forms (3.11) belonging to a given word. 3.13 base form
reference form
The form (3.11) selected according to lexicographical conventions to represent the various forms in a morphological pattern (3.12). Example: In English, bind is the base form of bind, bound, binds, binding, etc. Notes
1 This clause is adapted from GB/T15237-1994.
2 The term "basic form" can also be applied to phrase terms. 3.14 Multi-word term includes terms with more than two words.
compressed form
normalized form(discarded)reduced form(discarded)
A string that has been compressed (8.4).
Example: The string \input/output-algorithm\ becomes its compressed form "inputoutputalgorithm" Note: The terms "reduced form" and "normalized form\" may cause misunderstandings and it is recommended to avoid using these two terms. 3.16 Deinflection
Deinflection of the inflectional component in the word form (3.11). 3.17 Deinflected word form The tone segment (3.18) that remains after the deinflection (3.16) is removed. 3.18 word part
word segment
a string of characters (6.13) taken from a word form (3.11) for a special purpose. 3.19 lemmatization
the process of generating a base word form (3.13) from a given word form (3.11). Example: In English, "go" is the basic form (3.13) of "goes" by canceling the inflection (3.16); while "go" is the basic form (3.13) of "went" by transforming irregular verbs, which does not conform to the standard inflectional rules. Note: Such a result is also called word form restoration. 3. 2Q. Parsing
The operation of decomposing a given structure into its constituent components according to a given algorithm. Example: In the English sentence "all unsaturated fatty acids are not degradable by biological methods", "allunsaturated fatty acids" and "biological methods" can be regarded as phrase terms (3.14), which can be extracted (8.9).
GB/T17532-1998
Note: Parsing does not necessarily provide a complete analysis of a sentence. terminological data collection3.21
a collection of data (3.2) containing information (3.1) about various concepts in a particular professional field. terminological entry3.22
terminological data (3.2) about a concept contained in a terminological data collection (3.21). Note: A terminological entry may contain more than two records (7.9). 3.23 homograph
two or more word forms or words that have the same written form but represent different concepts (semantic homographs) or different syntactic functions (syntactic homographs).
Examples: in English, lead (lead) and lead (leadership); bark (bark) and bark (bark).
in Chinese, instrument (a person's appearance) and instrument (an instrument for measuring temperature, pressure, etc.); cuckoo (cuckoo) and cuckoo (azalea). Note: Word forms (3.11) that have different written forms but become the same after compression (8.4) are not considered homographs. 3.24 Disambiguation
The process of differentiating homographs (3.23) by assigning them appropriate concepts or appropriate syntactic functions, or differentiating homographs by assigning them different linguistic interpretations, e.g., in Chinese, differentiating the "白" in "白跑" and "白纸" into adverbs and adjectives, respectively; differentiating the syntactic structures in "学习文件" into verb-object structures and modifier-predicate structures, respectively.
4 Data organization
4.1 Sorting value
sort value
sorting value
The position of an element in a character set (6.2) according to a predetermined order. E.g., in French, the letter A has a lower sorting value than the letter B. Whether a lowercase letter has the same sorting value as its corresponding uppercase letter depends on the needs of the actual application. Letters with diacritical marks are sometimes treated as their corresponding base letters and sometimes as different alphabets.
4.2 sort key
sort key
A character string (6.13) used to satisfy the requirements of sort (9.4) and merge (9.5) operations. Example: When sorting book data to produce an author list, the author's last name is the first sort key and the author's first name is the second sort key. If there are two titles for the same author, the year of publication or the title is used as an additional sort key. 4.3 alphabetical ordering When organizing character strings (6.13), the position of each string (6.12) in the table is uniquely determined by the sort value (4.1) starting from the head of the string (6.12).
1 Letters with diacritical marks and ligatures, as well as other symbols with numbers, superscripts, and subscripts, can be used as special sort values. 2 The rules for alphabetical ordering can vary from language to language. Reverse alphabetical ordering4.4
When organizing strings (6.13), the position of each string (6.12) in the table is uniquely determined by the classification value (4.1) starting from the end of the string (6.12).
4.5 Frequency order frequencyorder
In a specific text (3.6) or text corpus (3.7), the order of types (3.8) is arranged according to the order of increasing or decreasing frequency of occurrence of class examples (3.9).
GB/T 17532-—1998
Note: In general, the type of table is a word form (3.11) frequency table or a base word frequency table. 4.6 Permutation
Sort each content word in a phrase-type term as a keyword. Example: In English, when permuting the string "millions of instructions per second\[MIPs],"instructions,mil-lions of per seconds\,\second.millions of instructions per\ and other forms appear in the sorting list of content words such as\instructions\ and "second\, respectively. This ensures that any desired component in the term can appear in the sorting position of the corresponding letter.
Note: This clause is adapted from GB/T15237.-1994. 4.7 Concatenalion
Two or more character strings (6.13) are merged in a specific order to form a new string (6.12), whose length is equal to the sum of the lengths of the individual character strings (6.13).
5 Screening of term data
exclusion list
non-use word list
stop word list list
A list of character strings (6.13) that are arbitrarily selected to be disregarded in data processing (3.3). NOTES
1 In terminology work, it may be useful to produce a list of words that ignore function words (pronouns, articles, etc.). 2 Sometimes, character strings in a disuse list may be retained (e.g. in a word index (5.4)), but their frequencies are not noted. 5.2 inclusion list
plus word list
A list of character strings (6.13) that are to be retained or considered for further data processing (3.3). Example: All sentences containing "bank" or "credit institution" are extracted; all entries beginning with "M" are retrieved in the name index; all words beginning with "anti-" are selected. NOTE: A list of character strings (3.18) or other character strings (6.13) may also contain word fragments (3.18) or other character strings (6.13), if appropriate. 5.3 Free-text search free-text search is performed in a text corpus (3.7) and can search for any type (3.8) of search (8.7). 5.4 Concordance
<Natural Language Processing> An alphabetical list of word forms (3.11) extracted (8.9) from the original text, including the word form (3.11) to be searched and the previous and following parts of the word form (3.11) in the original text. Note: The usual forms of concordance are KWIC (keywords in context) concordance and sentence concordance. 5.5 Index index
<Natural Language Processing> A collection of ordered strings (6.13) of data elements of the same type extracted from a source. Note: This clause is adapted from GB/T 15237--1994. 5.6 term index
An index (5.5) compiled with reference to term entries (3.22) in a vocabulary or terminology database (7.6). Note: The terms in a term index may be sorted alphabetically or in some other system. alphabetical index
An index arranged in alphabetical order (5.5). 5.8 systematic index
An index in which all terms are arranged to reflect the systematic relationship between concepts in a particular professional field (5.5). 5.9 permuted index
GB/T 17532--1998
An index (5.6) listed in a permuted (4.6) manner. reverse index
An index (5.5) listed in reverse alphabetical order (4.4). Inverted index
Lists the index of all types (3.8) and all corresponding class examples (3.9) in the text (3.6) [(5.5). 2 Word form index
Word index
The index of all word forms (3.11) in the text (3.6), where word forms (3.11) that appear more than twice in the text (3.6) are represented by a separate type (3.8), and their number of occurrences (i.e., corresponding class examples (3.9)) and their original text references are indicated. 5.13 Lemmatized word index The word form index (5.12) represented by the base form (3.13) of the word form (3.11). 6 Characters
6.1 Characters character
Members of a set of elements used to represent, organize, or control data (3.2). NOTE
1 This clause is derived from ISO 2382-4.
2 Characters may be divided into graphic characters and control characters. Character set character set
A finite set of different characters (6.1) which is complete for a given purpose. Example: the international reference version of ISO 646,
128 ASCII characters.
NOTE: This clause is derived from ISO 2382-4.
6.3 Alphabet
The set of alphabetic characters (6.6) arranged in a recognized linear order. NOTE
1 This clause is adapted from ISO 2382-4.
2 This definition also covers the alphabets of natural languages ​​consisting of symbols (6.1) containing letters with diacritical marks. 6.4 graphic character a character (6.1) that has a visual representation and is usually written, printed or displayed (9.6), and is different from a control character (6.5). NOTE
1 This clause is taken from ISO 2382-4.
2 A graphic character may be an alphabetic character (6.6), a digit (6.7) or a special character (6.9). 6.5 control character a character (6.1) that performs a control function in a specific context. NOTE
1 This clause is taken from ISO 2382-4.
2 A control character (6.1) starts, modifies or stops a data processing (3.3) operation. 6.6 alphabetic character a letter of the graphic character (6.4) used in the written representation of a given language. NOTE This includes letters with diacritical marks. 6.7
Digit
Numeric character
A character that represents a non-negative integer (6.1).
Note: This clause is adapted from ISO 2382-4.
GB/T 17532—1998
6.8 Alphanumeric character Any character (6.1) that is either an alphabetic character (6.6) or a number (6.7). 6.9 Special character Any graphic character (6.4) that is not an alphanumeric character (6.8). For example, any punctuation mark, the ampersand [&], the percent sign [%], the plus sign [+], etc. are special characters. Note: This clause is adapted from ISO 2382-4.
6. 10 Space character A control character (6.5) that causes the print or display position to move forward one position along a straight line. Note: This clause is adapted from ISO 2382-4.
blank
A character (6.1) representing an empty position in a sequence of graphic characters (6.4). Note: This clause is adapted from ISO 2382-4.
6.12 string
A sequence of elements with the same properties as a whole. Example: a sequence of characters (6.1), binary bits or pulses. Note: This clause is adapted from ISO 2382-4.
6.13 character string
A sequence of multiple characters (6.1) not separated by whitespace. Note: A character string often represents a word. 6.14 pattern
A string (6.1) defined when some characters (6.1) are replaced by combining characters (6.1).12). Note: This clause refers to GB/T 12991.
6. 15 Chinese character Chinese character The text that records the written Chinese language.
Chinese character character form
Specially refers to the two-dimensional graphics that make up the square Chinese character (6.15). Note: This clause is quoted from GB/T12200.2.
6.17 Stroke
The smallest connected stroke unit that makes up the regular script Chinese character form (6.16). The most basic strokes of Chinese characters include horizontal (一), vertical (1), dot (、), left-falling stroke ()), and right-falling stroke (乙). www.bzxz.net
Note: This clause is quoted from GB/T12200.2.
3 Stroke order
The order and direction of the strokes (6.17) when writing each Chinese character (6.15). Note: This clause is quoted from GB/T12200.2.
6.19 Stroke count
The number of strokes (6.17) that make up a Chinese character (6.15) or a Chinese character component (6.20). Note: This clause is derived from GB/T12200.2.
6.20 Chinese character component Chinese character component A character-forming unit composed of strokes (6.17) that has the function of assembling Chinese characters (6.15). Modern Chinese character components can be divided into character-forming components and non-character-forming components according to whether they form a character.
Example: Character-forming component: 木心口也
Non-character-forming component: 』
Note: This clause is derived from GB/T12200.2.
6.21 Radical indexing companent GB/T 17532 -1998
~Some components that can be used to form characters in batches. All characters that contain a certain component are arranged together in the dictionary, and the component is listed at the beginning as the leading unit, which becomes the basis for looking up characters and is called the radical. Radicals are mostly composed of radicals. For example, wood (pine, cypress, poplar, frame, apricot, forest, phase Note: This entry is quoted from GB/T12200.2.
6.22 Radical
The traditional name of the unit of compound characters, formerly known as compound characters (composed of two or more Chinese character components), with the left being the radical and the right being the side, and modernly collectively referred to as the radical. Radicals are originally single characters. In ancient Chinese characters (6.15), radicals generally have the same shape as when they form a single character. In modern Chinese characters (6.15), there are some changes in radicals, such as "心" on the left is "十", "刀" on the right is "』"; "火" at the bottom is "", and the left "埠" and the right "邑" are both "阝", etc.
Note: This entry is quoted from GB/T12200.2.
7 Data storage
7.1 Data media data medium
Physical medium used to record data (3.2) and retrieve data (3.2). Examples: disk, CD-ROM, microfilm.
1 This clause is adapted from ISO/IEC 2382-1.
2 In the field of library and documentation work, data media are considered to be data carriers. 7.2 format
<Data processing> A predetermined organizational form of data (3.2). NOTE Format can be further divided into input format, storage format and output format. 7.3 data acquisition The process of collecting data (3.2) or loading and storing data (3.2) in a data processing system. 7.4 machine-readable data machine-readable data Data (3.2) that can be directly loaded into a data processing system, referred to as machine-readable data. Note: Usually refers to data on disks, tapes and CDs. Under certain circumstances, data on printed materials can also be regarded as machine-readable data by using optical character recognition (OCR).
7.5 Database database
A collection of data (3.2) organized according to a conceptual structure. Note: This clause is adapted from ISO/IEC 2382-1.
7.6 Terminological database database (7.5) containing terminological data. 7.7 Database data bank
A collection of databases (7.5) containing a management framework. 7.8 Term bank term bank
Terminological database terminological data bank database (7.7) containing terminological data (3.2). 7.9 Record record
A collection of data elements (7.11) treated as a unit. Note: This clause is adapted from ISO 2382-4.
7. 10 file file
a collection of named records (7.9) stored or processed as a unit. 525
Note: This clause is adapted from ISO 2382-4.
7.11 data element data element
GB/T 17532--1998
a unit of data (3.2) that has distinguishing characteristics in a certain context. Note: This clause is adapted from ISO 2382-4.
7.12 data field data field
a variable-length or fixed-length portion of a record that is stored for a specific data element (7.11). 7.13 data field directory data field directory a location index (5.5) of a data field (7.12) within a record (7.9), including the number (7.16), length and position of each data field (7.12) within the record (7.9). 7.14 data categorydata category
data element typedata element type
a statement of the type of a given data field (7.12). 7.15 identifieridentifier
one or more characters (6.1) used to identify or name a data category (7.14). NOTE
1 This clause is adapted from ISO 2382-4.
2 The naming of the identifier may indicate certain properties of the data category (7.14). 7.16 label
an identifier (7.15) or a keyword used to identify data (3.2) within or attached to a set of data elements (7.11). NOTE This clause is adapted from ISO 2382-4.
repeatable category
7.17 Repeatable category
Data category (7.14) that can appear more than twice in the same record (7.9). Example: In a multilingual terminology entry (3.22), some categories of some records (7.9) may be repeated for each language. In a monolingual terminology entry (3.22), categories such as synonyms, source, context, etc. may also be repeated. 8 Information retrieval
8.1 Information retrieval
information retrieval
The action, method or process of finding information (3.1) from stored data (3.2). NOTE
1 This clause is adapted from ISO/IEC 2382-1.
2 Information retrieval can be performed in either interactive or batch processing. 8.2 query language query language
<Information retrieval> a data manipulation language used by users to retrieve or modify data (3.2) stored in a database (7.5). 8.3 truncation
<Information retrieval> the process of converting word forms (3.11) into word segments (3.18) for the purpose of searching or selecting. Note: Truncation can be automatic or user-selectable. The direction of truncation can be left to right or right to left. 8.4 compression
<Natural language processing> the process of converting word forms (3.11) into simplified forms for the purpose of data processing (3.3). Note: Compression generally refers to the removal of diacritical marks, elimination of whitespace (6.11) and various special characters (6.9), and disregard for uppercase and lowercase distinctions. Compression is used, for example, when neutralizing certain spelling variants in order to generate references (5.5). 8.5 Search word
Search keyword searchkey
<Information retrieval uses a character string (6.13) to provide access records (7.9). 526
GB/T 17532 --1998
Note: Search word can be a term, word, character, etc. (English also includes a word or part of a word). 8.6 Stem search stem search
Degradation
The process of removing characters (6.1) from a string (6.13) one by one according to predetermined rules, and using the automatically generated search terms (8.5) to ask random questions.
Note: Depending on the different ways of word formation in different languages, the order can be from right to left or from left to right. 8.7 Search search
<Information retrieval>Automatically check a collection of data (3.2) to match (8.8) or extract (8.9) elements with given characteristics. Example: Find all names starting with J in a collection of names. 8.8 Match match
<Information retrieval>Determine the identity of items between different data (3.2) sets by comparison, so as to select or separate items with the same properties.
8.9 Extract extract
Information retrieval selects and removes certain items that meet predetermined properties from a group of items. 8.10 Weighted ranking
<Information retrieval ≥ Assigning different weights to search responses according to their relevance to the corresponding search query. 9 Data management
9.1 Read
Get data (3.2) from a data medium (7.1) and keep the data unchanged. Note: This clause is adapted from ISO 2382-6.
9.2 Write
Record data (3.2) on a data medium (7.1) permanently or temporarily. Note: This clause is adapted from ISO 2382-6.
9.3 Copy
Read (9.1) data (3.2) from a source data medium (7.1), keep the source data unchanged, and then write (9.2) to a target data medium. Example: Copy a file from a hard disk to a floppy disk. Note: This clause is adapted from ISO 2382-6.
9.4 sort
The act of separating items into groups and arranging them according to specified criteria. Note: This clause is adapted from ISO2382-6.
9.5 merge
The act of merging items from two or more sets of data (3.2) with the same given order into a larger set with that order.
Note: This clause is adapted from ISO2382-6
9.6 display
The visual display of data (3.2) on the screen. 9.7 mask
The act of using the pattern (6.14) of a certain character (6.1) to control which parts of the pattern (6.14) of another character (6.1) are to be retained or deleted.
Example: All vowels a, ei, o, u can be replaced by the character V representing the vowel category. Note: This clause is modified from ISO2382-6.
Tip: This standard content only shows part of the intercepted content of the complete standard. If you need the complete standard, please go to the top to download the complete standard document for free.