GB/T 8565.3-1988 Information processing Coded character set for text communication Part 3: Control functions for page imaging formats
other information
Release date:1988-12-30
Review date:2004-10-14
drafter:Chan Suk-yee, Wu Kin-cheung, Lee Shu-leung, Wong Wai-man
Drafting unit:National Computer and Information Processing Standardization Committee
Focal point unit:National Information Technology Standardization Technical Committee
Proposing unit:Ministry of Posts and Telecommunications of the People's Republic of China
Publishing department:National Standardization Administration
competent authority:National Standardization Administration
Some standard content:
National Standard of the People's Republic of China
Information processing
Codes character sets for text communication
Part 3
Control functions for page-image format
Information processing
Codes character sets for text commanicationPart S:Control functions for page-image formatGB 8565.3--88
This standard adopts the international standard IS06937/3 Coded character sets for text communicationPart 3 Control functions for page-image format.
1 Subject content
This standard specifies the general table of graphic characters and the general table of control functions for text communication and their coded representation. This standard consists of the following three parts:
Part 1 General provisions;
Part 2 Graphic character sets for text communicationPart 3 Control functions for page-image format. Parts 2 and 3 can be used in conjunction with each other or separately, but they are always used in conjunction with Part 1.
This part of GB8565.3 includes:
Specifies the general table of control functions used to determine the image of communication text; a
Specifies the coding representation of control functions,
Specifies the code expansion method used to specify and call graphic character sets: This part of this standard does not specify the control functions required to control the communication process (whether through remote communication or physical interchange of storage media).
2 Scope of application
This standard applies to text communicated at the coding interface of terminal equipment, and does not necessarily apply to the coded character sets used internally between the coding interface and the input/output devices of the terminal. When the relevant control functions are included in the text communication rather than separated from the text as a communication procedure element, the coded representation specified in this standard should be used.
. Cited standards
GB 1988
Seven-bit coded character set for information processing interchange GB2311
Extension method of seven-bit coded character set for information processing interchange GB3910
Line spacing and character spacing for office machines and data processing equipment GB3911
Graphical representation of control characters of seven-bit coded character set for information processing Approved by the State Administration of Technical Supervision on December 30, 1988 and implemented on July 1, 1989
GB8565.3—88||tt ||GB·5261
Supplementary control functions for text and symbol forming equipment GB9655 Information processing equipment Chinese character spacing and line spacing 4 Terminology
Unstandardized terms in Part 1 also apply to this part. The following supplementary terms in Part 3 apply: 4.1 Page image format (PIF) A display method for text images, which is determined by the sending end and used by the receiving end for display, and it does not accept the operation of re-formatting by the receiving end.
Note: Chapter 5 gives a more specific definition of page image format. 4.2 Page
A two-dimensional medium that can present text.
4.3 Text area
The part of the image on the page that can present text. Note: () In this part of this standard, the concept of "text" always refers to the message or image, not the medium that can be displayed. ② It is possible to send only the image of the variable content of the page (such as a grid), which, after receipt, is combined with the pre-defined text content or printed on a pre-printed form, so that the variable part is combined with the fixed content on the page. ③ The size of the text area and the initial position of the line are implicit in the page format specified by the page control function "Page Format Selection" (PFS). Appendix A specifies these characteristics of the page format.
4.4 Active position The character position at which the action required by the next character is affected. If the next character is a graphic character, its graphic is generated at this position. If the next character is a control function, its function is performed relative to this position. Note: The movement of the active position is as follows: (1) After the generation of a "space" or "shape character" (except for non-grid characters), the active position is advanced by ·· character positions. ② After executing a control function that moves the operating position (i.e., the fall control function), the operating position moves to the specified character position. 4.5 Line home position The reference position of each line, the starting position of the image of the text line. When the text is imaged from left to right, the reference position of the line is at or near the left border line of the text area; when the text is imaged from top to bottom, the reference position of the line is at or near the upper border line of the text area. Note: If the character position of the imaged line is not consistent with the reference position, one or more format control characters "space", "backspace" and (or) "character position stop movement" are used to move the operating position to the line home position accordingly. 4.6 Default value
The assumed value used when no value is explicitly specified: Note: This term is used for control functions with parameters. When the relevant control function is used and the specified parameters are omitted, the parameter value is used. This part of this standard defines the default values for most control functions. 4.7 Default state (default value) 4.8 Document
An ordered collection of one or more pages that the sender intends as a whole. 4.9 Page terminator A control function that delimits the text imaged on a given page and separates it from the text of the next frame (if any) that appears earlier in the same document. The frame terminator may be a control function generated by a "page terminator" (PT) or a "file terminator" (DT) enclosed in the text, or an equivalent function as a protocol unit. ..comGB8565.3-88
4.10 Document terminator A control function that delimits the text of a given document portion and separates it from the text of the next document (if any). The document terminator may be a control function generated by a "file terminator" (DT) enclosed in the text, or an equivalent function as a protocol unit.
5 Page-image concept
Text transmitted in page-image format consists of one or more files, each file consisting of one or more pages. Communication in page-image format means that the byte string of graphic characters is transmitted in the order in which the characters appear on the pages of the file. Communication starts from the beginning of the first text of the first file and proceeds line by line and page by page until the end of the last file. The data stream may be interspersed with the coded representation of control functions for the following purposes: a. "Page format selection" control function for specifying the size and orientation of the page, the size and position of the text area on the page; b. Format control function for specifying the position of the next graphic character if the next graphic character is not in the immediate position of the previous graphic character on the same line:
c. Control function for specifying the display style of graphic characters, such as font and character emphasis, spacing and line spacing: d. Control function for marking the end of a page or a file. Only in the following three cases can the interpretation of the coded representation of control functions and graphic characters not be confused:. In accordance with the above-mentioned text sequence, all graphic characters and "space" positions are within the text area, and only one graphic character can be placed in any character position on the page. Unless such position overlap is part of the graphic character encoding method in the graphic character table used. Text communications that do not follow this restriction are inconsistent with this part of this standard. The following are: ① Terminal equipment that compiles communication text and presents received text may use other sequences and other control function encodings internally after making necessary conversions (see Appendix C for examples). ② Certain control functions specified in this part of this standard are intended for use by terminal equipment with convenient copying capabilities. When terminal equipment such as soft copy equipment (such as a display) receives these control functions, it may not be able to accurately perform the text specified in this part of this standard. 6 Character table and coded representation
6.1 Composition of character table
The character table defined in this part of this standard consists of the following: a. Format control characters
“Format control characters” enable operations to move text areas on a page and from one page to another. b.Presentation control function
"Presentation control function" specifies presentation attributes, that is, the presentation mode of the following text. Page format and word spacing are examples of presentation attributes. At the beginning of each file transfer, all presentation attributes are assumed to be in the default state, and all presentation attributes are reset to the default state using a "page terminator" or "file terminator". c. Code extension control function
"Code extension control function" makes it possible to encode graphic characters and control functions other than the code set that satisfies the "-correspondence between the bit group and the symbol of the seven-bit and eight-bit codes". d. Other control functions
Those control functions that are not included in the above categories, such as "replacement characters". In addition to the above categories, text communication may also use other control functions. For example, specific transmission control functions required by low-level control procedures. However, this standard does not specify these control functions because these control functions are not specifically specified for text communication. GB 8565.3—8
In 7.1, 7.2, 7.3 and 7.4, format control characters, star current control characters, code expansion control functions and other control functions are defined respectively. wwW.bzxz.Net
6.2 Character set
From the perspective of coding, the total table of control functions defined in this standard can be divided into three sets: a. basic set
b. auxiliary set,
c. single control function set.
The basic set is a subset of the control functions in the GB1988-7-bit coded character set. When the basic set is used in an 8-bit code, the code expansion function replaces the "shift lock 1" and "shift lock 0" that can only be used in a 7-bit code. The "out" and "move in" auxiliary set is a set of the total table of control functions specified in GB5261. It consists of two parts: basic control function
b, and parameter control function.
The single control function set includes those lock and shift functions specified in GB2311 but not included in the basic set. The code table of the basic set, auxiliary set and single control function set is shown in Figures 1, 2, 3, 1, 5 and 6. The narrow position in the code table indicates the bit group reserved for future standardization, that is, it is possible to configure additional control functions. The shaded position is not used to indicate the bit group of the control function in the relevant character set. b463b261
GB 8565.3-88
Figure 1 Control functions of the basic set for the page image format Note: In the seven-bit code, it is represented by the bit group 0/0~1/15. bbbab
GB 8565.3-8B
Figure 2 Basic control functions of the auxiliary set for the page image format Note: In the seven-bit code, the code of the termination group is represented by the escape sequence LSCFe, r
GB8565.3-88
Figure 3 Control functions of the basic set for the page image format and the auxiliary set: In the eight-bit code, it is represented by the bit group [00/00-01/15 and 08/~09/15] hbh
GB 8565.388
Figure 4 Control functions with parameters in auxiliary set according to the image format: In the control sequence of the non-bit group, the allocation of the termination sequence is terminated. -
GB8565.3-88
Figure 5 Control functions with parameters in auxiliary set according to the image format: There is an intermediate control sequence 2/3, the allocation of the termination sequence is terminated. 3
GB8565.8-88
LS2LSIR
Figure 6 Control functions in single control function set according to the page image format: In the seven-bit code and the eight-bit code, the escape sequence ESCF terminates the byte group code representation. 6.8 Coded representation
6.a.1 General format
GB8565.8-88
The coded representation of the control functions in the general table of control functions defined in this part of the standard is specified in 7.1, 7.2, 7.3 and 7.4. The format of the code table is as follows:
6.3.1.1 Control functions of the basic set
The basic set shall be used as a C1 set in seven-bit or eight-bit codes. In seven-bit and eight-bit codes, each control function of the basic set shall be represented by a single group in 0/0 to 1/15 (/0 to 01/15). 6.3.1.2 Basic control functions of the auxiliary set
The format of the coded representation of the basic control functions in the auxiliary set depends on whether the auxiliary set is used as a C1 set in a seven-bit code or a C1 set in a non-octavalent code.
, seven-bit code C1 set
Each basic control function in the auxiliary set is represented by the two-character escape sequence ESCFa specified in GB 2311. The first character of this sequence is the code extension control function "escape", and its bit group is 1/11. The first is a single bit group from 4/0 to 5/15, which is used to identify the control function.
b. Eight-bit code C1 set
Each basic control function in the auxiliary set is represented by a single bit group from 08/00 to 09/154. 6.3.1.3 Control functions with parameters in the auxiliary set Each control function with parameters is represented by a variable length bit group sequence specified in 6.3.2. 6.3.1.4 Control functions in a single control function set In the seven-bit and eight-bit codes, each control function in a single control function set is represented by a two-character escape sequence ESCFs. The first part of this sequence is the code verification extension control function "escape" in the basic set, and its bit group is 1/11 (01/11). The second part is a bit group in 6/0~7/14 (06/00~07/14), which is used to identify the control function. 6.3.2 Control sequence format
The control sequence format specified in this part of this standard is: CSI P1.-Pn F or CSl P1--.Pn IF. The meanings of the symbols used are as follows:
a. CS is the abbreviation of the code extension control function "control sequence leader". It is a basic control function in the auxiliary set. In the seven-bit code, the coding representation of CsI is the sequence ESC5/11: In the eight-bit code, the coding representation of CSI is the value group 09/11. b. P1+P is a byte group or a sequence of byte groups within the range of 3/0±3/9 or 3/11 (03/00~03/9 or 03/11), called a "parameter string", which represents one or more parameters of the control function. If the default parameter value is used, the parameter string can be omitted. The format of the parameter string is specified in 6.3.3.
cF is a byte group within the range of 4/0~6/15 (04/00~06/15), which terminates the control sequence. If there is an intermediate byte group, it identifies the control function together with the middle byte group I. d. I is the "interval group" 2/0 (02/00), which identifies the control function together with the end byte group F. If the control function is determined only by the end byte group F, there is no intermediate byte group.
All the byte groups following the CS leader to the termination group F (including the termination byte group) are within the range of 2/ to 6/15 (02/01 to 06/15) except the intermediate group 2/0 (02/00). Therefore, in the eight-bit code, the highest bit is "0". When converting between 7-bit and 8-bit codes, this high bit may become 1". For the bit groups in 10/01 to 14/15, these bit groups should be treated as equivalent to the corresponding bits in the range 2/1 to 6/15 (02/01 to 06/15), that is, the high bit should be ignored. Note: The above format is a simplification of the general format specified in GB5261. Some other characteristics specified in GB6261 include: ① The use of the parameter group 3/12 to 3/15 (03/12 to 03/15) ② The use of multiple bit groups and the interval group 2/1 to 2/15 (02/01 to 02/15) GB 8566.388
@) Termination bit group 7/0 to 7/14 (07/00 ~ 07/14) Usage, 6.3.3 Parameter string format
The parameter string includes one or more byte groups from 3/0 to 3/9 and 3/11 (03/00 to 03/09 and 03/11). It is interpreted in the following format,
The parameter string consists of one or more sub-parameter strings, 8
Each sub-parameter string includes at most four byte groups from 3/0 to 3/9 (03/00 to 03/09), and each byte group represents a binary number 0 -9.
Two consecutive parameter strings are separated by a byte group 3/11 (03/11). The first byte group 3/0 (03/00) is meaningless and can be omitted: d.
If the parameter string starts with the byte 3/11 (03/11), it is considered that there is an empty sub-parameter between the delimiters: if the parameter e
ends with the byte 3/11 (03/11), it is considered that there is an empty sub-parameter string after the delimiter. If the parameter string contains several consecutive byte 3/11 (03/11), it is considered that there are several empty sub-parameter strings between these delimiters.
An empty parameter string or a parameter string consisting of the byte 3/0 (0300) only represents a default value determined by the control function itself.
Parameters are of two types, namely, numeric parameters and optional parameters. Numeric parameters represent a number; optional parameters only represent a string whose meaning depends on the control function.
: Numeric parameters
In a control sequence representing a control function with a numeric parameter, the parameter string should only consist of one sub-parameter. Each numerical subparameter string contains at least one byte group of 3/1 to 3/9 (03 to 03/09), which represents a decimal number. b. Selection parameters
In a control sequence representing a control function with selection parameters, a parameter string may include multiple subparameter strings and one or more parameter separators. Each subparameter in the selection parameter represents a value of the selection parameter. Although these values are represented by numbers, they do not seem to have a quantitative meaning. Each value corresponds to an action that the control function can perform. The maximum number of selection parameter values is specified for each relevant control function.
A specific parameter value can have the same meaning as a combination of two or more independent values. Note: The above format is a simplification of the general format specified in GB5261. GB5261 also specifies some other characteristics, including the use of the original parameter string with multiple numerical values. Definition of control function
In general, the definition of control functions in this part of this standard is more specific than the definition of corresponding control functions in GB1988 and GB5261.
7.1 Format Control Characters
7.1.1 BS—Backspace
Definition: Backspace is a format control character that causes the operating position to move one character position in the opposite direction of the character. The character and line directions are determined by the parameters of the "Presentation Direction Selection" (SPD). Representation: 078 (00/08)
Note: The main purpose of "backspace" is to assist in the reverse movement of the operating position from the beginning of the line after a "carriage return". "Backspace" should not be used to produce an image of two or more graphic characters combined in a single character position, unless this use of "backspace" is part of the encoding method of graphic characters in the graphic line table used. 7.1.2 LF Line Feed
Definition: A format control character that indicates the start of a new line image. It causes the operating position to move forward to the next line, but does not move the character position. The amount of line movement depends on the most recently generated "line spacing selection". If "line spacing selection" is not given, it depends on the missing character position.
Tip: This standard content only shows part of the intercepted content of the complete standard. If you need the complete standard, please go to the top to download the complete standard document for free.