Initial revision

1986-12-10 11:40:00 +00:00 · 1986-12-10 11:40:00 +00:00 · df86573d4c
commit df86573d4c
parent 96d9890d86
3 changed files with 877 additions and 0 deletions
--- a/doc/top/Makefile
+++ b/doc/top/Makefile
@ -0,0 +1,6 @@
+top.f:
+	refer -sA+T -l4,2 -p refs.top top.n | nroff -ms > top.f
+top.f.35:
+	refer -sA+T -l4,2 -p refs.top top.n | nroff -ms -Thr35 > top.f.35
+top.f.agfa:
+	refer -sA+T -l4,2 -p refs.top top.n | nroff -ms -Tlp > top.f.agfa
--- a/doc/top/refs.top
+++ b/doc/top/refs.top
@ -0,0 +1,79 @@
+%T A Practical Toolkit for Making Portable Compilers
+%A A.S. Tanenbaum
+%A J.M. van Staveren
+%A E.G. Keizer
+%A J.W. Stevenson
+%I Vrije Universiteit, Amsterdam
+%R Rapport nr IR-74
+%D October 1981
+
+%T A Practical Toolkit for Making Portable Compilers
+%A A.S. Tanenbaum
+%A J.M. van Staveren
+%A E.G. Keizer
+%A J.W. Stevenson
+%J CACM
+%V 26
+%N 9
+%P 654-660
+%D September 1983
+
+%T A Unix Toolkit for Making Portable Compilers
+%A A.S. Tanenbaum
+%A J.M. van Staveren
+%A E.G. Keizer
+%A J.W. Stevenson
+%J Proceedings USENIX conf.
+%C Toronto, Canada
+%V 26
+%D July 1983
+%P 255-261
+
+%T Using Peephole Optimization on Intermediate Code
+%A A.S. Tanenbaum
+%A J.M. van Staveren
+%A J.W. Stevenson
+%J TOPLAS
+%V 4
+%N 1
+%P 21-36
+%D January 1982
+
+%T Amsterdam Compiler Kit documentation
+%A A.S. Tanenbaum
+%A E.G. Keizer
+%A J.M. van Staveren
+%A J.W. Stevenson
+%I Vrije Universiteit, Amsterdam
+%R Rapport nr IR-90
+%D June 1984
+
+%T Language- and Machine-independant Global Optimization on
+Intermediate Code
+%A H.E. Bal
+%A A.S. Tanenbaum
+%I Vrije Universiteit, Amsterdam
+%R Rapport IR-98
+%D March 1985
+
+%T The Design and Implementation of the EM Global Optimizer
+%A H.E. Bal
+%I Vrije Universiteit, Amsterdam
+%R Rapport IR-99
+%D March 1985
+
+
+%T The C Programming Language
+%A B.W. Kernighan
+%A D.M. Ritchie
+%I Prentice-Hall, Inc
+%C Englewood Cliffs,NJ
+%D 1978
+
+%T Principles of compiler design
+%A A.V. Aho
+%A J.D. Ullman
+%I Addison-Wesley
+%C Reading, Massachusetts
+%D 1978
+
--- a/doc/top/top.n
+++ b/doc/top/top.n
@ -0,0 +1,792 @@
+.ND
+.pl 11.7i
+.ll 80m
+.nr LL 80m
+.nr tl 78m
+.tr ~
+.ds >. .
+.TL
+The ACK Target Optimizer
+.AU
+H.E. Bal
+.AI
+Vrije Universiteit
+Wiskundig Seminarium, Amsterdam
+.AB
+The Target Optimizer is one of several optimizers that are part of
+the Amsterdam Compiler Kit.
+It operates directly on assembly code,
+rather than on a higher level intermediate code,
+as the Peephole Optimizer and Global Optimizer do.
+Consequently, the Target Optimizer can do optimizations
+that are highly machine-dependent.
+.PP
+Each target machine has its own Target Optimizer.
+New optimizers are generated by the Target Optimizer Generator,
+which uses a machine-dependent table as input.
+This document contains full information on how to
+write such a table for a new machine.
+It also discusses the implementation of the
+Target Optimizer and its generator.
+.AE
+.bp
+.NH 1
+Introduction
+.PP
+.FS
+This work was supported by the
+Stichting Technische Wetenschappen (STW)
+under grant VWI03.0001.
+.FE
+This document describes the target optimizer component
+of the Amsterdam Compiler Kit (ACK) .
+.[
+tanenbaum staveren amsterdam toolkit
+.]
+.[
+tanenbaum staveren cacm
+.]
+.[
+tanenbaum staveren toronto
+.]
+Optimization takes place in several parts of ACK compilers,
+most notably in the Peephole Optimizer
+.[
+staveren peephole toplas
+.]
+and
+the Global Optimizer,
+.[
+bal tanenbaum global optimization
+.]
+.[
+bal implementation global optimizer
+.]
+which are both language- and machine-independent,
+and in the machine-specific code generators.
+.[
+documentation amsterdam compiler kit
+.]
+The target optimizer is the finishing touch in this sequence of
+optimizers.
+It can be used to capture those optimizations that are hard
+to express in the other parts of ACK.
+These optimizations will typically be very machine-specific.
+.PP
+The target optimizer operates on the assembly code of some target machine.
+Hence there is one target optimizer per machine.
+However, just as for the ACK code generators and assemblers,
+a framework has been build that allows easy generation of
+target optimizers out of machine-independent parts and a
+machine-dependent description table (see figure 1.).
+So the major part of the code of a target optimizer is
+shared among all target optimizers.
+.DS
+
+
+                                       |-------------------------|
+                                       | machine-independent     |
+                                       | code                    |
+                                       |                         |
+          |-----------------|          |-------------------------|
+descrip-  |target optimizer |          | machine-dependent code  |
+ tion --> |generator        | ---->    | + tables                |
+table     |                 |          |                         |
+          |-----------------|          |-------------------------|
+   
+                                              target optimizer
+   
+    Figure 1: Generation of a target optimizer.
+
+.DE
+.PP
+This document focusses on the description of the machine-dependent table.
+In chapter 2 we give an informal introduction to the optimization
+algorithm and to the definition of the table format.
+Chapters 3 and 4 discuss the implementation of the target optimizer
+and the target optimizer generator.
+Appendix A gives full information for writing a description table.
+.bp
+.NH 1
+Global structure of the target optimizer
+.PP
+The target optimizer is based on the well understood model
+of a \fIpeephole optimizer\fR.
+.[
+aho ullman compiler
+.]
+It contains a machine-dependent table
+of (pattern,replacement) pairs.
+Each pattern describes
+a sequence of one or more assembler instructions
+that can be replaced by zero or more equivalent, yet cheaper,
+instructions (the 'replacement').
+The optimizer maintains a \fIwindow\fR that moves over the input.
+At any moment, the window contains some contiguous part of the input.
+If the instructions in the current window match some pattern
+in the table,
+they are replaced by the corresponding replacement;
+else, the window moves one instruction to the right.
+.PP
+In the remainder of this section we will give an informal
+description of the machine-dependent table.
+A more precise definition is given in appendix A.
+We will first discuss the restrictions put on the
+format of the assembly code.
+.NH 2
+Assumptions about the assembly code format
+.PP
+We assume that a line of assembly code begins with an
+instruction \fImnemonic\fR (opcode),
+followed by zero or more \fIoperands\fR.
+The mnemonic and the first operand must be separated by a special
+character (e.g. a space or a tab).
+Likewise, the operands must be separated by a special
+character (e.g. a comma).
+These separators need not be the same for all machines.
+.NH 2
+Informal description of the machine-dependent tables
+.PP
+The major part of the table consists of (pattern,replacement) pairs
+called \fIentries\fR.
+.PP
+A pattern is a list of instruction descriptions.
+Each instruction description describes the instruction mnemonic and
+the operands.
+.PP
+A mnemonic is described either by a string constant or by the
+keyword ANY.
+As all entities dealt with by the target optimizer are strings,
+string constants do not contain quotes.
+A string constant matches only itself.
+ANY matches every instruction mnemonic.
+.nf
+
+Examples of mnemonic descriptions:
+
+        add
+        sub.l
+        mulw3
+        ANY
+.fi
+.PP
+An operand can also be described by a string constant.
+.nf
+
+Examples:
+
+       (sp)+
+       r5
+       -4(r6)
+
+.fi
+Alternatively, it can be described by means of a \fIvariable name\fR.
+Variables have values which are strings.
+They have to be declared in the table before the patterns.
+Each such declaration defines the name of a variable and
+a \fIrestriction\fR to which its value is subjected.
+.nf
+Example of variable declarations:
+
+      CONST       { VAL[0] == '$' };
+      REG         { VAL[0] == 'r' && VAL[1] >= '0' && VAL[1] <= '3' &&
+                    VAL[2] == '\\0' };
+      X           { TRUE };
+
+.fi
+The keyword VAL denotes the value of the variable, which is
+a null-terminated string.
+An operand description given via a variable name matches an
+actual operand if the actual operand obeys the associated restriction.
+.nf
+
+     CONST  matches   $1, $-5, $foo etc.
+     REG    matches   r0, r1, r2 and r3
+     X      matches   anything
+.fi
+The restriction (between curly braces) may be any legal "C"
+.[
+kernighan ritchie c programming
+.]
+expression.
+It may also contain calls to user-defined procedures.
+These procedures must be added to the table after the patterns.
+.nf
+
+Example:
+
+     FERMAT_NUMBER    { VAL[0] == '$' && is_fermat_number(&VAL[1]) };
+
+.fi
+An operand can also be described by a mixture of a string constant
+and a variable name.
+The most general form allowed is:
+.nf
+
+       string_constant1 variable_name string_constant2
+
+Example:
+
+       (REG)+  matches  (r0)+, (r1)+, (r2)+ and (r3)+
+
+.fi
+Any of the three components may be omitted,
+so the first two forms are just special cases of the general form.
+The name of a variable can not be used as a string constant.
+In the above context, it is impossible to define an operand that
+matches the string "REG".
+This limitation is of little consequence,
+as the table writer is free to choose the names of variables.
+This approach, however, avoids the need for awkward escape sequences.
+.PP
+A pattern consists of one or more instruction descriptions
+(separated by a colon)
+followed by an optional constraint.
+A pattern "P1 : P2 : .. : Pn C" matches the sequence of
+instructions "I1 I2 .. In" if:
+.IP (i) 7
+for each i, 1 <= i <= n, Pi matches Ii, as described above;
+.IP (ii)
+multiple occurrences of the same variable name or of
+the keyword ANY stand for the same values throughout the pattern;
+.IP (iii)
+the optional constraint C is satisfied, i.e. it evaluates to TRUE.
+.LP
+.nf
+The pattern:
+
+      dec REG : move.b CONST,(REG)
+
+matches:
+
+      dec r0 : move.b $4,(r0)
+
+but not:
+
+      dec r0 : move.b $4,(r1)
+
+(as the variable REG matches two different strings).
+.fi
+If a pattern containing different registers must be described,
+extra names for a register should be declared, all sharing
+the same restriction.
+.nf
+Example:
+
+     REG1,REG2  { VAL[0] == 'r' &&  .....  };
+
+     addl3 REG1,REG1,REG2 : subl2 REG2,REG1
+.fi
+.PP
+The optional constraint is an auxiliary "C" expression (just like
+the parameter restrictions).
+The expression may refer to the variables and to ANY.
+.nf
+Example:
+
+    move REG1,REG2    { REG1[1] == REG2[1] + 1 }
+
+matches
+
+    move r1,r0
+    move r2,r1
+    move r3,r2
+.fi
+.PP
+The replacement part of a (pattern,replacement) table entry
+has the same structure as a pattern, except that:
+.IP (i)
+it may not contain an additional constraint;
+.IP (ii)
+it may be empty.
+.LP
+A replacement may also refer to the values of variables and ANY.
+.NH 2
+Examples
+.PP
+This section contains some realistic examples for
+optimization on PDP-11 and Vax assembly code.
+.NH 3
+Vax examples
+.PP
+Suppose the table contains the following declarations:
+.nf
+         X, LOG        { TRUE };
+         LAB           { VAL[0] == 'L' };   /* e.g. L0017 */
+         A             { no_side_effects(VAL) };
+         NUM           { is_number(VAL) };
+
+.fi
+The procedure "no_side_effects" checks if its argument
+contains any side effects, i.e. auto increment or auto decrement.
+The procedure "is_number" checks if its argument contains only digits.
+These procedures must be supplied by the table-writer and must be
+included in the table.
+.PP
+.nf
+\fIentry:\fR  addl3 X,A,A    -> addl2 X,A;
+
+.fi
+This entry changes a 3-operand instruction into a cheaper  2-operand
+instruction.
+An optimization like:
+.nf
+
+        addl3 r0,(r2)+,(r2)+   -> addl2 r0,(r2)+
+
+.fi
+is illegal, as r2 should be incremented twice.
+Hence the second argument is required to
+be side-effect free.
+.PP
+.nf
+\fIentry:\fR  addw2 $-NUM,X  -> subw2 $NUM,X;
+
+.fi
+An instruction like "subw2 $5,r0" is cheaper
+than "addw2 $-5,r0",
+because constants in the range 0 to 63 are represented
+very efficiently on the Vax.
+.PP
+.nf
+\fIentry:\fR  bitw $NUM,A : jneq LAB
+                { is_poweroftwo(NUM,LOG) }  -> jbs $LOG,A,LAB;
+
+.fi
+A "bitw x,y" sets the condition codes to the bitwise "and" of
+x and y.
+A "jbs n,x,l" branches to l if bit n of x is set.
+So, for example, the following transformation is possible:
+.nf
+
+      bitw $32,r0 : jneq L0017 ->  jbs $5,r0,L0017
+
+.fi
+The user-defined  procedure "is_poweroftwo" checks if its first argument is
+a power of 2 and, if so, sets its second argument to the logarithm
+of the first argument. (Both arguments are strings).
+Note that the variable LOG is not used in the pattern itself.
+It is assigned a (string) value by "is_poweroftwo" and is used
+in the replacement.
+.NH 3
+PDP-11 examples
+.PP
+Suppose we have the following declarations:
+.nf
+         X             { TRUE };
+         A             { no_side_effects(VAL) };
+         L1, L2        { VAL[0] == 'I' };
+         REG           { VAL[0] == 'r' && VAL[1] >= '0' && VAL[1] <= '5' &&
+                         VAL[2] == '\\0' };
+
+.fi
+The implementation of "no_side_effects" may of course
+differ for the PDP-11 and the Vax.
+.PP
+.nf
+\fIentry:\fR  mov REG,A : ANY A,X  ->  mov REG,A : ANY REG,X ;
+
+.fi
+This entry implements register subsumption.
+If A and REG hold the same value (which is true after "mov REG,A")
+and A is used as source (first) operand, it is cheaper to use REG instead.
+.PP
+.nf
+\fIentry:\fR  jeq L1 : jbr L2 : labdef L1  ->  jne L2 : labdef L1;
+
+.fi
+The "jeq L1" is a "skip over an unconditional jump". "labdef L1"
+denotes the definition (i.e. defining occurrence) of label L1.
+As the target optimizer has to know how such a definition
+looks like, this must be expressed in the table (see Appendix A).
+.PP
+.nf
+\fIentry:\fR  add $01,X { carry_dead(REST) }  -> inc X;
+
+.fi
+On the PDP-11, an add-one is not equivalent to an increment.
+The latter does not set the carry-bit of the condition codes,
+while the former does.
+So a look-ahead is needed to see if the rest of the input uses
+the carry-bit before changing the condition codes.
+A look-ahead of one instruction is provided by
+the target optimizer.
+This will normally be sufficient for compiler-generated code.
+The keyword REST contains the mnemonic of the first instruction of
+the rest of the input.
+If this instruction uses the carry-bit (e.g. an adc, subc, bhis)
+the transformation is not allowed.
+.bp
+.NH 1
+Implementation of the target optimizer
+.PP
+The target optimizer reads one input file of assembler instructions,
+processes it, and writes the optimized code
+to the output file.
+So it performs one pass over the input.
+.NH 2
+The window mechanism
+.PP
+The optimizer uses a \fIwindow\fR that moves over the input.
+It repeatedly tries to match the instructions in the window
+with the patterns in the table.
+If no match is possible, the window moves
+one instruction forwards (to the right).
+After a successful match the matched instructions are
+removed from the window and are replaced by the
+replacement part of the table entry.
+Furthermore, the window is moved a few instructions
+backwards,
+as it is possible that instructions that were rejected earlier now do match.
+For example, consider the following patterns:
+.DS
+cmp $0, X           -> tst X ;
+mov REG,X : tst X   -> move REG.X ;   /* redundant test */
+.DE
+If the input is:
+.DS
+mov r0,foo : cmp $0,foo
+.DE
+then the first instruction is initially rejected.
+However, after the transformation
+.DS
+cmp $0,foo   ->  tst foo
+.DE
+the following optimization is possible:
+.DS
+mov r0,foo : tst foo  ->  mov r0,foo
+.DE
+.PP
+The window is implemented a a \fIqueue\fR.
+Matching takes place at the head of the queue.
+New instructions are added at the tail.
+If the window is moved forwards, the instruction at the head
+is not yet written to the output,
+as it may be needed later on.
+Instead it is added to a second queue,
+the \fIbackup queue\fR.
+After a successful match, the entire backup queue is
+inserted at the front of the window queue,
+which effectively implements the shift backwards.
+.PP
+Both queues have the length of the longest pattern in the table.
+If, as a result of a forward window move,
+the backup queue gets full,
+the instruction at its head is outputted and removed.
+Instructions are read from the input whenever the
+window queue contains fewer elements than the length
+of the longest pattern.
+.NH 2
+Pattern matching
+.PP
+Pattern matching is done in three steps:
+.IP (i) 7
+find patterns in the table whose instruction mnemonics
+match the mnemonics of the instructions in the
+current window;
+.IP (ii)
+check if the operands of the pattern match the operands of the
+instructions in the current window;
+.IP (iii)
+check if the optional constraint is satisfied.
+.LP
+For step (i) hashing is used.
+The mnemonic of the first instruction of the window
+is used to determine a list of possible patterns.
+Patterns starting with ANY are always tried.
+.PP
+Matching of operand descriptions against actual operands
+takes place as follows.
+The general form of an operand description is:
+.DS
+string_constant1 variable_name string_constant2
+.DE
+The actual operand should begin with string_constant1 and end
+on string_constant2.
+If so, these strings are stripped from it and the remaining string is
+matched against the variable.
+Matching a string against a variable is
+defined as follows:
+.IP 1.
+initially (before the entire pattern match)
+all variables are uninstantiated;
+.IP 2.
+matching a string against an uninstantiated variable
+succeeds if the restriction associated with the variable is
+satisfied.
+As a side effect, it causes the variable to be instantiated to
+the string;
+.IP 3.
+matching a string against an instantiated variable succeeds
+only if the variable was instantiated to the same string.
+.LP
+Matching an actual mnemonic against the keyword ANY is defined likewise.
+.PP
+The matching scheme implements the requirement that multiple occurrences
+of the same variable name or of the keyword ANY should
+stand for the same values throughout the entire pattern
+(see section 2.).
+.PP
+Both the parameter restriction of 2. and the constraint of step (iii)
+are checked by executing the "C" expression.
+.NH 2
+Data structures
+.PP
+The most important data structure is the representation
+of the input instructions.
+For every instruction we use two representations:
+.IP (i)
+the textual representation,
+i.e. the exact code as it appeared in the input;
+.IP (ii)
+a structural representation,
+containing the opcode and the operands.
+.LP
+The opcode of an instruction is determined as soon as it is read.
+If the line contains a label definition, the opcode is set
+to "labdef", so a label definition is treated like a normal
+instruction.
+.PP
+The operands of an instruction are not determined until
+they are needed, i.e. until step (i) of the pattern matching
+process has succeeded.
+For every instruction we keep track of a \fIstate\fR.
+After the opcode has successfully been determined,
+the state is OPC_ONLY.
+Once the operands have been recognized, the state is set to DONE.
+If the opcode or operands can not be determined,
+or if the instruction cannot be optimized for any other
+reason (see Appendix A), the state is set to JUNK
+and any attempt to match it will fail.
+.PP
+For each table entry we record the following information:
+.IP (i) 7
+the length of the pattern (i.e. the number of instruction descriptions)
+.IP (ii)
+a description of the instructions of the pattern
+.IP (iii)
+the length of the replacement
+.IP (iv)
+a description of the instructions of the replacement.
+.LP
+The description of an instruction consists of:
+.IP (i)
+the opcode
+.IP (ii)
+for each operand, a description of the operand.
+.LP
+The description of an operand of the form:
+.DS
+string_constant1 variable_name string_constant2
+.DE
+contains:
+.IP (i)
+both string constants
+.IP (ii)
+the number of the variable.
+.LP
+Each declared variable is assigned a unique number.
+For every variable we maintain:
+.IP (i)
+its state (instantiated or not instantiated)
+.IP (ii)
+its current value (a string).
+.LP
+The restrictions on variables and the constraints are stored
+in a switch-statement,
+indexed by variable number and entry number respectively.
+.bp
+.NH 1
+Implementation of the target optimizer generator
+.PP
+The target optimizer generator (\fItopgen\fR)
+reads a target machine description table and produces
+two files:
+.IP gen.h: 9
+contains macro definitions for
+machine parameters that were changed
+in the parameter section of the table (see appendix A)
+and for some attributes derived from the table
+(longest pattern, number of patterns, number
+of variables).
+.IP gen.c:
+contains the entry description tables,
+code for checking the parameter restrictions and constraints
+(switch statements)
+and the user-defined procedures.
+.LP
+These two files are compiled together with some machine-independent
+files to produce a target optimizer.
+.PP
+Topgen is implemented using 
+the LL(1) parser generator system LLgen,
+a powerful tool of the Amsterdam Compiler Kit.
+This system provides a flexible way of describing the syntax of the tables.
+The syntactical description of the table format included
+in Appendix A was derived from the LLgen syntax rules.
+.PP
+The parser uses a simple, hand-written, lexical analyzer (scanner).
+The scanner returns a single character in most cases.
+The recognition of identifiers is left to the parser, as
+this eases the analysis of operand descriptions.
+Comments are removed from the input by the scanner,
+but white space is passed to the parser,
+as it is meaningful in some contexts (it separates the
+opcode description from the description of the first operand).
+.PP
+Topgen maintains two symbol tables, one for variable names and one
+for tunable parameters.
+The symbol tables are organized as binary trees.
+.bp
+.SH
+Appendix A
+.PP
+In this appendix we present a complete definition of the target
+optimizer description table format.
+This appendix is intended for table-writers.
+We use syntax rules for the description of the table format.
+The following notation is used:
+.nf
+      { a }      zero or more of a
+      [ a ]      zero or one of a
+      a b        a followed by b
+      a | b      a or b
+
+.fi
+Terminals are given in quotes, as in ';'.
+.PP
+The table may contain white space and comment at all reasonable places.
+Comments are as in "C", so they begin with /* and end on */.
+Identifiers are sequences of letters, digits and the underscore ('_'),
+beginning with a letter.
+.PP
+.DS
+table   ->   {parameter_line} '%%;' {variable_declaration} '%%;'
+             {entry} '%%;' user_routines.
+
+.DE
+A table consists of four sections, containing machine-dependent
+constants, variable declarations, pattern rules and
+user-supplied subroutines.
+.PP
+.DS
+parameter_line ->  identifier value ';' .
+
+.DE
+A parameter line defines some attributes of the target machines
+assembly code.
+For unspecified parameters default values apply.
+The names of the parameters and the corresponding defaults
+are shown in table 1.
+.DS
+       OPC_TERMINATOR       ' '
+       OP_SEPARATOR         ','
+       LABEL_STARTER        'I'
+       LABEL_TERMINATOR     ':'
+       MAXOP                  2
+       MAXOPLEN              25
+       MAX_OPC_LEN           10
+       MAXVARLEN             25
+       MAXLINELEN           100
+
+       table 1: parameter names and defaults
+.DE
+The OPC_TERMINATOR is the character that separates the instruction
+mnemonic from the first operand (if any).
+The OP_SEPARATOR separates adjacent operands.
+A LABEL_STARTER is the first character of an instruction label.
+(Instruction labels are assumed to start with the same character).
+The LABEL_TERMINATOR is the last character of a label definition.
+It is assumed that this character is not used in an applied
+occurrence of the label identifier.
+For example, the defining occurrence may be "I0017:"
+and the applied occurrence may be "I0017"
+as in "jmp I0017".
+MAXOP defines the maximum number of operands an instruction can have.
+MAXOPLEN is the maximum length (in characters) of an operand.
+MAX_OPC_LEN is the maximum length of an instruction opcode.
+MAXVARLEN is the maximum length of a declared string variable.
+As variables may be set by user routines (see "bitw" example for
+the Vax) the table-writer must have access to this length and
+must be able to change it.
+MAXLINELEN denotes the maximum length of a line of assembly code.
+.PP
+If a line of assembly code violates any of the assumptions or
+exceeds some limit,
+the line is not optimized.
+Optimization does, however, proceed with the rest of the input.
+.PP
+.DS
+variable_declaration  -> identifier {',' identifier} restriction ';' .
+
+restriction           ->  '{' anything '}' .
+.DE
+A variable declaration declares one or more string variables
+that may be used in the patterns and in the replacements.
+If a variable is used as part of an operand description in
+a pattern, the entire pattern can only match if the
+restriction evaluates to TRUE.
+If the pattern does match, the variable is assigned the matching
+part of the actual operand.
+Variables that are not used in a pattern are initialized to
+null-strings and may be assigned a value in the constraint-part of
+the pattern.
+.PP
+The restriction must be a legal "C" expression.
+It may not contain a closing bracket ('}').
+Inside the expression, the name VAL stands for the part of the actual
+(matching) operand.
+The expression may contain calls to procedures that are defined in the
+user-routines section.
+.DS
+entry             ->  pattern '->' replacement ';' .
+
+pattern           ->  instruction_descr
+		      { ':' instruction_descr }
+		      constraint .
+
+replacement       ->  [ instruction_descr { ':' instruction_descr } ] .
+
+instruction_descr -> opcode
+		     white
+		     [ operand_descr { ',' operand_descr } ] .
+
+constraint        -> '{' anything '}' .
+
+operand_descr     -> [ string_constant ]
+		     [ variable_name ]
+		     [ string_constant ] .
+
+variable_name     -> identifier .
+
+opcode            -> anything .
+.DE
+The symbol 'white' stands for white space (space or tab).
+An opcode can be any string not containing the special
+symbols ';', '{', '}', ':', ',', '->' or white space.
+To be recognized, it must begin with a letter.
+The opcode should either be a mnemonic of a target machine
+instruction or it should be one of the keywords ANY and labdef.
+ANY matches any actual opcode. labdef matches only label definitions.
+.PP
+If an operand description contains an identifier (as defined earlier),
+it is checked if the identifier is the name of a declared variable.
+This effects the semantics of the matching rules for the operand,
+as described in section 2.
+An operand may contain at most one such variable name.
+.PP
+The constraint must be a legal "C" expression, just as the operand restriction.
+It may call user-defined procedures and use or change the value of
+declared variables.
+It may also use the string variable REST,
+which contains the mnemonic of the first instruction of the
+rest of the input. (REST is a null-string if this mnemonic can
+not be determined).
+.DS
+user_routines -> anything .
+.DE
+The remainder of the table consists of user-defined subroutines.
+.bp
+.[
+$LIST$
+.]