430 lines
16 KiB
Text
430 lines
16 KiB
Text
.\" $Header$
|
||
.nr PD 1v
|
||
.tr ~
|
||
.TL
|
||
Ack Description File
|
||
.br
|
||
Reference Manual
|
||
.AU
|
||
Ed Keizer
|
||
.AI
|
||
Vakgroep Informatica
|
||
Vrije Universiteit
|
||
Amsterdam
|
||
.NH
|
||
Introduction
|
||
.PP
|
||
The program \fIack\fP(I) internally maintains a table of
|
||
possible transformations and a table of string variables.
|
||
The transformation table contains one entry for each possible
|
||
transformation of a file.
|
||
Which transformations are used depends on the suffix of the
|
||
source file.
|
||
Each transformation table entry tells which input suffixes are
|
||
allowed and what suffix/name the output file has.
|
||
When the output file does not already satisfy the request of the
|
||
user, with the flag \fB\-c.suffix\fP, the table is scanned
|
||
starting with the next transformation in the table for another
|
||
transformation that has as input suffix the output suffix of
|
||
the previous transformation.
|
||
A few special transformations are recognized, among them is the
|
||
combiner, which is
|
||
a program combining several files into one.
|
||
When no stop suffix was specified (flag \fB\-c.suffix\fP) \fIack\fP
|
||
stops after executing the combiner with as arguments the \-
|
||
possibly transformed \- input files and libraries.
|
||
\fIAck\fP will only perform the transformations in the order in
|
||
which they are presented in the table.
|
||
.LP
|
||
The string variables are used while creating the argument list
|
||
and program call name for
|
||
a particular transformation.
|
||
.NH
|
||
Which descriptions are used
|
||
.PP
|
||
\fIAck\fP always uses two description files: one to define the
|
||
front-end transformations and one for the machine dependent
|
||
back-end transformations.
|
||
Each description has a name.
|
||
First the way of determining
|
||
the name of the descriptions needed is described.
|
||
.PP
|
||
When the shell environment variable ACKFE is set \fIack\fP uses
|
||
that to determine the front-end table name, otherwise it uses
|
||
\fBfe\fP.
|
||
.PP
|
||
The way the backend table name is determined is more
|
||
convoluted.
|
||
.br
|
||
First, when the last filename in the program call name is not
|
||
one of \fIack\fP, \fIcc\fP, \fIacc\fP, \fIpc\fP or \fIapc\fP,
|
||
this filename is used as the backend description name.
|
||
Second, when the \fB\-m\fP is present the \fB\-m\fP is chopped of this
|
||
flag and the rest is used as the backend description name.
|
||
Third, when both failed the shell environment variable ACKM is
|
||
used.
|
||
Last, when also ACKM was not present the default backend is
|
||
used, determined by the definition of ACKM in h/local.h.
|
||
The presence and value of the definition of ACKM is
|
||
determined at compile time of \fIack\fP.
|
||
.PP
|
||
Now, we have the names, but that is only the first step.
|
||
\fIAck\fP stores a few descriptions at compile time.
|
||
This descriptions are simply files read in at compile time.
|
||
At the moment of writing this document, the descriptions
|
||
included are: pdp, fe, i86, m68k2, vax2 and int.
|
||
The name of a description is first searched for internally,
|
||
then in lib/descr/\fIname\fP, then in
|
||
lib/\fIname\fP/descr, band finally in the current
|
||
directory of the user.
|
||
.NH
|
||
Using the description file
|
||
.PP
|
||
Before starting on a narrative of the description file,
|
||
the introduction of a few terms is necessary.
|
||
All these terms are used to describe the scanning of zero
|
||
terminated strings, thereby producing another string or
|
||
sequence of strings.
|
||
.IP Backslashing 5
|
||
.br
|
||
All characters preceded by \e are modified to prevent
|
||
recognition at further scanning.
|
||
This modification is undone before a string is passed to the
|
||
outside world as argument or message.
|
||
When reading the description files the
|
||
sequences \e\e, \e# and \e<newline> have a special meaning.
|
||
\e\e translates to a single \e, \e# translates to a single #
|
||
that is not
|
||
recognized as the start of comment, but can be used in
|
||
recognition and finally, \e<newline> translates to nothing at
|
||
all, thereby allowing continuation lines.
|
||
.nr PD 0
|
||
.IP "Variable replacement"
|
||
.br
|
||
The scan recognizes the sequences {{, {NAME} and {NAME?text}
|
||
Where NAME can be any combination if characters excluding ? and
|
||
} and text may be anything excluding }.
|
||
(~\e} is allowed of course~)
|
||
The first sequence produces an unescaped single {.
|
||
The second produces the contents of the NAME, definitions are
|
||
done by \fIack\fP and in description files.
|
||
When the NAME is not defined an error message is produced on
|
||
the diagnostic output.
|
||
The last sequence produces the contents of NAME if it is
|
||
defined and text otherwise.
|
||
.PP
|
||
.IP "Expression replacement"
|
||
.br
|
||
Syntax: (\fIsuffix sequence\fP:\fIsuffix sequence\fP=\fItext\fP)
|
||
.br
|
||
Example: (.c.p.e:.e=tail_em)
|
||
.br
|
||
If the two suffix sequences have a common member \-~\&.e in this
|
||
case~\- the text is produced.
|
||
When no common member is present the empty string is produced.
|
||
Thus the example given is a constant expression.
|
||
Normally, one of the suffix sequences is produced by variable
|
||
replacement.
|
||
\fIAck\fP sets three variables while performing the diverse
|
||
transformations: HEAD, TAIL and RTS.
|
||
All three variables depend on the properties \fIrts\fP and
|
||
\fIneed\fP from the transformations used.
|
||
Whenever a transformation is used for the first time,
|
||
the text following the \fIneed\fP is appended to both the HEAD and
|
||
TAIL variable.
|
||
The value of the variable RTS is determined by the first
|
||
transformation used with a \fIrts\fP property.
|
||
.IP
|
||
Two runtime flags have effect on the value of one or more of
|
||
these variables.
|
||
The flag \fB\-.suffix\fP has the same effect on these three variables
|
||
as if a file with that \fBsuffix\fP was included in the argument list
|
||
and had to be translated.
|
||
The flag \fB\-r.suffix\fP only has that effect on the TAIL
|
||
variable.
|
||
The program call names \fIacc\fP and \fIcc\fP have the effect
|
||
of an automatic \fB\-.c\fP flag.
|
||
\fIApc\fP and \fIpc\fP have the effect of an automatic \fB\-.p\fP flag.
|
||
.IP "Line splitting"
|
||
.br
|
||
The string is transformed into a sequence of strings by replacing
|
||
the blank space by string separators (nulls).
|
||
.IP "IO replacement"
|
||
.br
|
||
The > in the string is replaced by the output file name.
|
||
The < in the string is replaced by the input file name.
|
||
When multiple input files are present the string is duplicated
|
||
for each input file name.
|
||
.nr PD 1v
|
||
.LP
|
||
Each description is a sequence of variable definitions followed
|
||
by a sequence of transformation definitions.
|
||
Variable definitions use a line each, transformations
|
||
definitions consist of a sequence of lines.
|
||
Empty lines are discarded, as are lines with nothing but
|
||
comment.
|
||
Comment is started by a # character, and continues to the end
|
||
of the line.
|
||
Three special two-characters sequences exist: \e#, \e\e and
|
||
\e<newline>.
|
||
Their effect is described under 'backslashing' above.
|
||
Each \- nonempty \- line starts with a keyword, possibly
|
||
preceded by blank space.
|
||
The keyword can be followed by a further specification.
|
||
The two are separated by blank space.
|
||
.PP
|
||
Variable definitions use the keyword \fIvar\fP and look like this:
|
||
.DS X
|
||
var NAME=text
|
||
.DE
|
||
The name can be any identifier, the text may contain any
|
||
character.
|
||
Blank space before the equal sign is not part of the NAME.
|
||
Blank space after the equal is considered as part of the text.
|
||
The text is scanned for variable replacement before it is
|
||
associated with the variable name.
|
||
.br
|
||
.sp 2
|
||
The start of a transformation definition is indicated by the
|
||
keyword \fIname\fP.
|
||
The last line of such a definition contains the keyword
|
||
\fIend\fP.
|
||
The lines in between associate properties to a transformation
|
||
and may be presented in any order.
|
||
The identifier after the \fIname\fP keyword determines the name
|
||
of the transformation.
|
||
This name is used for debugging and by the \fB\-R\fP flag.
|
||
The keywords are used to specify which input suffices are
|
||
recognized by that transformation,
|
||
the program to run, the arguments to be handed to that program
|
||
and the name or suffix of the resulting output file.
|
||
Two keywords are used to indicate which run-time startoffs and
|
||
libraries are needed.
|
||
The possible keywords are:
|
||
.IP \fIfrom\fP
|
||
.br
|
||
followed by a sequence of suffices.
|
||
Each file with one of these suffices is allowed as input file.
|
||
Preprocessor transformations do not need the \fIfrom\fP
|
||
keyword. All other transformations do.
|
||
.nr PD 0
|
||
.IP \fIto\fP
|
||
.br
|
||
followed by the suffix of the output file name or in the case of a
|
||
linker
|
||
the output file name.
|
||
.IP \fIprogram\fP
|
||
.br
|
||
followed by name of the load file of the program, a pathname most likely
|
||
starts with either a / or {EM}.
|
||
This keyword must be
|
||
present, the remainder of the line
|
||
is subject to backslashing and variable replacement.
|
||
.IP \fImapflag\fP
|
||
.br
|
||
The mapflags are used to grab flags given to \fIack\fP and
|
||
pass them on to a specific transformation.
|
||
This feature uses a few simple pattern matching and replacement
|
||
facilities.
|
||
Multiple occurences of this keyword are allowed.
|
||
This text following the keyword is
|
||
subjected to backslashing.
|
||
The keyword is followed by a match expression and a variable
|
||
assignment separated by blank space.
|
||
As soon as both description files are read, \fIack\fP looks
|
||
at all transformations in these files to find a match for the
|
||
flags given to \fIack\fP.
|
||
The flags \fB\-m\fP, \fB\-o\fP,
|
||
\fB\-O\fP, \fB\-r\fP, \fB\-v\fP, \fB\-g\fP, \-\fB\-c\fP, \fB\-t\fP,
|
||
\fB\-k\fP, \fB\-R\fP and \-\fB\-.\fP are specific to \fIack\fP and
|
||
not handed down to any transformation.
|
||
The matching is performed in the order in which the entries
|
||
appear in the definition.
|
||
The scanning stops after first match is found.
|
||
When a match is found, the variable assignment is executed.
|
||
A * in the match expression matches any sequence of characters,
|
||
a * in the right hand part of the assignment is
|
||
replaced by the characters matched by
|
||
the * in the expression.
|
||
The right hand part is also subject to variable replacement.
|
||
The variable will probably be used in the program arguments.
|
||
The \fB\-l\fP flags are special,
|
||
the order in which they are presented to \fIack\fP must be
|
||
preserved.
|
||
The identifier LNAME is used in conjunction with the scanning of
|
||
\fB\-l\fP flags.
|
||
The value assigned to LNAME is used to replace the flag.
|
||
The example further on shows the use all this.
|
||
.IP \fIargs\fP
|
||
.br
|
||
The keyword is followed by the program call arguments.
|
||
It is subject to backslashing, variable replacement, expression
|
||
replacement, line splitting and IO replacement.
|
||
The variables assigned to by \fImapflags\fP will probably be
|
||
used here.
|
||
The flags not recognized by \fIack\fP or any of the transformations
|
||
are passed to the linker and inserted before all other arguments.
|
||
.IP \fIstdin\fP
|
||
.br
|
||
This keyword indicates that the transformation reads from standard input.
|
||
.IP \fIstdout\fP
|
||
.br
|
||
This keyword indicates that the transformation writes on standard output.
|
||
.IP \fIoptimizer\fP
|
||
.br
|
||
This keyword indicates that this transformation is an optimizer.
|
||
.IP \fIlinker\fP
|
||
.br
|
||
This keyword indicates that this transformation is the linker.
|
||
.IP \fIcombiner\fP
|
||
.br
|
||
This keyword indicates that this transformation is a combiner. A combiner
|
||
is a program combining several files into one, but is not a linker.
|
||
An example of a combiner is the global optimizer.
|
||
.IP \fIprep\fP
|
||
.br
|
||
This \-~optional~\- keyword is followed an option indicating its relation
|
||
to the preprocessor.
|
||
The possible options are:
|
||
.DS X
|
||
always the input files must be preprocessed
|
||
cond the input files must be preprocessed when starting with #
|
||
is this transformation is the preprocessor
|
||
.DE
|
||
.IP \fIrts\fP
|
||
.br
|
||
This \-~optional~\- keyword indicates that the rest of the line must be
|
||
used to set the variable RTS, if it was not already set.
|
||
Thus the variable RTS is set by the first transformation
|
||
executed which such a property or as a result from \fIack\fP's program
|
||
call name (acc, cc, apc or pc) or by the \fB\-.suffix\fP flag.
|
||
.IP \fIneed\fP
|
||
.br
|
||
This \-~optional~\- keyword indicates that the rest of the line must be
|
||
concatenated to the NEEDS variable.
|
||
This is done once for every transformation used or indicated
|
||
by one of the program call names mentioned above or indicated
|
||
by the \fB\-.suffix\fP flag.
|
||
.br
|
||
.nr PD 1v
|
||
.NH
|
||
Conventions used in description files
|
||
.PP
|
||
\fIAck\fP reads two description files.
|
||
A few of the variables defined in the machine specific file
|
||
are used by the descriptions of the front-ends.
|
||
Other variables, set by \fIack\fP, are of use to all
|
||
transformations.
|
||
.PP
|
||
\fIAck\fP sets the variable EM to the home directory of the
|
||
Amsterdam Compiler Kit.
|
||
The variable SOURCE is set to the name of the argument that is currently
|
||
being massaged, this is usefull for debugging.
|
||
The variable SUFFIX is set to the suffix of the argument that is
|
||
currently being massaged.
|
||
.br
|
||
The variable M indicates the
|
||
directory in lib/{M}/tail_..... and NAME is the string to
|
||
be defined by the preprocessor with \-D{NAME}.
|
||
The definitions of {w}, {s}, {l}, {d}, {f} and {p} indicate
|
||
EM_WSIZE, EM_SSIZE, EM_LSIZE, EM_DSIZE, EM_FSIZE and EM_PSIZE
|
||
respectively.
|
||
.br
|
||
The variable INCLUDES is used as the last argument to \fIcpp\fP.
|
||
It is used to add directories to
|
||
the list of directories containing #include files.
|
||
.PP
|
||
The variables HEAD, TAIL and RTS are set by \fIack\fP and used
|
||
to compose the arguments for the linker.
|
||
.NH
|
||
Example
|
||
.PP
|
||
Description for front-end
|
||
.DS X
|
||
.ta 4n 40n
|
||
name cpp # the C-preprocessor
|
||
# no from, it's governed by the P property
|
||
to .i # result files have suffix i
|
||
program {EM}/lib/cpp # pathname of loadfile
|
||
mapflag \-I* CPP_F={CPP_F?} \-I* # grab \-I.. \-U.. and
|
||
mapflag \-U* CPP_F={CPP_F?} \-U* # \-D.. to use as arguments
|
||
mapflag \-D* CPP_F={CPP_F?} \-D* # in the variable CPP_F
|
||
args {CPP_F?} {INCLUDES?} \-D{NAME} \-DEM_WSIZE={w} \-DEM_PSIZE={p} \e
|
||
\-DEM_SSIZE={s} \-DEM_LSIZE={l} \-DEM_FSIZE={f} \-DEM_DSIZE={d} <
|
||
# The arguments are: first the \-[IUD]...
|
||
# then the include dir's for this machine
|
||
# then the NAME and size valeus finally
|
||
# followed by the input file name
|
||
stdout # Output on stdout
|
||
prep is # Is preprocessor
|
||
end
|
||
name cem # the C-compiler proper
|
||
from .c # used for files with suffix .c
|
||
to .k # produces compact code files
|
||
program {EM}/lib/em_cem # pathname of loadfile
|
||
mapflag \-p CEM_F={CEM_F?} \-Xp # pass \-p as \-Xp to cem
|
||
mapflag \-L CEM_F={CEM_F?} \-l # pass \-L as \-l to cem
|
||
args \-Vw{w}i{w}p{p}f{f}s{s}l{l}d{d} {CEM_F?}
|
||
# the arguments are the object sizes in
|
||
# the \-V... flag and possibly \-l and \-Xp
|
||
stdin # input from stdin
|
||
stdout # output on stdout
|
||
prep always # use cpp
|
||
rts .c # use the C run-time system
|
||
need .c # use the C libraries
|
||
end
|
||
name decode # make human readable files from compact code
|
||
from .k.m # accept files with suffix .k or .m
|
||
to .e # produce .e files
|
||
program {EM}/lib/em_decode # pathname of loadfile
|
||
args < # the input file name is the only argument
|
||
stdout # the output comes on stdout
|
||
end
|
||
.DE
|
||
|
||
.DS X
|
||
.ta 4n 40n
|
||
Example of a backend, in this case the EM assembler/loader.
|
||
|
||
var w=2 # wordsize 2
|
||
var p=2 # pointersize 2
|
||
var s=2 # short size 2
|
||
var l=4 # long size 4
|
||
var f=4 # float size 4
|
||
var d=8 # dou<6F><75>XY<17>H<EFBFBD>\<5C>و<19>[H<1B>e startoff
|
||
var SIZE_FLAG=\-sm # default internal table size flag
|
||
var INCLUDES=\-I{EM}/include # use {EM}/include for #include files
|
||
name asld # Assembler/loader
|
||
from .k.m.a # accepts compact code and archives
|
||
to e.out # output file name
|
||
program {EM}/lib/em_ass # load file pathname
|
||
mapflag \-l* LNAME={EM}/{LIB}* # e.g. \-ly becomes
|
||
# {EM}/mach/int/lib/tail_y
|
||
mapflag \-+* ASS_F={ASS_F?} \-+* # recognize \-+ and \-\-
|
||
mapflag \-\-* ASS_F={ASS_F?} \-\-*
|
||
mapflag \-s* SIZE_FLAG=\-s* # overwrite old value of SIZE_FLAG
|
||
args {SIZE_FLAG} \e
|
||
({RTS}:.c={EM}/{RT}cc) ({RTS}:.p={EM}/{RT}pc) \-o > < \e
|
||
(.p:{TAIL}={EM}/{LIB}pc) \e
|
||
(.c:{TAIL}={EM}/{LIB}cc.1s {EM}/{LIB}cc.2g) \e
|
||
(.c.p:{TAIL}={EM}/{LIB}mon)
|
||
# \-s[sml] must be first argument
|
||
# the next line contains the choice for head_cc or head_pc
|
||
# and the specification of in- and output.
|
||
# the last three args lines choose libraries
|
||
linker
|
||
end
|
||
.DE
|
||
|
||
The command \fIack \-mint \-v \-v \-I../h \-L \-ly prog.c\fP
|
||
would result in the following
|
||
calls (with exec(II)):
|
||
.DS X
|
||
.ta 4n
|
||
1) /lib/cpp \-I../h \-I/usr/em/include \-Dint22 \-DEM_WSIZE=2 \-DEM_PSIZE=2 \e
|
||
\-DEM_SSIZE=2 \-DEM_LSIZE=4 \-DEM_FSIZE=4 \-DEM_DSIZE=8 prog.c
|
||
2) /usr/em/lib/em_cem \-Vw2i2p2f4s2l4d8 \-l
|
||
3) /usr/em/lib/em_ass \-sm /usr/em/mach/int/lib/head_cc \-o e.out prog.k
|
||
/usr/em/mach/int/lib/tail_y /usr/em/mach/int/lib/tail_cc.1s
|
||
/usr/em/mach/int/lib/tail_cc.2g /usr/em/mach/int/lib/tail_mon
|
||
.DE
|