421 lines
17 KiB
Plaintext
421 lines
17 KiB
Plaintext
.\" $Header$
|
|
.nr LL 7.5i
|
|
.tr ~
|
|
.nr PD 1v
|
|
.TL
|
|
Ack Description File
|
|
.br
|
|
Reference Manual
|
|
.AU
|
|
Ed Keizer
|
|
.AI
|
|
Wiskundig Seminarium
|
|
Vrije Universiteit
|
|
Amsterdam
|
|
.NH
|
|
Introduction
|
|
.PP
|
|
The program \fIack\fP(I) internally maintains a table of
|
|
possible transformations and a table of string variables.
|
|
The transformation table contains one entry for each possible
|
|
transformation of a file.
|
|
Which transformations are used depends on the suffix of the
|
|
source file.
|
|
Each transformation table entry tells which input suffixes are
|
|
allowed and what suffix/name the output file has.
|
|
When the output file does not already satisfy the request of the
|
|
user, with the flag \fB-c.suffix\fP, the table is scanned
|
|
starting with the next transformation in the table for another
|
|
transformation that has as input suffix the output suffix of
|
|
the previous transformation.
|
|
A few special transformations are recognized, among them is the
|
|
combiner.
|
|
A program combining several files into one.
|
|
When no stop suffix was specified (flag \fB-c.suffix\fP) \fIack\fP
|
|
stops after executing the combiner with as arguments the -
|
|
possibly transformed - input files and libraries.
|
|
\fIAck\fP will only perform the transformations in the order in
|
|
which they are presented in the table.
|
|
.LP
|
|
The string variables are used while creating the argument list
|
|
and program call name for
|
|
a particular transformation.
|
|
.NH
|
|
Which descriptions are used
|
|
.PP
|
|
\fIAck\fP always uses two description files: one to define the
|
|
front-end transformations and one for the machine dependent
|
|
back-end transformations.
|
|
Each description has a name.
|
|
First the way of determining
|
|
the name of the descriptions needed is described.
|
|
.PP
|
|
When the shell environment variable ACKFE is set \fIack\fP uses
|
|
that to determine the front-end table name, otherwise it uses
|
|
\fBfe\fP.
|
|
.PP
|
|
The way the backend table name is determined is more
|
|
convoluted.
|
|
.br
|
|
First, when the last filename in the program call name is not
|
|
one of \fIack\fP, \fIcc\fP, \fIacc\fP, \fIpc\fP or \fIapc\fP,
|
|
this filename is used as the backend description name.
|
|
Second, when the \fB-m\fP is present the \fB-m\fP is chopped of this
|
|
flag and the rest is used as the backend description name.
|
|
Third, when both failed the shell environment variable ACKM is
|
|
used.
|
|
Last, when also ACKM was not present the default backend is
|
|
used, determined by the definition of ACKM in h/local.h.
|
|
The presence and value of the definition of ACKM is
|
|
determined at compile time of \fIack\fP.
|
|
.PP
|
|
Now, we have the names, but that is only the first step.
|
|
\fIAck\fP stores a few descriptions at compile time.
|
|
This descriptions are simply files read in at compile time.
|
|
At the moment of writing this document, the descriptions
|
|
included are: pdp, fe, i86, m68k2, vax2 and int.
|
|
The name of a description is first searched for internally,
|
|
then in the directory lib/ack and finally in the current
|
|
directory of the user.
|
|
.NH
|
|
Using the description file
|
|
.PP
|
|
Before starting on a narrative of the description file,
|
|
the introduction of a few terms is necessary.
|
|
All these terms are used to describe the scanning of zero
|
|
terminated strings, thereby producing another string or
|
|
sequence of strings.
|
|
.IP Backslashing 5
|
|
.br
|
|
All characters preceded by \e are modified to prevent
|
|
recognition at further scanning.
|
|
This modification is undone before a string is passed to the
|
|
outside world as argument or message.
|
|
When reading the description files the
|
|
sequences \e\e, \e# and \e<newline> have a special meaning.
|
|
\e\e translates to a single \e, \e# translates to a single #
|
|
that is not
|
|
recognized as the start of comment, but can be used in
|
|
recognition and finally, \e<newline> translates to nothing at
|
|
all, thereby allowing continuation lines.
|
|
.nr PD 0
|
|
.IP "Variable replacement"
|
|
.br
|
|
The scan recognizes the sequences {{, {NAME} and {NAME?text}
|
|
Where NAME can be any combination if characters excluding ? and
|
|
} and text may be anything excluding }.
|
|
(~\e} is allowed of course~)
|
|
The first sequence produces an unescaped single {.
|
|
The second produces the contents of the NAME, definitions are
|
|
done by \fIack\fP and in description files.
|
|
When the NAME is not defined an error message is produced on
|
|
the diagnostic output.
|
|
The last sequence produces the contents of NAME if it is
|
|
defined and text otherwise.
|
|
.PP
|
|
.IP "Expression replacement"
|
|
.br
|
|
Syntax: (\fIsuffix sequence\fP:\fIsuffix sequence\fP=\fItext\fP)
|
|
.br
|
|
Example: (.c.p.e:.e=tail_em)
|
|
.br
|
|
If the two suffix sequences have a common member -~\&.e in this
|
|
case~- the text is produced.
|
|
When no common member is present the empty string is produced.
|
|
Thus the example given is a constant expression.
|
|
Normally, one of the suffix sequences is produced by variable
|
|
replacement.
|
|
\fIAck\fP sets three variables while performing the diverse
|
|
transformations: HEAD, TAIL and RTS.
|
|
All three variables depend on the properties \fIrts\fP and
|
|
\fIneed\fP from the transformations used.
|
|
Whenever a transformation is used for the first time,
|
|
the text following the \fIneed\fP is appended to both the HEAD and
|
|
TAIL variable.
|
|
The value of the variable RTS is determined by the first
|
|
transformation used with a \fIrts\fP property.
|
|
.LP
|
|
Two runtime flags have effect on the value of one or more of
|
|
these variables.
|
|
The flag \fB-.suffix\fP has the same effect on these three variables
|
|
as if a file with that \fBsuffix\fP was included in the argument list
|
|
and had to be translated.
|
|
The flag \fB-r.suffix\fP only has that effect on the TAIL
|
|
variable.
|
|
The program call names \fIacc\fP and \fIcc\fP have the effect
|
|
of an automatic \fB-.c\fB flag.
|
|
\fIApc\fP and \fIpc\fP have the effect of an automatic \fB-.p\fP flag.
|
|
.IP "Line splitting"
|
|
.br
|
|
The string is transformed into a sequence of strings by replacing
|
|
the blank space by string separators (nulls).
|
|
.IP "IO replacement"
|
|
.br
|
|
The > in the string is replaced by the output file name.
|
|
The < in the string is replaced by the input file name.
|
|
When multiple input files are present the string is duplicated
|
|
for each input file name.
|
|
.nr PD 1v
|
|
.LP
|
|
Each description is a sequence of variable definitions followed
|
|
by a sequence of transformation definitions.
|
|
Variable definitions use a line each, transformations
|
|
definitions consist of a sequence of lines.
|
|
Empty lines are discarded, as are lines with nothing but
|
|
comment.
|
|
Comment is started by a # character, and continues to the end
|
|
of the line.
|
|
Three special two-characters sequences exist: \e#, \e\e and
|
|
\e<newline>.
|
|
Their effect is described under 'backslashing' above.
|
|
Each - nonempty - line starts with a keyword, possibly
|
|
preceded by blank space.
|
|
The keyword can be followed by a further specification.
|
|
The two are separated by blank space.
|
|
.PP
|
|
Variable definitions use the keyword \fIvar\fP and look like this:
|
|
.DS X
|
|
var NAME=text
|
|
.DE
|
|
The name can be any identifier, the text may contain any
|
|
character.
|
|
Blank space before the equal sign is not part of the NAME.
|
|
Blank space after the equal is considered as part of the text.
|
|
The text is scanned for variable replacement before it is
|
|
associated with the variable name.
|
|
.br
|
|
.sp 2
|
|
The start of a transformation definition is indicated by the
|
|
keyword \fIname\fP.
|
|
The last line of such a definition contains the keyword
|
|
\fIend\fP.
|
|
The lines in between associate properties to a transformation
|
|
and may be presented in any order.
|
|
The identifier after the \fIname\fP keyword determines the name
|
|
of the transformation.
|
|
This name is used for debugging and by the \fB-R\fP flag.
|
|
The keywords are used to specify which input suffices are
|
|
recognized by that transformation,
|
|
the program to run, the arguments to be handed to that program
|
|
and the name or suffix of the resulting output file.
|
|
Two keywords are used to indicate which run-time startoffs and
|
|
libraries are needed.
|
|
The possible keywords are:
|
|
.IP \fIfrom\fP
|
|
.br
|
|
followed by a sequence of suffices.
|
|
Each file with one of these suffices is allowed as input file.
|
|
Preprocessor transformations, those with the \fBP\fP property
|
|
after the \fIprop\fP keyword, do not need the \fIfrom\fP
|
|
keyword. All other transformations do.
|
|
.nr PD 0
|
|
.IP \fIto\fP
|
|
.br
|
|
followed by the suffix of the output file name or in the case of a
|
|
linker -~indicated by C option after the \fIprop\fP keyword~-
|
|
the output file name.
|
|
.IP \fIprogram\fP
|
|
.br
|
|
followed by name of the load file of the program, a pathname most likely
|
|
starts with either a / or {EM}.
|
|
This keyword must be
|
|
present, the remainder of the line
|
|
is subject to backslashing and variable replacement.
|
|
.IP \fImapflag\fP
|
|
.br
|
|
The mapflags are used to grab flags given to \fIack\fP and
|
|
pass them on to a specific transformation.
|
|
This feature uses a few simple pattern matching and replacement
|
|
facilities.
|
|
Multiple occurences of this keyword are allowed.
|
|
This text following the keyword is
|
|
subjected to backslashing.
|
|
The keyword is followed by a match expression and a variable
|
|
assignment separated by blank space.
|
|
As soon as both description files are read, \fIack\fP looks
|
|
at all transformations in these files to find a match for the
|
|
flags given to \fIack\fP.
|
|
The flags \fB-m\fP, \fB-o\fP,
|
|
\fI-O\fP, \fB-r\fP, \fB-v\fP, \fB-g\fP, -\fB-c\fP, \fB-t\fP,
|
|
\fB-k\fP, \fB-R\fP and -\f-.\fP are specific to \fIack\fP and
|
|
not handed down to any transformation.
|
|
The matching is performed in the order in which the entries
|
|
appear in the definition.
|
|
The scanning stops after first match is found.
|
|
When a match is found, the variable assignment is executed.
|
|
A * in the match expression matches any sequence of characters,
|
|
a * in the right hand part of the assignment is
|
|
replaced by the characters matched by
|
|
the * in the expression.
|
|
The right hand part is also subject to variable replacement.
|
|
The variable will probably be used in the program arguments.
|
|
The \fB-l\fP flags are special,
|
|
the order in which they are presented to \fIack\fP must be
|
|
preserved.
|
|
The identifier LNAME is used in conjunction with the scanning of
|
|
\fB-l\fP flags.
|
|
The value assigned to LNAME is used to replace the flag.
|
|
The example further on shows the use all this.
|
|
.IP \fIargs\fP
|
|
.br
|
|
The keyword is followed by the program call arguments.
|
|
It is subject to backslashing, variable replacement, expression
|
|
replacement, line splitting and IO replacement.
|
|
The variables assigned to by \fImapflags\P will probably be
|
|
used here.
|
|
The flags not recognized by \fIack\fP or any of the transformations
|
|
are passed to the linker and inserted before all other arguments.
|
|
.IP \fIprop\fB
|
|
.br
|
|
This -~optional~- keyword is followed by a sequence of options,
|
|
each option is indicated by one character
|
|
signifying a special property of the transformation.
|
|
The possible options are:
|
|
.DS X
|
|
< the input file will be read from standard input
|
|
> the output file will be written on standard output
|
|
p the input files must be preprocessed
|
|
m the input files must be preprocessed when starting with #
|
|
O this transformation is an optimizer and may be skipped
|
|
P this transformation is the preprocessor
|
|
C this transformation is the linker
|
|
.DE
|
|
.IP \fIrts\fP
|
|
.br
|
|
This -~optional~- keyword indicates that the rest of the line must be
|
|
used to set the variable RTS, if it was not already set.
|
|
Thus the variable RTS is set by the first transformation
|
|
executed which such a property or as a result from \fIack\fP's program
|
|
call name (acc, cc, apc or pc) or by the \fB-.suffix\fP flag.
|
|
.IP \fIneed\fP
|
|
.br
|
|
This -~optional~- keyword indicates that the rest of the line must be
|
|
concatenated to the NEEDS variable.
|
|
This is done once for every transformation used or indicated
|
|
by one of the program call names mentioned above or indicated
|
|
by the \fB-.suffix\fP flag.
|
|
.br
|
|
.nr PD 1v
|
|
.NH
|
|
Conventions used in description files
|
|
.PP
|
|
\fIAck\fP reads two description files.
|
|
A few of the variables defined in the machine specific file
|
|
are used by the descriptions of the front-ends.
|
|
Other variables, set by \fack\fB, are of use to all
|
|
transformations.
|
|
.PP
|
|
\fIAck\fP sets the variable EM to the home directory of the
|
|
Amsterdam Compiler Kit.
|
|
The variable SOURCE is set to the name of the argument that is currently
|
|
being massaged, this is usefull for debugging.
|
|
.br
|
|
The variable M indicates the
|
|
directory in mach/{M}/lib/tail_..... and NAME is the string to
|
|
be defined by the preprocessor with -D{NAME}.
|
|
The definitions of {w}, {s}, {l}, {d}, {f} and {p} indicate
|
|
EM_WSIZE, EM_SSIZE, EM_LSIZE, EM_DSIZE, EM_FSIZE and EM_PSIZE
|
|
respectively.
|
|
.br
|
|
The variable INCLUDES is used as the last argument to \fIcpp\fP,
|
|
it is currently used to add the directory {EM}/include to
|
|
the list of directories containing #include files.
|
|
{EM}/include contains a few files used by the library routines
|
|
for part III from the
|
|
.UX
|
|
manual.
|
|
These routines are included in the kit.
|
|
.PP
|
|
The variables HEAD, TAIL and RTS are set by \fIack\fP and used
|
|
to compose the arguments for the linker.
|
|
.NH
|
|
Example
|
|
.sp 1
|
|
description for front-end
|
|
.DS X
|
|
name cpp # the C-preprocessor
|
|
# no from, it's governed by the P property
|
|
to .i # result files have suffix i
|
|
program {EM}/lib/cpp # pathname of loadfile
|
|
mapflag -I* CPP_F={CPP_F?} -I* # grab -I.. -U.. and
|
|
mapflag -U* CPP_F={CPP_F?} -U* # -D.. to use as arguments
|
|
mapflag -D* CPP_F={CPP_F?} -D* # in the variable CPP_F
|
|
args {CPP_F?} {INCLUDES?} -D{NAME} -DEM_WSIZE={w} -DEM_PSIZE={p} \
|
|
-DEM_SSIZE={s} -DEM_LSIZE={l} -DEM_FSIZE={f} -DEM_DSIZE={d} <
|
|
# The arguments are: first the -[IUD]...
|
|
# then the include dir's for this machine
|
|
# then the NAME and size valeus finally
|
|
# followed by the input file name
|
|
prop >P # Output on stdout, is preprocessor
|
|
end
|
|
name cem # the C-compiler proper
|
|
from .c # used for files with suffix .c
|
|
to .k # produces compact code files
|
|
program {EM}/lib/em_cem # pathname of loadfile
|
|
mapflag -p CEM_F={CEM_F?} -Xp # pass -p as -Xp to cem
|
|
mapflag -L CEM_F={CEM_F?} -l # pass -L as -l to cem
|
|
args -Vw{w}i{w}p{p}f{f}s{s}l{l}d{d} {CEM_F?}
|
|
# the arguments are the object sizes in
|
|
# the -V... flag and possibly -l and -Xp
|
|
prop <>p # input on stdin, output on stdout, use cpp
|
|
rts .c # use the C run-time system
|
|
need .c # use the C libraries
|
|
end
|
|
name decode # make human readable files from compact code
|
|
from .k.m # accept files with suffix .k or .m
|
|
to .e # produce .e files
|
|
program {EM}/lib/em_decode # pathname of loadfile
|
|
args < # the input file name is the only argument
|
|
prop > # the output comes on stdout
|
|
end
|
|
.DE
|
|
|
|
.DS X
|
|
Example of a backend, in this case the EM assembler/loader.
|
|
|
|
var w=2 # wordsize 2
|
|
var p=2 # pointersize 2
|
|
var s=2 # short size 2
|
|
var l=4 # long size 4
|
|
var f=4 # float size 4
|
|
var d=8 # double size 8
|
|
var M=int # Unused in this example
|
|
var NAME=int22 # for cpp (NAME=int results in #define int 1)
|
|
var LIB=mach/int/lib/tail_ # part of file name for libraries
|
|
var RT=mach/int/lib/head_ # part of file name for run-time startoff
|
|
var SIZE_FLAG=-sm # default internal table size flag
|
|
var INCLUDES=-I{EM}/include # use {EM}/include for #include files
|
|
name asld # Assembler/loader
|
|
from .k.m.a # accepts compact code and archives
|
|
to e.out # output file name
|
|
program {EM}/lib/em_ass # load file pathname
|
|
mapflag -l* LNAME={EM}/{LIB}* # e.g. -ly becomes
|
|
# {EM}/mach/int/lib/tail_y
|
|
mapflag -+* ASS_F={ASS_F?} -+* # recognize -+ and --
|
|
mapflag --* ASS_F={ASS_F?} --*
|
|
mapflag -s* SIZE_FLAG=-s* # overwrite old value of SIZE_FLAG
|
|
args {SIZE_FLAG} \
|
|
({RTS}:.c={EM}/{RT}cc) ({RTS}:.p={EM}/{RT}pc) -o > < \
|
|
(.p:{TAIL}={EM}/{LIB}pc) \
|
|
(.c:{TAIL}={EM}/{LIB}cc.1s {EM}/{LIB}cc.2g) \
|
|
(.c.p:{TAIL}={EM}/{LIB}mon)
|
|
# -s[sml] must be first argument
|
|
# the next line contains the choice for head_cc or head_pc
|
|
# and the specification of in- and output.
|
|
# the last three args lines choose libraries
|
|
prop C # This is the final stage
|
|
end
|
|
.DE
|
|
|
|
The command "ack -mint -v -v -I../h -L -ly prog.c"
|
|
would result in the following
|
|
calls (with exec(II)):
|
|
.DS X
|
|
1) /lib/cpp -I../h -I/usr/em/include -Dint22 -DEM_WSIZE=2 -DEM_PSIZE=2
|
|
-DEM_SSIZE=2 -DEM_LSIZE=4 -DEM_FSIZE=4 -DEM_DSIZE=8 prog.c
|
|
2) /usr/em/lib/em_cem -Vw2i2p2f4s2l4d8 -l
|
|
3) /usr/em/lib/em_ass -sm /usr/em/mach/int/lib/head_cc -o e.out prog.k
|
|
/usr/em/mach/int/lib/tail_y /usr/em/mach/int/lib/tail_cc.1s
|
|
/usr/em/mach/int/lib/tail_cc.2g /usr/em/mach/int/lib/tail_mon
|
|
.DE
|