1991-09-27 16:19:24 +00:00
|
|
|
.so init
|
|
|
|
.nr H1 2
|
|
|
|
.NH
|
|
|
|
PROBLEMS
|
|
|
|
.NH 2
|
|
|
|
Maintain SPARC speed
|
|
|
|
.PP
|
|
|
|
If we want to generate SPARC code, we should try to generate efficient code
|
|
|
|
as fast as possible. It would be quite embarrassing to find out that the
|
|
|
|
same program would run faster on a Motorola 68020 than on a SPARC processor,
|
|
|
|
when both operate at the same clock frequency.
|
|
|
|
Looking at some code generated by Sun's C-compiler and optimizing assembler,
|
|
|
|
we can spot a few remarkable characteristics of the generated SPARC code:
|
|
|
|
.IP -
|
|
|
|
There are almost no memory references
|
|
|
|
.IP -
|
|
|
|
Parameters to functions are passed through registers.
|
|
|
|
.IP -
|
|
|
|
Almost all delay slots\(dg
|
|
|
|
.FS
|
|
|
|
\(dg For details about delay slots see the SPARC Architecture Manual, chapter 4, pp. 42-48
|
|
|
|
.FE
|
|
|
|
are filled in by the assembler
|
|
|
|
.LP
|
|
|
|
If we want to generate efficient code, we should at least try to
|
|
|
|
reduce the number of memory references and use registers wherever we can.
|
|
|
|
Since EM is stack-oriented it references its stack for every operation so
|
|
|
|
this will not be an easy task; a suitable solution will however be given in
|
|
|
|
the next chapter.
|
|
|
|
.NH 2
|
|
|
|
Increase compilation speed
|
|
|
|
.PP
|
|
|
|
Because we will implement a code expander (fast backend) we should keep
|
|
|
|
a close eye on efficiency; if we cannot beat regular compilers on producing
|
|
|
|
efficient code we will try to beat them on fast code generation.
|
|
|
|
The usual trick to achieve fast compilation is to pack the frontend,
|
|
|
|
optimizer, code-generator and
|
|
|
|
assembler all into a single large binary to reduce the overhead of
|
|
|
|
reading and writing temporary files. Unfortunately, due to the
|
|
|
|
SPARC instruction set, its relocation information is slightly bizarre
|
|
|
|
and cannot be represented with the present primitives.
|
|
|
|
This means that it will not be possible to generate the required output
|
|
|
|
format directly from our backend.
|
|
|
|
.PP
|
|
|
|
There are three solutions here: generate assembler code, and let an
|
|
|
|
existing assembler generate the required object (\fI.o\fR) files,
|
|
|
|
create our own primitives than can handle the SPARC relocation format, or
|
|
|
|
do not use any of the addressing modes that require the bizarre relocation.
|
|
|
|
Because we have enough on our hands already we will
|
|
|
|
let the existing assembler deal with generating object files.
|
|
|
|
.NH 2
|
|
|
|
Convert stack to register operations
|
|
|
|
.PP
|
|
|
|
As we wrote in the previous chapter, for RISC machines a code expander can
|
|
|
|
produce almost as efficient code as a code generator. The fact that this is
|
|
|
|
true for stack-oriented RISC processors is rather obvious. The problem we
|
|
|
|
face, however, is that the SPARC processor is register, instead of
|
|
|
|
stack oriented. In the next chapter we will give a suitable solution to
|
|
|
|
convert most stack accesses to register accesses.
|
|
|
|
.NH 2
|
|
|
|
Miscellaneous
|
|
|
|
.PP
|
|
|
|
Besides performance and \fI.o\fR-compatibility there are some other
|
|
|
|
peculiarities of the SPARC processor and Sun's C-compiler (henceforth
|
|
|
|
simply called \fIcc\fR).
|
|
|
|
.PP
|
|
|
|
For some reason, the SPARC stack pointer requires alignment
|
1991-11-19 13:37:20 +00:00
|
|
|
on 8 bytes, so it is impossible to push a 4-byte integer on the stack
|
1991-09-27 16:19:24 +00:00
|
|
|
and then \*(Sisub 4, %sp\*(So\(dd.
|
|
|
|
.FS
|
|
|
|
\(dd For more information about SPARC assembler see the Sun-4 Assembly
|
|
|
|
Language Reference Manual
|
|
|
|
.FE
|
|
|
|
This too will be discussed in the next chapter, where we will take a
|
|
|
|
more in-depth look into this problem and also discuss a couple of
|
|
|
|
possible solutions.
|
|
|
|
.PP
|
|
|
|
Another thing is that \fIcc\fR usually passes the first six parameters of a
|
|
|
|
function-call through registers. To be \fI.o\fR-compatible we would have to
|
|
|
|
pass the first six parameters of each function call through registers as well.
|
|
|
|
Exactly why this is not feasible will also be discussed in the next chapter.
|
|
|
|
.bp
|