d0p1/ack - Cute Engineering : Cute solutions to hard problems

d0p1/ack

Author	SHA1	Message	Date
George Koehler	5aa2ac2246	Teach the assembler about PowerPC extended mnemonics. Also make a few changes to basic mnemonics. Fix typo in name of the basic "creqv". Add the basic "addc" and relatives, because it would be odd to have the extended "subc" without "addc". Fix the basic "rldicl", "rldicr", "rldic", "rldimi" to correctly encode the 6-bit MB field. Fix "slw" and relatives to correctly swap their RA and RS operands. Add many, but not all, of the extended mnemonics from IBM's Power ISA Version 2.06 Book I Appendix E. (I used 2.06, published 2009, just because I already had the PDF of it.) This commit includes mnemonics for branching, subtraction, traps, bit rotation, and a few others, like "mflr" and "nop". The assembler now understands branches like `beq cr7, label` and bit shifts like `slwi r7, r7, 2`. These encode the same machine instructions as the basic "bc" and "rlwinm". Some operands to basic names become optional. The assembler no longer requires the level in "sc" or the branch hint in "bcctr" and "bclr"; they default to zero. Some extended names take an optional branch hint or condition register. Some extended names are still missing. I don't provide names with static branch prediction, like "beq+" or "bge-", because the assembler parses '+' and '-' as operators, not as part of an instruction name. I also don't provide some names that 2.06 has for moving to or from the condition register or some special purpose registers, names like "mtcr" or "mfuamr". This commit also deletes some unused tokens and one unused yacc rule.	2017-01-21 23:49:29 -05:00
David Given	d7df126730	Merge pull request #44 from kernigh/kernigh-pr-as mach/proto/as: allow more tokens	2017-01-18 23:33:40 +01:00
George Koehler	f705339f86	Allow more tokens in the assembler. I need this so I can add more %token lines to mach/powerpc/as/mach2.c The assembler's tempfile encoded each token in a byte. This only worked with tokens 0 to 127 and 256 and 383. If a token 384 or higher existed, the assembler stopped working. I need tokens 384 and higher. I change the token encoding to a 2-byte little-endian integer. I also change a byte in the string encoding.	2017-01-17 22:41:11 -05:00
David Given	232545606d	Merge from default.	2017-01-18 00:02:32 +01:00
George Koehler	ba2a03705e	Use prototypes in mach/proto/as/comm5.c Order the function prototypes in comm1.h to match the order of the function definitions in *.c files.	2017-01-17 16:41:29 -05:00
David Given	81c677d218	Add a bunch more set operations to the PowerPC backends, and the Pascal test for the same.	2017-01-17 22:31:38 +01:00
George Koehler	916d270534	Delay inclusion of <stdint.h> when compiling comm2.y See issue #1 (https://github.com/davidgiven/ack/issues/1). The file mach/proto/as/comm2.y goes through cpp twice. The _include macro, defined in comm2.y and used in comm0.h, delays the inclusion of system header files. The inclusion of <stdint.h> wasn't delayed. This caused multiple inclusions of <sys/_types.h> in FreeBSD and <machine/_types.h> in OpenBSD. Use _include to delay <stdint.h>. Also use _include for "arch.h" and "out.h", because h/out.h includes <stdint.h> and h/arch.h might include it in the future. Sort the system includes in comm0.h by moving them up to be with <stdint.h>. Must include <stdint.h> before "mach0.c", because mach/powerpc/as/mach0.c needs it. Must include "mach0.c" before checking ASLD.	2017-01-16 22:39:44 -05:00
George Koehler	e97116c037	Remove some obsolete code that causes a gcc warning. In my OpenBSD/amd64 system, the code becomes if (0) outname.on_valu &= ~(((0xFFFFFFFF)<<32)<<32); The 0xFFFFFFFF is a 32-bit int, so the left shift by 32 is out of range and causes the gcc warning. The intent might be to clear any sign-extended bits, if the assignment outname.on_valu = valu did sign extension. Old C had no unsigned long, so .on_valu would have been long. The code is obsolete because h/out.h now declares .on_valu as uint32_t.	2017-01-16 18:09:55 -05:00
David Given	c471f617b7	Ensure that memory is zero-initialised.	2017-01-16 22:45:03 +01:00
David Given	2cdcc16bc2	Fix a buffer overrun that was manifesting on OpenBSD; also fix a bounds check and some uninitialised variable problems.	2017-01-16 22:44:37 +01:00
David Given	fa5675d439	Run through clang-format.	2017-01-16 21:16:33 +01:00
David Given	e7e29d34ff	Add a test (currently failing) to check that Pascal char sets can store all 256 possible values. Add the PowerPC ncg and mcg backend support to let the test actually run, including modifying a bunch of PowrePC libem functions so that they can be called from both ncg and mcg.	2017-01-15 22:28:14 +01:00
David Given	9a346c382d	Turns out Apple's hi16/ha16 exactly match my ha16/has16, so renamed accordingly. (Memo to self: read the docs before doing the work.)	2017-01-15 11:59:33 +01:00
David Given	f80acfe9f5	Signed vs unsigned lower halves of powerpc fixups are now handled by having two assembler directives, ha16() and has16(), for the upper half; has16() applies the sign adjustment. .powerpcfixup is now gone, as we generate the relocation in ha*() instead. Add special logic to the linker for undoing and redoing the sign adjustment when reading/writing fixups. Tests still pass.	2017-01-15 11:51:37 +01:00
David Given	3c0bc205fc	Update the hi/lo syntax to be a bit more standard.	2017-01-15 10:21:02 +01:00
David Given	8edbff9795	Add assembler support for fixing up arbitrary oris/addi pairs of instructions; this should allow oris/lwz constant value loads, which will save an opcode.	2017-01-15 00:15:01 +01:00
David Given	efab08178b	Fix a bunch of issues with pushing and popping mismatched sizes, which the B compiler does a lot; dup 8 for pairs of words is now optimised.	2017-01-07 18:47:00 +01:00
David Given	6b4f8d72b8	ine and ste are now declared to modify memory (preventing cached values being propagated across the modification).	2017-01-07 13:25:09 +01:00
David Given	7710c76d56	Introduce sequence points before store instructions to prevent loads from the same address being delayed until after the store (at which point they'll return the wrong value).	2017-01-07 13:17:39 +01:00
David Given	0da248dced	Use a better NOT; and after remembering that PowerPC bit numbers are all backwards in the documentation, rewrote IFEQ/IFLT/IFLE to actually work. Probably. Thanks to the B test suite for spotting this.	2017-01-07 01:03:15 +01:00
David Given	73922f1d16	Ensure that procedure labels are word-aligned.	2017-01-06 22:29:52 +01:00
David Given	e3f8fb84dc	Change the i80 assembler to be three-pass, which allows forward references; required for assembling B.	2016-12-29 17:08:53 +00:00
David Given	e50f4be710	Merge from default.	2016-12-26 19:44:48 +00:00
David Given	bf2e0be69a	Merge pull request #27 from kernigh/pr-qemu-doze Teach qemuppc to halt the cpu on _exit().	2016-12-11 23:17:12 +01:00
George Koehler	8605a2fcfc	Add Modula-2 set operations to PowerPC ncg. This provides and, ior, xor, com, zer, set, cms when defined($1) and ior, set when !defined($1). I don't provide the other operations !defined($1) because our Modula-2 compiler hasn't used them. I wrote a Modula-2 example in https://gist.github.com/kernigh/add79662bb3c63ffb7c46d01dc8ae788 Put a dummy comment in mach/powerpc/libem/build.lua so git checkout will touch that file. Without the touch, the build system doesn't see the new *.s files.	2016-12-10 12:23:07 -05:00
George Koehler	fcda786fe9	Add some missing clauses to los, sts, aar, inn, cmi, cmu. We only implement 'los 4', 'sts 4', 'cmi 4', 'cmu 4', not for sizes other than 4. Add clause $1==4. We only implement inn when defined($1). The rule for aar needs 'kills ALL' because it kills many registers, like other rules that call libem.	2016-12-09 19:49:50 -05:00
George Koehler	436114fce4	Add a move from CONST smalls(%val) to GPR. This allows 'move {CONST, $1}, R3' with a small enough $1 to emit one instruction (addi) instead of two instructions (addis, ori). The CONST token confusingly isn't in the CONST_ALL set.	2016-12-09 18:40:14 -05:00
George Koehler	17211eef47	Fix ass to match the EM spec. The spec says, "ASS w: Adjust the stack pointer by w-byte integer". The w argument "can either be given as argument or on top of the stack." Therefore, 'ass 4' would pop the 4-byte integer from the stack, but 'ass' would pop the size w from the stack, then pop the w-byte integer. PowerPC ncg wrongly implemented 'ass' as if it was 'ass 4'. Fix it to accept only 'ass 4'.	2016-12-09 17:32:42 -05:00
George Koehler	5bd0ad4269	Remove the bogus rules for 'lor 2' and 'str 2'. These instructions would load or store the EM heap pointer. They don't work. Programs must use brk() or sbrk() in libsys. The last file to use 'lor 2' and 'str 2' was lang/pc/libpc/sav.e in the Pascal library. Commit `c084f9f` deleted the file, so we no longer need rules 'lor 2' or 'str 2' to build the ACK.	2016-12-09 17:00:56 -05:00
George Koehler	805883e377	Fill in a hint for enabling the COMMENT macro. If you want to enable comments in the .s file, change #define COMMENT(n) /* comment {LABEL, n} */ to #define COMMENT(n) comment {LABEL, n}	2016-12-09 16:58:47 -05:00
George Koehler	244e554f2f	Remove trailing whitespace in mach/powerpc/ncg/table	2016-12-09 16:36:42 -05:00
George Koehler	b8c921ca70	Allow mfspr, mtspr with a register number. PowerPC has a few hundred special-purpose registers. The assembler had only accepted the names "xer", "lr", "ctr". Most programs use only those three SPRs. If I add more names, they would almost never get used, and they might conflict with labels. I want to use "mfspr r3, 0x3f0" and "mtspr 0x3f0, r3" in plat/qemu/boot.s to access register hid0 from supervisor mode.	2016-12-07 17:28:00 -05:00
David Given	55e24e1f24	inn was assuming that bitfields were arrays of bytes, when actually they're arrays of words (which makes the LSB move on big-endian systems).	2016-12-06 21:45:20 +01:00
David Given	fbd6e8f63d	Add support for consecutive labels; needed by the B compiler.	2016-11-27 21:18:00 +01:00
David Given	5bce5fc4da	Change the extension used by Basic files for .b to .bas, to avoid conflicts with B.	2016-11-27 20:38:33 +01:00
David Given	f8fa3ece42	inn on ncg now passes the CPU tests.	2016-11-20 19:35:34 +01:00
David Given	953c08839f	inn works now; add a helper for it.	2016-11-20 12:53:44 +01:00
David Given	196fa914b3	lxa now works, I hope; traps are better (and stubbed out on qemuppc).	2016-11-20 11:57:21 +01:00
David Given	d5328492d7	Better handling of float conversions; more tests; converting to unsigned ints works now.	2016-11-20 11:27:40 +01:00
David Given	454a7494bb	cif8 and cuf8 work now. More tests.	2016-11-19 11:42:30 +01:00
David Given	cc660b230f	Floats and doubles are now written out correctly.	2016-11-19 11:39:13 +01:00
David Given	d31bc6a3f9	Made csa and csb work with mcg; adjust the libem functions and the corresponding invocation in the ncg table so the same helpers can be used for both mcg and ncg. Add a new IR opcode, FARJUMP, which jumps to a helper function but saves volatile registers.	2016-11-19 10:55:41 +01:00
David Given	5208e5f751	Yet another OB1 stack format fix.	2016-11-19 10:42:22 +01:00
David Given	43439c6d0c	Remember to push the result of lor onto the stack.	2016-11-17 22:04:32 +01:00
David Given	81bc2c74c5	A bb's regsin are no longer the same as those of its first instruction; occasionally the first hop of a block would try to rearrange its registers (due to evicted throughs), resulting in the phi moves copying values into the wrong registers.	2016-11-16 20:52:15 +01:00
David Given	581fa4a457	Reenable eviction of corrupted registers, which had been broken by a previous change. Change the register move code to get swaps right, or at least righter.	2016-11-15 21:55:10 +01:00
David Given	86c832ef86	Put saved registers in actually the write place. I hope.	2016-11-15 21:54:15 +01:00
David Given	cc686ded62	Get subtractions the right way round.	2016-11-15 20:25:11 +01:00
David Given	0289b1004e	Allow values left on the stack at the end of the procedure (it's legal!).	2016-11-14 21:47:49 +01:00
David Given	e7132183fb	Fix buffer overrun: if LABEL_STARTER is seen but LABEL_TERMINATOR is not, the label parser will keep going forever looking for the end of the label. It now stops at the end of the string.	2016-11-13 14:04:58 +01:00
David Given	852d3a691d	Update the table to return call output values in the right registers. Fix the register allocator so the corrupted registers only apply to throughs (otherwise, you can't put output registers in corrupted registers).	2016-11-11 21:48:36 +01:00
David Given	b5c1d622f5	Rework the way stack frames are laid out to be simpler and, hopefully, more correct. Saved registers are now placed in what may be the right place.	2016-11-11 21:17:45 +01:00
David Given	84ee75ec07	Merge from default.	2016-11-11 20:17:54 +01:00
David Given	d82df74a7a	Rename addr_t to address_t to avoid clashes with the system addr_t.	2016-11-11 20:17:10 +01:00
David Given	fd91851005	Add enough return types to the K&R C that the ACK builds (on Linux) using clang now.	2016-11-10 22:04:18 +01:00
David Given	4fa2c94a4a	Correctly mangle labels used in initialisers.	2016-10-31 23:21:33 +01:00
David Given	9261cd978d	Typo fix.	2016-10-31 23:16:02 +01:00
David Given	941072e0d7	Add, I hope, patterns for fmsub, fnmadd, and fnmsub (also float versions).	2016-10-31 22:36:54 +01:00
David Given	44f0cea6ca	Also use fmadd for single-precision floats.	2016-10-31 19:55:16 +01:00
David Given	064d1a5d5d	Use fmadd for multiply-and-add instructions.	2016-10-31 19:52:17 +01:00
David Given	e19850b114	Fix a few c11isms.	2016-10-30 16:51:06 +01:00
David Given	ca5b6e07bb	Properly export symbols.	2016-10-29 23:52:17 +02:00
David Given	8c3670483f	Get top working with the PowerPC; use it to eliminate useless branches and moves.	2016-10-29 23:37:11 +02:00
David Given	a8c4dac67c	Merge from default (merging in George Koehler's PowerPC changes).	2016-10-29 22:40:40 +02:00
David Given	a311e61360	Add support for preserved registers.	2016-10-29 20:22:44 +02:00
David Given	e3ebf986e9	More opcodes.	2016-10-29 13:32:09 +02:00
David Given	1ae8b90238	More opcodes.	2016-10-29 12:55:34 +02:00
David Given	acaae765af	Emit negative constants correctly.	2016-10-29 12:55:21 +02:00
David Given	61349389fb	More opcodes. sti can now cope with non-standard sizes (really need a better fix for this). Hack in crude support for mismatched stack pushes and pops (ints vs longs).	2016-10-29 12:48:05 +02:00
David Given	68419da235	Actually, the locals need to go above the spills and saved regs, so fp == lb.	2016-10-29 12:00:33 +02:00
David Given	2cc2c0ae98	Lots more opcodes. Rearrange the stack layout so that fp->ab is a fixed value (needed for CHAINFP and FPTOAB). Wire up lfrs to calls via a phi when necessary, to allow call-bra-lfr chains.	2016-10-29 11:57:56 +02:00
David Given	bfa65168e2	Don't generate phis if unnecessary (because this breaks the critical-edge-splitting guarantee and causes insertion of phi copies to fail).	2016-10-29 10:55:48 +02:00
David Given	658db4ba71	Mangle label names (turns out that the ACK assembler can't really cope with labels that are the same name as instructions...).	2016-10-27 23:17:16 +02:00
David Given	81525c0f2c	Swaps work (at least for registers). More opcodes. Rearrange the stack layout so we can always trivially find fp, which lets CHAINFP work.	2016-10-27 21:50:58 +02:00
David Given	be3dece5af	Allow emission of strings containing ".	2016-10-27 21:48:46 +02:00
David Given	51bd3ee4dd	Fix bug where some phis weren't being inserted when a given variable definition needed more than one phi (due to the dominance frontier containing more than one basic block).	2016-10-27 21:40:25 +02:00
David Given	9977ce841a	Remove the bytes1, bytes2, bytes4, bytes8 attributes; remove the concept of a register 'type'; now use int/float/long/double throughout to identify registers. Lots of register allocator tweaks and table bugfixes --- we now get through the dreading Mathlib.mod!	2016-10-25 23:04:20 +02:00
David Given	45a7f2e993	Phi copies are now inserted as part of type inference. More opcodes.	2016-10-24 22:14:08 +02:00
David Given	111c13e253	More opcodes.	2016-10-24 20:15:22 +02:00
David Given	a4644dee4d	More opcodes.	2016-10-24 12:08:40 +02:00
David Given	b22780c075	More opcodes, including the difficult and fairly stupid los/sts.	2016-10-23 22:24:08 +02:00
David Given	abd0cedd61	Massive change to how IR types are handled; we use the type code for matching rather than the size. Much cleaner and simpler.	2016-10-23 21:54:14 +02:00
David Given	b1a3d76d6f	Re-re-add the type inference layer, now I know more about how things work. Remove that terrible float promotion code.	2016-10-22 23:04:13 +02:00
David Given	11b0bc1055	More opcodes.	2016-10-22 20:32:51 +02:00
David Given	2d52b1fdaa	Remove GETRET; values are now returned directly by CALL. Fix a bug in convertstackops which was resulting in duplicate IR groups.	2016-10-22 12:13:57 +02:00
David Given	ceb938fb3c	More opcodes.	2016-10-22 11:26:28 +02:00
David Given	7ae888b754	Hacky workaround the way the Modula-2 compiler generates non-standard sized loads and saves. More opcodes; simplified table using macros.	2016-10-22 10:48:22 +02:00
David Given	90d0661639	Typo fix.	2016-10-22 00:48:55 +02:00
David Given	f851ab83af	Better (and more correct) floating point conversions; fif; various new opcodes.	2016-10-22 00:48:26 +02:00
David Given	d535be87b1	fef4 and fef8 is now cleaner, albeit slower; add some more register alias stuff.	2016-10-22 00:02:15 +02:00
David Given	4db402f229	Add (pretty crummy) support for register aliases and static pairs of registers. We should have enough functionality now for rather buggy 8-bit ints and doubles. Rework the table and the platform.c to match.	2016-10-21 23:31:00 +02:00
David Given	e4fec71f9c	Lots more opcodes; better eviction behaviour; better register moves. Lots more PowerPC stuff (some working).	2016-10-19 23:29:05 +02:00
David Given	ffb1eabf45	Floating point promotion is less buggy.	2016-10-19 23:27:53 +02:00
George Koehler	99dee0ad24	Remove f14 to f31 from FREG and FSREG. This would have happened later, if f14 to f31 became regvar (like r13 to r31 are now). I am doing it now because ncg is too slow for rules "with FREG FREG uses FREG". We use such rules for adf 8 and other EM instructions that operate on 2 floats. Like my last commit `cfbc537`, this commit speeds ncg by removing choices for register allocation.	2016-10-18 21:16:47 -04:00
David Given	d5071e7df1	Promote values accessed via NOP.	2016-10-18 23:58:03 +02:00
David Given	5413d47029	'!' tracing is now always emitted; tracing goes to stderr.	2016-10-18 22:32:09 +02:00
David Given	3520704ea8	Add support for floating point constants.	2016-10-18 22:29:42 +02:00
George Koehler	cfbc537959	In powerpc ncg, add a speed hack for sti 8. ncg is too slow with this many registers. A stack pattern "with GPR GPR GPR" or "with REG REG REG" takes too long to pick registers, causing ncg 8 to take about 2 seconds on each sti 8. I introduce REG_PAIR and there are only 4 such pairs. For programs that use sti 8 (including C programs that copy 8-byte structs), this speed hack improves the ncg run from several seconds to almost instantaneous. Also add a few COMMENT(...) lines in stacking rules.	2016-10-17 20:31:59 -04:00
David Given	938fb8c2fc	Lots more opcodes.	2016-10-18 00:31:26 +02:00
David Given	4a093b9eba	Add li and mr pseudoinstructions.	2016-10-18 00:21:32 +02:00

1 2 3 4 5 ...

2467 commits