Commit Graph

6096 Commits

Author SHA1 Message Date
Kartik Agaram 30f844ee8f 6095 2020-03-07 18:32:36 -08:00
Kartik Agaram 3cf0315859 6094 - new 'compute-offset' instruction
If indexing into a type with power-of-2-sized elements we can access them
in one instruction:

  x/reg1: (addr int) <- index A/reg2: (addr array int), idx/reg3: int

This translates to a single instruction because x86 instructions support
an addressing mode with left-shifts.

For non-powers-of-2, however, we need a multiply. To keep things type-safe,
it is performed like this:

  x/reg1: (offset T) <- compute-offset A: (addr array T), idx: int
  y/reg2: (addr T) <- index A, x

An offset is just an int that is guaranteed to be a multiple of size-of(T).
Offsets can only be used in index instructions, and the types will eventually
be required to line up.

In the process, I have to expand Input-size because mu.subx is growing
big.
2020-03-07 17:40:45 -08:00
Kartik Agaram 9ee4b34e06 6093
Some much-needed reorganization.
2020-03-07 15:48:11 -08:00
Kartik Agaram 78da4f7135 6092 2020-03-06 23:58:17 -08:00
Kartik Agaram ee0e67b9d7 6091 2020-03-06 18:39:16 -08:00
Kartik Agaram 8f256f1f2e 6090 - new instruction: multiply by immediate
This is a 3-operand instruction:
  r32 = rm32 * imm32

It looks like https://c9x.me/x86/html/file_module_x86_id_138.html has a
bug, implying the same opcode supports a 2-operand version. I don't see
that in the Intel manual pdf, or at alternative sites like https://www.felixcloutier.com/x86/imul

Native runs seem to validate my understanding.

In the process I also fixed a bug in the existing multiply instruction
0f af: the only flags it sets are OF and CF. The other existing multiply
instruction f7 was doing things right.
2020-03-06 18:34:27 -08:00
Kartik Agaram 4c19dd3968 6089 2020-03-06 18:09:16 -08:00
Kartik Agaram 5c26afb1de 6088 - start using setCC instructions 2020-03-06 17:42:17 -08:00
Kartik Agaram 0743b981a8 6087 2020-03-06 13:51:51 -08:00
Kartik Agaram 7c109dffc8 6086 - `index` into arrays with a literal 2020-03-06 13:50:12 -08:00
Kartik Agaram b5fbf20556 6085
Support parsing ints from strings rather than slices.
2020-03-06 13:44:54 -08:00
Kartik Agaram 651fc300a4 6084 2020-03-06 12:28:57 -08:00
Kartik Agaram c1737cbaae 6083 2020-03-06 12:08:42 -08:00
Kartik Agaram 4032286f9b 6082 - bugfix in spilling register vars
In the process I'm starting to realize that my approach to avoiding spills
isn't ideal. It works for local variables but not to avoid spilling outputs.

To correctly decide whether to spill to an output register or not, we really
need to analyze when a variable is live. If we don't do that, we'll end
up in one of two bad situations:

a) Don't spill the outermost use of an output register (or just the outermost
scope in a function). This is weird because it's hard to explain to the
programmer why they can overwrite a local with an output above a '{' but
not below.

b) Disallow overwriting entirely. This is easier to communicate but quite
inconvenient. It's nice to be able to use eax for some temporary purpose
before overwriting it with the final result of a function.

If we instead track liveness, things are convenient and also easier to
explain. If a temporary is used after the output has been written that's
an obvious problem: "you clobbered the output". (It seems more reasonable
to disallow multiple live ranges for the output. Once an output is written
it can only be shadowed in a nested block.)

That's the bad news. Now for some good news:

One lovely property Mu the language has at the moment is that live ranges
are guaranteed to be linear segments of code. We don't need to analyze
loop-carried dependences. This means that we can decide whether a variable
is live purely by scanning later statements for its use. (Defining 'register
use' is slightly non-trivial; primitives must somehow specify when they
read their output register.)

So we don't actually need to worry about a loop reading a register with
one type and writing to another type at the end of an iteration. The only
way that can happen is if the write at the end was to a local variable,
and we're guaranteeing that local variables will be reclaimed at the end
of the iteration.

So, the sequence of tasks:
  a) compute register liveness
  b1) verify that all register variables used at any point in a program
  are always the topmost use of that register.
  b2) decide whether to spill/shadow, clobber or flag an error.

There's still the open question of where to attach liveness state. It can't
be on a var, because liveness varies by use of the var. It can't be on a
statement because we may want to know the liveness of variables not referenced
in a given statement. Conceptually we want a matrix of locals x stmts (flattened).
But I think it's simpler than that. We just want to know liveness at the
time of variable declarations. A new register variable can be in one of
three states w.r.t. its previous definition: either it's shadowing it,
or it can clobber it, or there's a conflict and we need to raise an error.

I think we can compute this information for each variable definition by
an analysis similar to existing ones, maintaining a stack of variable definitions.
The major difference is that we don't pop variables when a block ends.
Details to be worked out. But when we do I hope to get these pending tests
passing.
2020-03-06 00:06:42 -08:00
Kartik Agaram 0657fc16ab 6081 - ctags for .mu files 2020-03-05 18:29:00 -08:00
Kartik Agaram 16962ee468 6080 2020-03-05 18:25:37 -08:00
Kartik Agaram 0c89528a38 6079 - optimize register spills
The second var to the same register in a block doesn't need to spill. We're
never going to restore the var it's shadowing.
2020-03-05 18:06:55 -08:00
Kartik Agaram a53ab5266f 6078 - highlight hex literals in Vim 2020-03-02 08:23:54 -08:00
Kartik Agaram 32d5099fa0 6077 2020-03-02 08:20:53 -08:00
Kartik Agaram 189d34dcff 6076 2020-03-02 08:20:20 -08:00
Kartik Agaram 2d4d1440e1 6075 2020-03-02 08:18:08 -08:00
Kartik Agaram c984ace5c5 6074 2020-02-29 23:22:32 -08:00
Kartik Agaram 667865a95f 6073 2020-02-29 22:48:13 -08:00
Kartik Agaram 3ac95f6a40 6072
I just did an experiment, and test_apps is spending over half its time
(95 seconds that could be 40) redundantly clearing every stream before
every test. And some of those streams are _large_; the translators for
SubX and Mu use streams sizes pessimistically for the largest possible
program they can handle (1MB per segment).

A single test (the one in assort.subx) spends 24 seconds initializing 2
1MB-segments before processing a dozen lines of text in the blink of an
eye.

I'm not going to speed this up. But good to know.
2020-02-29 22:36:03 -08:00
Kartik Agaram 7ae5b71368 6071 - array indexing for non-int power-of-2 types 2020-02-29 06:49:47 -08:00
Kartik Agaram af326d9e39 6070 2020-02-29 05:53:13 -08:00
Kartik Agaram c51f590273 6069 2020-02-29 05:33:03 -08:00
Kartik Agaram 3a2285c613 6068 2020-02-28 11:23:22 -08:00
Kartik Agaram eaeb43de2e 6067
Missed the file.
2020-02-28 11:19:53 -08:00
Kartik Agaram e62702efd0 6066 - Vim syntax highlighting for Mu files 2020-02-28 11:10:17 -08:00
Kartik Agaram a09ab577d1 6065 2020-02-27 21:29:10 -08:00
Kartik Agaram 17c46e0b8c 6064
Fix CI.
2020-02-27 21:28:02 -08:00
Kartik Agaram 067e1d5102 6063 2020-02-27 18:50:24 -08:00
Kartik Agaram 8dc8f2f65d 6062 2020-02-27 18:49:50 -08:00
Kartik Agaram 6163a55370 6061 2020-02-27 17:31:35 -08:00
Kartik Agaram 2aba46d781 6060 2020-02-27 17:03:08 -08:00
Kartik Agaram 8b85a07f97 6059 2020-02-27 17:03:00 -08:00
Kartik Agaram a9151143f3 6058 2020-02-27 16:54:34 -08:00
Kartik Agaram be8799b795 6057 2020-02-27 16:53:32 -08:00
Kartik Agaram 510a5429ca 6056 2020-02-27 16:47:40 -08:00
Kartik Agaram 127a3dae73 6055 - record types and the 'get' instruction
This is a lot of code for a single test, and it took a long time to get
my data model just right. But the test coverage seems ok because it feels
mostly like straight-line code. We'll see.

I've also had to add a lot of prints. We really need app-level trace generation
pretty urgently. That requires deciding how to turn it on/off from the
commandline. And I've been reluctant to start relying on the hairy interface
that is POSIX open().
2020-02-27 16:43:00 -08:00
Kartik Agaram 5f3c324783 6054 2020-02-24 22:46:45 -08:00
Kartik Agaram 08b9511af5 6053 2020-02-23 14:42:22 -08:00
Kartik Agaram aa1e2b4f5d 6052 2020-02-23 14:36:18 -08:00
Kartik Agaram d9b2b78e96 6051 2020-02-23 00:35:02 -08:00
Kartik Agaram 5a405cb2e0 6050 2020-02-23 00:30:31 -08:00
Kartik Agaram 74953f1e18 6049 2020-02-23 00:11:59 -08:00
Kartik Agaram 43aa0fe310 6048 2020-02-21 20:29:08 -08:00
Kartik Agaram f19abd4dd1 6047 2020-02-21 19:38:11 -08:00
Kartik Agaram de4664ffb9 6046
One more error condition when desugaring function calls in SubX.
2020-02-21 18:43:16 -08:00