Produce new-style Cmm from the Cmm parser

The main change here is that the Cmm parser now allows high-level cmm code with argument-passing and function calls. For example: foo ( gcptr a, bits32 b ) { if (b > 0) { // we can make tail calls passing arguments: jump stg_ap_0_fast(a); } return (x,y); } More details on the new cmm syntax are in Note [Syntax of .cmm files] in CmmParse.y. The old syntax is still more-or-less supported for those occasional code fragments that really need to explicitly manipulate the stack. However there are a couple of differences: it is now obligatory to give a list of live GlobalRegs on every jump, e.g. jump %ENTRY_CODE(Sp(0)) [R1]; Again, more details in Note [Syntax of .cmm files]. I have rewritten most of the .cmm files in the RTS into the new syntax, except for AutoApply.cmm which is generated by the genapply program: this file could be generated in the new syntax instead and would probably be better off for it, but I ran out of enthusiasm. Some other changes in this batch: - The PrimOp calling convention is gone, primops now use the ordinary NativeNodeCall convention. This means that primops and "foreign import prim" code must be written in high-level cmm, but they can now take more than 10 arguments. - CmmSink now does constant-folding (should fix #7219) - .cmm files now go through the cmmPipeline, and as a result we generate better code in many cases. All the object files generated for the RTS .cmm files are now smaller. Performance should be better too, but I haven't measured it yet. - RET_DYN frames are removed from the RTS, lots of code goes away - we now have some more canned GC points to cover unboxed-tuples with 2-4 pointers, which will reduce code size a little.
author: Simon Marlow <marlowsd@gmail.com> 2012-10-03 09:30:56 +0100
committer: Simon Marlow <marlowsd@gmail.com> 2012-10-08 09:04:40 +0100
commit: a7c0387d20c1c9994d1100b14fbb8fb4e28a259e (patch)
tree: b95d0a512f951a4a463f1aa5178b0cd5c4fdb410 /compiler/nativeGen/RegAlloc/Linear/Main.hs
parent: aed37acd4d157791381800d5de960a2461bcbef3 (diff)
download: haskell-a7c0387d20c1c9994d1100b14fbb8fb4e28a259e.tar.gz
1 files changed, 12 insertions, 1 deletions
diff --git a/compiler/nativeGen/RegAlloc/Linear/Main.hs b/compiler/nativeGen/RegAlloc/Linear/Main.hs
index a15bca07e7..3f1efe5824 100644
--- a/compiler/nativeGen/RegAlloc/Linear/Main.hs
+++ b/compiler/nativeGen/RegAlloc/Linear/Main.hs
@@ -434,10 +434,21 @@ raInsn block_live new_instrs id (LiveInstr (Instr instr) (Just live))
                         (uniqSetToList $ liveDieRead live)
                         (uniqSetToList $ liveDieWrite live)
 
-
 raInsn _ _ _ instr
         = pprPanic "raInsn" (text "no match for:" <> ppr instr)
 
+-- ToDo: what can we do about
+--
+--     R1 = x
+--     jump I64[x] // [R1]
+--
+-- where x is mapped to the same reg as R1.  We want to coalesce x and
+-- R1, but the register allocator doesn't know whether x will be
+-- assigned to again later, in which case x and R1 should be in
+-- different registers.  Right now we assume the worst, and the
+-- assignment to R1 will clobber x, so we'll spill x into another reg,
+-- generating another reg->reg move.
+
 
 isInReg :: Reg -> RegMap Loc -> Bool
 isInReg src assig | Just (InReg _) <- lookupUFM assig src = True
author	Simon Marlow <marlowsd@gmail.com>	2012-10-03 09:30:56 +0100
committer	Simon Marlow <marlowsd@gmail.com>	2012-10-08 09:04:40 +0100
commit	a7c0387d20c1c9994d1100b14fbb8fb4e28a259e (patch)
tree	b95d0a512f951a4a463f1aa5178b0cd5c4fdb410 /compiler/nativeGen/RegAlloc/Linear/Main.hs
parent	aed37acd4d157791381800d5de960a2461bcbef3 (diff)
download	haskell-a7c0387d20c1c9994d1100b14fbb8fb4e28a259e.tar.gz