summaryrefslogtreecommitdiff
path: root/compiler/codeGen
Commit message (Collapse)AuthorAgeFilesLines
...
| * | | | Merge branch 'master' into type-natsIavor S. Diatchki2011-12-294-13/+13
| |\ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Conflicts: compiler/typecheck/TcCanonical.lhs compiler/typecheck/TcSMonad.lhs
| * | | | | Extend GHC's type with a representation for type level literals.Iavor S. Diatchki2011-12-182-0/+13
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Currently, we support only numeric literals but---hopefully---these modifications should make it fairly easy to add other ones, if necessary.
* | | | | | Improve the case-alternative heap checksSimon Marlow2012-03-072-32/+89
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The code we were generating for heap-checks in algebraic case alternatives wasn't working well with the common-block eliminator. A small tweak to make the heap-check failure jump back to the same place in all branches lets the common-block eliminator squash more code.
* | | | | | fix the updfr offset in the jump after newCAFSimon Marlow2012-03-071-1/+2
| | | | | |
* | | | | | Lower safe foreign calls in the new CmmLayoutStackSimon Marlow2012-03-065-93/+189
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We also generate much better code for safe foreign calls (and maybe also unsafe foreign calls) than previously. See the two new Notes: Note [lower safe foreign calls] Note [safe foreign call convention]
* | | | | | New codegen: fix bad code for comparisons (see Note [case on bool])Simon Marlow2012-02-152-44/+79
| | | | | |
* | | | | | Merge remote-tracking branch 'laptop/newcg' into newcgMe at work2012-02-1417-1225/+1223
|\ \ \ \ \ \
| * | | | | | merge fixesSimon Marlow2012-02-131-45/+1
| | | | | | |
| * | | | | | Merge remote-tracking branch 'origin/master' into newcgSimon Marlow2012-02-1317-1216/+1258
| |\ \ \ \ \ \ | | | |_|_|_|/ | | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Conflicts: compiler/cmm/CmmLint.hs compiler/cmm/OldCmm.hs compiler/codeGen/CgMonad.lhs compiler/main/CodeOutput.lhs
| | * | | | | Port "Add new primtypes 'ArrayArray#' and 'MutableArrayArray#'" to new codegen.Edward Z. Yang2012-01-201-2/+22
| | | |_|_|/ | | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 021a0dd265ff34c1e292813c06185eff1d6b5c1c appears to have only partially added the new primops associated with ArrayArray# and MutableArrayArray# Signed-off-by: Edward Z. Yang <ezyang@mit.edu>
| | * | | | Track STG live register information for use in LLVMDavid Terei2012-01-098-94/+127
| | | |_|/ | | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We now carry around with CmmJump statements a list of the STG registers that are live at that jump site. This is used by the LLVM backend so it can avoid unnesecarily passing around dead registers, improving perfromance. This gives us the framework to finally fix trac #4308.
| | * | | Remove unused arg field of CmmReturnDavid Terei2012-01-051-1/+1
| | | | |
| | * | | Remove unused argument field on CmmJumpDavid Terei2012-01-057-13/+13
| | | | |
| | * | | Formatting fixesDavid Terei2012-01-054-526/+508
| | | | |
| | * | | Tabs -> Spaces + formatting fixesDavid Terei2012-01-052-596/+559
| | | | |
| | * | | Fixup to 4464c92badaedc45ce53d6349f6790f6d2298103Simon Marlow2012-01-052-2/+5
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Instead of enterLocalIdLabel we should get the label from the ClosureInfo, because that knows better whether the label should be local or not. Needed by #5357
| | * | | Rename the CCCS field of StgTSO so as not to conflict with the CCCS ↵Simon Marlow2012-01-052-2/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | pseudo-register Needed by #5357
| | * | | remove dead codeSimon Marlow2012-01-041-6/+1
| | | | |
| | * | | mkSlowEntryCode: enterLocalIdLabel should be enterIdLabel (#5357)Simon Marlow2012-01-041-1/+1
| | | |/ | | |/|
| | * | Add a class HasDynFlags(getDynFlags)Ian Lynagh2011-12-192-4/+4
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | We no longer have many separate, clashing getDynFlags functions I've given each GhcMonad its own HasDynFlags instance, rather than using UndecidableInstances to make a GhcMonad m => HasDynFlags m instance.
* | | | tiny refactoringSimon Marlow2012-02-141-1/+1
| | | |
* | | | Fix an SRT-related bugSimon Marlow2012-02-146-73/+48
|/ / / | | | | | | | | | | | | | | | | | | | | | We were using the SRT information generated by the computeSRTs pass to decide whether to add a static link field to a constructor or not, and this broke when I disabled computeSRTs for the new code generator. So I've hacked it for now to only rely on the SRT information generated by CoreToStg.
* | | fix a bugSimon Marlow2012-02-081-2/+2
| | |
* | | New stack layout algorithmSimon Marlow2012-02-088-73/+222
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Also: - improvements to code generation: push slow-call continuations on the stack instead of generating explicit continuations - remove unused CmmInfo wrapper type (replace with CmmInfoTable) - squash Area and AreaId together, remove now-unused RegSlot - comment out old unused stack-allocation code that no longer compiles after removal of RegSlot
* | | Make the old codegen run in constant space tooSimon Marlow2012-01-272-29/+41
| | |
* | | Don't call computeSRTs when -fnew-codegen is onSimon Marlow2012-01-271-1/+1
| | |
* | | Run the complete backend (Stg -> .S) incrementally on each StgBindingSimon Marlow2012-01-262-88/+62
| | | | | | | | | | | | | | | | | | | | | | | | | | | This is so that we can process the Stg code in constant space. Before we were generating all the C-- up front, leading to a large space leak. I haven't converted the LLVM or C back ends to the incremental scheme, but it's not hard to do.
* | | small refactorSimon Marlow2012-01-251-5/+6
| | |
* | | Different implementation of MkGraphSimon Marlow2012-01-2510-189/+281
|/ /
* | Fix alignment in the CostCentre struct (#5710)Simon Marlow2011-12-192-9/+9
|/
* Fix comment in CgMonadDavid Terei2011-12-111-6/+5
|
* Merge branch 'master' of http://darcs.haskell.org/ghcDavid Terei2011-12-082-4/+34
|\
| * Add new primtypes 'ArrayArray#' and 'MutableArrayArray#'Manuel M T Chakravarty2011-12-072-4/+34
| | | | | | | | | | | | | | | | The primitive array types, such as 'ByteArray#', have kind #, but are represented by pointers. They are boxed, but unpointed types (i.e., they cannot be 'undefined'). The two categories of array types —[Mutable]Array# and [Mutable]ByteArray#— are containers for unboxed (and unpointed) as well as for boxed and pointed types. So far, we lacked support for containers for boxed, unpointed types (i.e., containers for the primitive arrays themselves). This is what the new primtypes provide. Containers for boxed, unpointed types are crucial for the efficient implementation of scattered nested arrays, which are central to the new DPH backend library dph-lifted-vseg. Without such containers, we cannot eliminate all unboxing from the inner loops of traversals processing scattered nested arrays.
* | Tabs -> Spaces + FormattingDavid Terei2011-12-062-436/+402
|/
* Code generation: Always ask for result of newSparkPeter Wortmann2011-12-052-2/+5
| | | | | | | Otherwise the LLVM backend gets confused over whether its type should be "void (i8*, i8*)" or "i64 (i8*, i8*)". Signed-off-by: David Terei <davidterei@gmail.com>
* Remove dead function.David Terei2011-12-031-23/+1
|
* Merge branch 'master' of http://darcs.haskell.org/ghcIan Lynagh2011-12-022-3/+15
|\
| * More changes aimed at improving call stacks.Simon Marlow2011-12-022-3/+15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Attach a SrcSpan to every CostCentre. This had the side effect that CostCentres that used to be merged because they had the same name are now considered distinct; so I had to add a Unique to CostCentre to give them distinct object-code symbols. - New flag: -fprof-auto-calls. This flag adds an automatic SCC to every call site (application, to be precise). This is typically more useful for call stacks than annotating whole functions. Various tidy-ups at the same time: removed unused NoCostCentre constructor, and refactored a bit in Coverage.lhs. The call stack we get from traceStack now looks like this: Stack trace: Main.CAF (<entire-module>) Main.main.xs (callstack002.hs:18:12-24) Main.map (callstack002.hs:13:12-16) Main.map.go (callstack002.hs:15:21-34) Main.map.go (callstack002.hs:15:21-23) Main.f (callstack002.hs:10:7-43)
* | Move includes/DerivedConstants.h and includes/GHCConstants.h into dist dirsIan Lynagh2011-12-024-4/+4
|/ | | | | | | | | | When they existed, they were getting included in the includes_H_FILES variable (as it uses wildcard to find all header files). But the .depends files for the programs that generate the headers depend on $(includes_H_FILES), so the .depends files looked out-of-date once the headers had been created. This caused unnecessary make reinvocations. So now we put them in dist* directories, where they ought to be anyway.
* Further tweaks to the ccs primopsSimon Marlow2011-11-302-2/+16
| | | | | | | | | - add getCCSOf# :: a -> State# s -> (# State# s, Addr# #) (returns the CCS attached to the supplied object) - remove traceCcs# (obsoleted by getCCSOf#) - rename getCCCS# to getCurrentCCS#
* Add a new primop: getCCCS# :: State# s -> (# State# s, Addr# #)Simon Marlow2011-11-292-0/+5
| | | | | Returns a pointer to the current cost-centre stack when profiling, NULL otherwise.
* Make profiling work with multiple capabilities (+RTS -N)Simon Marlow2011-11-298-30/+38
| | | | | | | | | | | | | | | | | | | | | | | | | | | This means that both time and heap profiling work for parallel programs. Main internal changes: - CCCS is no longer a global variable; it is now another pseudo-register in the StgRegTable struct. Thus every Capability has its own CCCS. - There is a new built-in CCS called "IDLE", which records ticks for Capabilities in the idle state. If you profile a single-threaded program with +RTS -N2, you'll see about 50% of time in "IDLE". - There is appropriate locking in rts/Profiling.c to protect the shared cost-centre-stack data structures. This patch does enough to get it working, I have cut one big corner: the cost-centre-stack data structure is still shared amongst all Capabilities, which means that multiple Capabilities will race when updating the "allocations" and "entries" fields of a CCS. Not only does this give unpredictable results, but it runs very slowly due to cache line bouncing. It is strongly recommended that you use -fno-prof-count-entries to disable the "entries" count when profiling parallel programs. (I shall add a note to this effect to the docs).
* Get rid of the "safety" field of CmmCall (OldCmm)Simon Marlow2011-11-294-24/+18
| | | | | This field was doing nothing. I think it originally appeared in a very old incarnation of the new code generator.
* Remove a tabs kludgeIan Lynagh2011-11-261-7/+0
|
* Whitespace only in codeGen/CgForeignCall.hsIan Lynagh2011-11-261-101/+101
|
* Whitespace only in codeGen/StgCmmForeign.hsIan Lynagh2011-11-261-99/+92
|
* GHC gets a new constraint solver. More efficient and smaller in size.Dimitrios Vytiniotis2011-11-161-4/+14
|
* fix profiling bug in copyArray#/cloneArray# (cgrun068(profasm) segfault)Simon Marlow2011-11-142-2/+2
|
* Cost centre names are now in UTF-8 (#5559)Simon Marlow2011-11-072-6/+10
| | | | | | | | | So the .prof file will be UTF-8. This is mostly ok, except that the RTS doesn't calculate the column widths correctly (it assumes bytes = chars). hp2ps doesn't do anything sensible with Unicode strings, it just dumps the bytes into the .ps file.
* Allow the use of R9 and R10 in primops; fixes trac #5423Ian Lynagh2011-11-062-0/+18
|