| Commit message | Author | Age | Files | Lines |
|
|
|
|
|
|
|
|
|
|
|
| |
As noted in #18043, flushTrace failed to flush anything beyond the writer.
This means that a significant amount of data sitting in capability-local
event buffers may never get flushed, despite the users' pleas for us to
flush.
Fix this by making flushEventLog flush all of the event buffers before
flushing the writer.
Fixes #18043.
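For illustration, a minimal sketch of the flushing order described above; the
helper names (flushCapEventsBuf, flushWriter) are hypothetical stand-ins for
the RTS's own routines:
/* Hypothetical helpers standing in for the RTS's own routines. */
void flushCapEventsBuf(int cap);   /* drain one capability-local event buffer */
void flushWriter(void);            /* flush the underlying writer */

void flushEventLogSketch(int n_capabilities)
{
    /* First drain every capability-local buffer into the writer... */
    for (int i = 0; i < n_capabilities; i++) {
        flushCapEventsBuf(i);
    }
    /* ...and only then flush the writer itself, so nothing is left behind. */
    flushWriter();
}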
|
|
|
|
|
|
|
| |
We place symbol_extras right after bss. We also need
to ensure that symbol_extras can be mprotect'd independently from the
rest of the image. To ensure this we round up the size of bss to a page
boundary, thus ensuring that symbol_extras is also page-aligned.
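For illustration, a small sketch of the rounding described above, assuming the
usual power-of-two page size; getpagesize() is POSIX and the helper name is
made up:
#include <unistd.h>
#include <stddef.h>

/* Round sz up to the next page boundary so that whatever follows (here,
 * symbol_extras) starts page-aligned and can be mprotect'd independently
 * of the preceding bss section. */
static size_t round_up_to_page(size_t sz)
{
    size_t page = (size_t) getpagesize();   /* page size is a power of two */
    return (sz + page - 1) & ~(page - 1);
}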
|
|
|
|
|
|
|
|
| |
This adds the necessary logic to support AArch64 on ELF, as well
as AArch64 on Mach-O, which Apple calls arm64.
We change the architecture name to AArch64, which is the official ARM
naming scheme.
|
|
|
|
|
|
|
|
| |
These are used to find the current roots of the garbage collector.
Co-authored-by: Sven Tennie <sven.tennie@gmail.com>
Co-authored-by: Matthew Pickering <matthewtpickering@gmail.com>
Co-authored-by: Ben Gamari <bgamari.foss@gmail.com>
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
(This change was originally written by niteria)
This adds two functions:
* `loadNativeObj`
* `unloadNativeObj`
and implements them for Linux.
They are useful if you want to load a shared object with Haskell code
using the system linker and have GHC call dlclose() after the
code is no longer referenced from the heap.
Using the system linker allows the shared object to be loaded
outside the low-mem region. It also loads the DWARF sections
in a way that `perf` understands.
`dl_iterate_phdr` is what makes this implementation Linux-specific.
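As a rough illustration of why `dl_iterate_phdr` ties this to Linux, here is a
hedged sketch (not the RTS code) that dlopen()s an object and then walks the
loaded program headers to see where the system linker placed each object; the
library name is hypothetical:
#define _GNU_SOURCE
#include <dlfcn.h>
#include <link.h>      /* dl_iterate_phdr, struct dl_phdr_info (glibc/Linux) */
#include <stdio.h>

static int print_phdr(struct dl_phdr_info *info, size_t size, void *data)
{
    (void) size; (void) data;
    printf("%s loaded at base %p\n",
           info->dlpi_name[0] ? info->dlpi_name : "(main program)",
           (void *) info->dlpi_addr);
    return 0;          /* keep iterating over all loaded objects */
}

int main(void)
{
    /* Loading via the system linker means the object may land anywhere in
     * the address space, not in a low-mem region. */
    void *h = dlopen("libexample.so", RTLD_NOW | RTLD_LOCAL);
    dl_iterate_phdr(print_phdr, NULL);
    if (h) dlclose(h);  /* the RTS defers this until the code is unreachable */
    return 0;
}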
|
|
|
|
|
|
|
| |
Fixes #16525 by tracking dependencies between object file symbols and
marking symbol liveness during garbage collection.
See Note [Object unloading] in CheckUnload.c for details.
|
|
|
|
|
|
| |
Co-authored-by: Sven Tennie <sven.tennie@gmail.com>
Co-authored-by: Matthew Pickering <matthewtpickering@gmail.com>
Co-authored-by: Ben Gamari <bgamari.foss@gmail.com>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Previously the overflow check for the IMAGE_REL_AMD64_ADDR32NB
relocation failed to account for the signed nature of the value.
Specifically, the overflow check was:
uint64_t v;
v = S + A;
if (v >> 32) { ... }
However, `v` ultimately needs to fit into 32 bits as a signed value.
Consequently, values `v > 2^31` in fact overflow, yet this is not caught
by the existing overflow check.
Here we rewrite the overflow check to instead ensure that
`INT32_MIN <= v <= INT32_MAX`. There is now quite a bit of repetition
between the `IMAGE_REL_AMD64_REL32` and `IMAGE_REL_AMD64_ADDR32` cases,
but I am leaving fixing this for future work.
This bug was first noticed by @awson.
Fixes #15808.
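A hedged sketch of the corrected check, reusing `S` and `A` from the snippet
quoted above (their definitions come from the surrounding relocation code):
uint64_t v;
v = S + A;
/* The relocated value is ultimately a signed 32-bit quantity, so check
 * the full signed range instead of only testing the high 32 bits. */
if ((int64_t) v < INT32_MIN || (int64_t) v > INT32_MAX) {
    /* report overflow / bail out */
}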
|
|\ |
|
| |\ |
|
| | |
| | |
| | |
| | | |
Since the latter wants to call getRTSStats.
|
| | |
| | |
| | |
| | |
| | | |
While at face value this seems a bit heavy, I think it's far better than
enforcing ordering on every access.
|
| | | |
|
| |\ \ |
|
| | | |
| | | |
| | | |
| | | |
| | | | |
We can generally be pretty relaxed in the barriers here since the timer
thread is a loop.
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Previously `initScheduler` would attempt to pause the ticker and in so
doing acquire the ticker mutex. However, initTicker, which is
responsible for initializing said mutex, hadn't been called
yet.
|
| | | | |
|
| | | | |
|
| | | |
| | | |
| | | |
| | | | |
This avoids #17289.
|
| | |/ |
|
| |\ \ |
|
| | | |
| | | |
| | | |
| | | | |
This suppresses the other side of a race during shutdown.
|
| | |/ |
|
| |\ \ |
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Previously the `current_value`, `first_watch_queue_entry`, and
`num_updates` fields of `StgTVar` were marked as `volatile` in an
attempt to provide strong ordering. Of course, this isn't sufficient.
We now use proper atomic operations. In most of these cases I strengthen
the ordering all the way to SEQ_CST although it's possible that some
could be weakened with some thought.
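As an illustration of the change in flavour (not the actual RTS code, which
uses its own atomics macros), a simplified stand-in for `StgTVar` accessed
with C11 sequentially-consistent operations might look like this:
#include <stdatomic.h>

struct demo_tvar {                      /* simplified stand-in for StgTVar */
    _Atomic(void *) current_value;
    _Atomic(long)   num_updates;
};

void *read_current_value(struct demo_tvar *t)
{
    /* SEQ_CST load; as noted above, some of these could likely be weakened. */
    return atomic_load_explicit(&t->current_value, memory_order_seq_cst);
}

void bump_updates(struct demo_tvar *t)
{
    atomic_fetch_add_explicit(&t->num_updates, 1, memory_order_seq_cst);
}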
|
| | |/
| | |
| | |
| | |
| | |
| | |
| | |
| | | |
This fixes a potentially harmful race where we failed to synchronize
before looking at a TVar's current_value.
Also did a bit of refactoring to abstract over the management of
max_commits.
|
| |\ \ |
|
| | | |
| | | |
| | | |
| | | |
| | | | |
Here we are doing lazy initialization; it's okay if we do the check more
than once, hence relaxed operation is fine.
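A small sketch of why relaxed ordering suffices here: the initialization is
idempotent, so racing threads may each redo it and store the same value, and
no ordering with other memory operations is needed. The names are made up:
#include <stdatomic.h>

static _Atomic long cached = 0;         /* 0 means "not computed yet" */

long expensive_but_pure(void);          /* always returns the same nonzero value */

long get_cached(void)
{
    long v = atomic_load_explicit(&cached, memory_order_relaxed);
    if (v == 0) {
        /* Racing threads may each recompute, but they all store the same
         * value, so relaxed load/store is fine. */
        v = expensive_but_pure();
        atomic_store_explicit(&cached, v, memory_order_relaxed);
    }
    return v;
}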
|
| | | |
| | | |
| | | |
| | | | |
Fixes #17275.
|
| | |/ |
|
| |\ \ |
|
| | |/
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | | |
After a few attempts at shoring up the previous implementation, I ended
up turning to the literature and now use the proven implementation from
> N.M. Lê, A. Pop, A. Cohen, and F. Z. Nardelli. "Correct and Efficient
> Work-Stealing for Weak Memory Models". PPoPP'13, February 2013,
> ACM 978-1-4503-1922/13/02.
Not only is this approach formally proven correct under C11 semantics,
but it also proves to be a bit faster in practice.
|
| |\ \ |
|
| | | | |
|
| | | | |
|
| | | | |
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Not only is this in general a good idea, but it turns out that GCC
unrolls the retry loop, resulting in massive code bloat in critical
parts of the RTS (e.g. `evacuate`).
|
| | | |
| | | |
| | | |
| | | |
| | | | |
Ensure that the GC leader synchronizes with workers before calling
stat_endGC.
|
| | | |
| | | |
| | | |
| | | |
| | | | |
Previously we would take all capabilities but fail to join on the thread
itself, potentially resulting in a leaked thread.
|
| | | | |
|
| | | | |
|
| | | | |
|
| | | |
| | | |
| | | |
| | | |
| | | | |
By taking all_tasks_mutex in stat_exit. Also better-document the fact
that the task statistics are protected by all_tasks_mutex.
|
| | | | |
|
| | | | |
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
This fixes two potentially problematic data races in the StablePtr
implementation:
* We would fail to RELEASE the stable pointer table when enlarging it,
causing other cores to potentially see uninitialized memory.
* We would fail to ACQUIRE when dereferencing a stable pointer.
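A hedged sketch of the pairing described above, using C11 atomics rather than
the RTS's own macros: the enlarged table is published with a release store,
and readers load the table pointer with acquire before indexing into it. The
types and names are simplified stand-ins:
#include <stdatomic.h>
#include <stdlib.h>
#include <string.h>

typedef struct { void **entries; size_t size; } spt_table;  /* stand-in table */

static _Atomic(spt_table *) table;      /* hypothetical global stable-ptr table */

void enlarge_table(size_t new_size)     /* assumes new_size >= old size */
{
    spt_table *old = atomic_load_explicit(&table, memory_order_acquire);
    spt_table *new = malloc(sizeof *new);
    new->entries = calloc(new_size, sizeof(void *));
    new->size = new_size;
    if (old) memcpy(new->entries, old->entries, old->size * sizeof(void *));
    /* RELEASE: make the fully initialized table visible before publishing it.
     * (Reclaiming the old table is omitted in this sketch.) */
    atomic_store_explicit(&table, new, memory_order_release);
}

void *deref_stable_ptr(size_t idx)
{
    /* ACQUIRE: pairs with the release above, so we never read an
     * uninitialized entries array through a freshly published table. */
    spt_table *t = atomic_load_explicit(&table, memory_order_acquire);
    return t->entries[idx];
}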
|
| | | | |
|
| | |/ |
|
| | |
| | |
| | |
| | |
| | | |
This is necessary since emptyInbox may read from to_cap->inbox without
taking cap->lock.
|