ruby.git - The Ruby Programming Language

Age	Commit message (Collapse)	Author
7 days	Take file and line in GC VM locks	Peter Zhu
	This commit adds file and line to GC VM locking functions for debugging purposes and adds upper case macros to pass __FILE__ and __LINE__. Notes: Merged: https://github.com/ruby/ruby/pull/13550
13 days	Allow pass special constants to the write barrier	Peter Zhu
	Some GC implementations want to always know when an object is written to, even if the written value is a special constant. Checking special constants in rb_obj_written was a micro-optimization that made assumptions about the GC implementation. Notes: Merged: https://github.com/ruby/ruby/pull/13497
2025-06-02	Remove dependancy of default.c on internal/object.h	Peter Zhu
	We don't want the default GC to depend on Ruby internals so we can build it as a modular GC. Notes: Merged: https://github.com/ruby/ruby/pull/13476
2025-05-31	`Ractor::Port`	Koichi Sasada
	* Added `Ractor::Port` * `Ractor::Port#receive` (support multi-threads) * `Rcator::Port#close` * `Ractor::Port#closed?` * Added some methods * `Ractor#join` * `Ractor#value` * `Ractor#monitor` * `Ractor#unmonitor` * Removed some methods * `Ractor#take` * `Ractor.yield` * Change the spec * `Racotr.select` You can wait for multiple sequences of messages with `Ractor::Port`. ```ruby ports = 3.times.map{ Ractor::Port.new } ports.map.with_index do \|port, ri\| Ractor.new port,ri do \|port, ri\| 3.times{\|i\| port << "r#{ri}-#{i}"} end end p ports.each{\|port\| pp 3.times.map{port.receive}} ``` In this example, we use 3 ports, and 3 Ractors send messages to them respectively. We can receive a series of messages from each port. You can use `Ractor#value` to get the last value of a Ractor's block: ```ruby result = Ractor.new do heavy_task() end.value ``` You can wait for the termination of a Ractor with `Ractor#join` like this: ```ruby Ractor.new do some_task() end.join ``` `#value` and `#join` are similar to `Thread#value` and `Thread#join`. To implement `#join`, `Ractor#monitor` (and `Ractor#unmonitor`) is introduced. This commit changes `Ractor.select()` method. It now only accepts ports or Ractors, and returns when a port receives a message or a Ractor terminates. We removes `Ractor.yield` and `Ractor#take` because: * `Ractor::Port` supports most of similar use cases in a simpler manner. * Removing them significantly simplifies the code. We also change the internal thread scheduler code (thread_pthread.c): * During barrier synchronization, we keep the `ractor_sched` lock to avoid deadlocks. This lock is released by `rb_ractor_sched_barrier_end()` which is called at the end of operations that require the barrier. * fix potential deadlock issues by checking interrupts just before setting UBF. https://bugs.ruby-lang.org/issues/21262 Notes: Merged: https://github.com/ruby/ruby/pull/13445
2025-05-26	Add shape_id to RBasic under 32 bit	John Hawthorn
	This makes `RBobject` `4B` larger on 32 bit systems but simplifies the implementation a lot. [Feature #21353] Co-authored-by: Jean Boussier <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/13341
2025-05-23	Drop unnecessary compiler guards for memory_sanitizer	Peter Zhu
	We unpoison slots allocated out of the GC, so we don't need to disable the assertions that read from the memory. Notes: Merged: https://github.com/ruby/ruby/pull/13351
2025-05-23	Remove dependency on bits.h in default.c when BUILDING_MODULAR_GC	Peter Zhu
	We can assume that the compiler will have __builtin_clzll so we can implement nlz_int64 using that. Notes: Merged: https://github.com/ruby/ruby/pull/13351
2025-05-20	Only define RVALUE_OVERHEAD if undefined	Peter Zhu
	This allows RVALUE_OVERHEAD to be defined elsewhere. Notes: Merged: https://github.com/ruby/ruby/pull/13381
2025-05-16	gc: Execute run_final with the lock held	Jean Boussier
	The finalizer table can't be read nor modified without the VM lock. Notes: Merged: https://github.com/ruby/ruby/pull/13350
2025-05-16	rb_gc_impl_copy_finalizer: generate a new object id	Jean Boussier
	Fix a regression introduced by: https://github.com/ruby/ruby/pull/13155 Notes: Merged: https://github.com/ruby/ruby/pull/13350
2025-05-16	Add missing lock to `rb_gc_impl_copy_finalizer`	Jean Boussier
	Notes: Merged: https://github.com/ruby/ruby/pull/13350
2025-05-15	Remove dependency on sanitizers.h in default.c when BUILDING_MODULAR_GC	Peter Zhu
	Notes: Merged: https://github.com/ruby/ruby/pull/13340
2025-05-15	Add missing lock in `rb_gc_impl_undefine_finalizer`	Jean Boussier
	The table is global so accesses must be synchronized. Notes: Merged: https://github.com/ruby/ruby/pull/13349
2025-05-15	newobj_fill: don't assume RBasic size	Jean Boussier
	The previous implementation assumed `RBasic` size is `2 * sizeof(VALUE)`, might as well not make assumption and use a proper `sizeof`. Co-Authored-By: John Hawthorn <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/13348
2025-05-09	Only clear Ractor cache in child	John Hawthorn
	This avoids a race condition where we were clearing the cache from another ractor while it was in use. Oops! Fixes this failure http://ci.rvm.jp/results/master@oci-aarch64/5750416 Notes: Merged: https://github.com/ruby/ruby/pull/13294
2025-05-09	Remove dead code in `rb_gc_impl_ractor_cache_free`	Jean Boussier
	Followup: https://github.com/ruby/ruby/pull/13286 Notes: Merged: https://github.com/ruby/ruby/pull/13288
2025-05-09	Fix allocation count when forking with Ractors	John Hawthorn
	After fork we reset to single ractor mode (which IMO we shouldn't do, but it requires more work to fix) and so we need to add the pending object counts back to the main heap. Notes: Merged: https://github.com/ruby/ruby/pull/13286
2025-05-08	Remove dependency on debug_counter.h when BUILDING_MODULAR_GC	Peter Zhu
	This allows the default GC to not need debug_counter.h when building as a modular GC. Notes: Merged: https://github.com/ruby/ruby/pull/13269
2025-05-08	Stop checking for USE_DEBUG_COUNTER in default.c	Peter Zhu
	We don't need to check for USE_DEBUG_COUNTER because the code is no-op if USE_DEBUG_COUNTER is not enabled. Notes: Merged: https://github.com/ruby/ruby/pull/13269
2025-05-08	Move `object_id` in object fields.	Jean Boussier
	And get rid of the `obj_to_id_tbl` It's no longer needed, the `object_id` is now stored inline in the object alongside instance variables. We still need the inverse table in case `_id2ref` is invoked, but we lazily build it by walking the heap if that happens. The `object_id` concern is also no longer a GC implementation concern, but a generic implementation. Co-Authored-By: Matt Valentine-House <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/13159
2025-05-07	Remove dependence on internal/hash.h for default GC	Peter Zhu
	Notes: Merged: https://github.com/ruby/ruby/pull/13264
2025-05-06	Improve correctness contention for allocated object counts	Étienne Barrié
	Currently the count of allocated object for a heap is incremented without regards to parallelism which leads to incorrect counts. By maintaining a local counter in the ractor newobj cache, and only syncing atomically with some granularity, we can improve the correctness without increasing contention. The allocated object count is also synced when the ractor is freed. Co-authored-by: Jean Boussier <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/13192
2025-04-25	Count metadata entries automatically from the names list	Nobuyoshi Nakada
	Notes: Merged: https://github.com/ruby/ruby/pull/13181
2025-04-25	Update `RB_GC_OBJECT_METADATA_ENTRY_COUNT`	Jean Boussier
	In cb1ea54bbf6cdf49c53f42720fec1a151069810c I added one more metadata flag, but didn't notice `RB_GC_OBJECT_METADATA_ENTRY_COUNT` had to be incremented. This should fix ASAN builds. Interestingly, bdb25959fb047af0358f33d7327b7752dca14aa4 already caused the count to be off by one, so I had to increment it by 2. Notes: Merged: https://github.com/ruby/ruby/pull/13179
2025-04-24	objspace_dump: Include `shareable` flag	Jean Boussier
	Given that the currently planned ractor local GC implementation performance will heavilly be influenced by the number of shareable objects it would be valuable to be able to know how many of them are in the heap.
2025-04-23	Eagerly store a copy of `object_id` in finalizer table.	Jean Boussier
	This makes the finalizer table fully self contained, so GC no longer need to delay cleaning the `obj_to_id_tbl`. Notes: Merged: https://github.com/ruby/ruby/pull/13155
2025-04-23	rb_gc_impl_define_finalizer: unlock on early return	Jean Boussier

2025-04-22	Add missing lock in `rb_gc_impl_define_finalizer`	Jean Boussier
	`objspace->finalizer_table` must be synchronized, otherwise concurrent insertion from multiple ractors will cause a crash. Repro: ```ruby ractors = 5.times.map do \|i\| Ractor.new do 100_000.times.map do o = Object.new ObjectSpace.define_finalizer(o, ->(id) {}) o end end end ractors.each(&:take) ``` Notes: Merged: https://github.com/ruby/ruby/pull/13151
2025-04-15	Lazily create `objspace->id_to_obj_tbl`	Jean Boussier
	This inverse table is only useful if `ObjectSpace._id2ref` is used, which is extremely rare. The only notable exception is the `drb` gem and even then it has an option not to rely on `_id2ref`. So if we assume this table will never be looked up, we can just not maintain it, and if it turns out `_id2ref` is called, we can lock the VM and re-build it. ``` compare-ruby: ruby 3.5.0dev (2025-04-10T09:44:40Z master 684cfa42d7) +YJIT +PRISM [arm64-darwin24] built-ruby: ruby 3.5.0dev (2025-04-10T10:13:43Z lazy-id-to-obj d3aa9626cc) +YJIT +PRISM [arm64-darwin24] warming up.. \| \|compare-ruby\|built-ruby\| \|:----------\|-----------:\|---------:\| \|baseline \| 26.364M\| 25.974M\| \| \| 1.01x\| -\| \|object_id \| 10.293M\| 14.202M\| \| \| -\| 1.38x\| ``` Notes: Merged: https://github.com/ruby/ruby/pull/13115
2025-04-07	Grow GC heaps independently	Peter Zhu
	[Bug #21214] If we allocate objects where one heap holds transient objects and another holds long lived objects, then the heap with transient objects will grow along the heap with long lived objects, causing higher memory usage. For example, we can see this issue in this script: def allocate_small_object = [] def allocate_large_object = Array.new(10) arys = Array.new(1_000_000) do # Allocate 10 small transient objects 10.times { allocate_small_object } # Allocate 1 large object that is persistent allocate_large_object end pp GC.stat pp GC.stat_heap Before this change: heap_live_slots: 2837243 {0 => {slot_size: 40, heap_eden_pages: 1123, heap_eden_slots: 1838807}, 2 => {slot_size: 160, heap_eden_pages: 2449, heap_eden_slots: 1001149}, } After this change: heap_live_slots: 1094474 {0 => {slot_size: 40, heap_eden_pages: 58, heap_eden_slots: 94973}, 2 => {slot_size: 160, heap_eden_pages: 2449, heap_eden_slots: 1001149}, } Notes: Merged: https://github.com/ruby/ruby/pull/13061
2025-03-31	Don't preserve `object_id` when moving object to another Ractor	Jean Boussier
	That seemed like the logical thing to do to me, but ko1 disagree. Notes: Merged: https://github.com/ruby/ruby/pull/13008
2025-03-31	Ractor: Fix moving embedded objects	Jean Boussier
	[Bug #20271] [Bug #20267] [Bug #20255] `rb_obj_alloc(RBASIC_CLASS(obj))` will always allocate from the basic 40B pool, so if `obj` is larger than `40B`, we'll create a corrupted object when we later copy the shape_id. Instead we can use the same logic than ractor copy, which is to use `rb_obj_clone`, and later ask the GC to free the original object. We then must turn it into a `T_OBJECT`, because otherwise just changing its class to `RactorMoved` leaves a lot of ways to keep using the object, e.g.: ``` a = [1, 2, 3] Ractor.new{}.send(a, move: true) [].concat(a) # Should raise, but wasn't. ``` If it turns out that `rb_obj_clone` isn't performant enough for some uses, we can always have carefully crafted specialized paths for the types that would benefit from it. Notes: Merged: https://github.com/ruby/ruby/pull/13008
2025-03-25	Prefer FL_TEST_RAW() in GC on known on-heap objects	Alan Wu
	Was reading some assembly and noticed the dead branches generated for FL_TEST(). Just a quick basic pass to change the obvious places; there may be other opportunities. Notes: Merged: https://github.com/ruby/ruby/pull/12980 Merged-By: XrXr
2025-03-25	Make ruby_autocompact_compare_func static	Peter Zhu
	It's not used outside of default.c. Notes: Merged: https://github.com/ruby/ruby/pull/12964
2025-03-25	Make ruby_enable_autocompact static	Peter Zhu
	It's not used outside of defaut.c Notes: Merged: https://github.com/ruby/ruby/pull/12964
2025-03-13	Move object_id to flags for ObjectSpace dumps	Peter Zhu
	Moving object_id dumping from ObjectSpace to the GC flags allows ObjectSpace to not assume the FL_SEEN_OBJ_ID flag and instead move it to the responsibility of the GC. Notes: Merged: https://github.com/ruby/ruby/pull/12915
2025-02-19	Fix value of RB_GC_OBJECT_METADATA_ENTRY_COUNT	Peter Zhu
	There are 7 entries in RB_GC_OBJECT_METADATA_ENTRY_COUNT.
2025-02-19	Add age to rb_gc_object_metadata	Peter Zhu
	This will allow ObjectSpace.dump to output the age of the object. Notes: Merged: https://github.com/ruby/ruby/pull/12777
2025-02-19	Add rb_gc_object_metadata API	Peter Zhu
	This function replaces the internal rb_obj_gc_flags API. rb_gc_object_metadata returns an array of name and value pairs, with the last element having 0 for the name. Notes: Merged: https://github.com/ruby/ruby/pull/12777
2025-02-19	[wasm] Stop using mprotect(PROT_NONE) on WASI	Yuta Saito
	we had been using a stub weak definition of `mprotect` in wasm/missing.c so far, but wasi-sdk 23 added mprotect emulation to wasi-libc[^1], so the emulation is now linked instead. However, the emulation doesn't support PROT_NONE and fails with ENOSYS, so we need to avoid calling mprotect completely on WASI. [^1]: https://github.com/WebAssembly/wasi-libc/commit/7528b13170462c82e367d91ae0ecead84e470ceb Notes: Merged: https://github.com/ruby/ruby/pull/12776
2025-02-10	gc.c: Remove no-op code	Daisuke Aritomo
	In this context, `vm_locked` is a argument variable, and is not used later in the function. Notes: Merged: https://github.com/ruby/ruby/pull/12718
2025-01-29	Use an identity hash instead of array for stress_to_class	Peter Zhu
	Notes: Merged: https://github.com/ruby/ruby/pull/12664
2025-01-29	Fix GC.add_stress_to_class and GC.remove_stress_to_class	Peter Zhu
	These methods were accidentally removed in [Feature #20470]. This commit adds them back. Notes: Merged: https://github.com/ruby/ruby/pull/12664
2025-01-29	Suppress unused-value warnings	Nobuyoshi Nakada

2025-01-27	Fix gc_update_references_weak_table_i for ASAN	Peter Zhu
	If the object is a T_MOVED, then it is poisoned in ASAN, so we need to unpoison it before checking the type. Notes: Merged: https://github.com/ruby/ruby/pull/12644
2025-01-27	Use rb_gc_vm_weak_table_foreach for reference updating	Peter Zhu
	We can use rb_gc_vm_weak_table_foreach for reference updating of weak tables in the default GC. Notes: Merged: https://github.com/ruby/ruby/pull/12629
2025-01-22	Add generic ivar reference updating step	Peter Zhu
	Previously, generic ivars worked differently than the other global tables during compaction. The other global tables had their references updated through iteration during rb_gc_update_vm_references. Generic ivars updated the keys when the object moved and updated the values while reference updating the object. This is inefficient as this required one lookup for every moved object and one lookup for every object with generic ivars. Instead, this commit changes it to iterate over the generic ivar table to update both the keys and values. Notes: Merged: https://github.com/ruby/ruby/pull/12607
2025-01-05	gc/default/default.c: don't call `malloc_usable_size` when hint is present	Jean Boussier
	Depending on the allocator, `malloc_usable_size` may be very cheap or quite expensive. On `macOS` for instance, it's about as expensive as `malloc`. In many case we call `objspace_malloc_size` with as size we initially requested as `hint`. The real usable size may be a few bytes bigger, but since we only use that data to feed GC heuristics, I don't think it's very important to be perfectly accurate. It would make sense to call `malloc_usable_size` after growing a String or Array to use the extra capacity, but here we don't do that, so the call isn't worth its cost. Notes: Merged: https://github.com/ruby/ruby/pull/12490
2025-01-02	Use rb_darray_insert_without_gc for heap_pages darray	Peter Zhu
	rb_darray_insert could trigger a GC, which would cause problems if it freed pages while a new page was being inserted. For example, the following script fails: GC.stress = true GC.auto_compact = :empty 10.times do GC.verify_compaction_references(expand_heap: true, toward: :empty) end It errors out with: 'GC.verify_compaction_references': malloc: possible integer overflow (8*18446744073709551603) (ArgumentError) Notes: Merged: https://github.com/ruby/ruby/pull/12459
2025-01-02	Revert "Remove with_gc functions in darray"	Peter Zhu
	This reverts commit 24a740796050b72aa2d35339ba2a317d4eda7b75. Notes: Merged: https://github.com/ruby/ruby/pull/12459