ruby.git - The Ruby Programming Language

Age	Commit message (Collapse)	Author
5 days	Turn `rb_classext_t.fields` into a T_IMEMO/class_fields	Jean Boussier
	This behave almost exactly as a T_OBJECT, the layout is entirely compatible. This aims to solve two problems. First, it solves the problem of namspaced classes having a single `shape_id`. Now each namespaced classext has an object that can hold the namespace specific shape. Second, it open the door to later make class instance variable writes atomics, hence be able to read class variables without locking the VM. In the future, in multi-ractor mode, we can do the write on a copy of the `fields_obj` and then atomically swap it. Considerations: - Right now the `RClass` shape_id is always synchronized, but with namespace we should likely mark classes that have multiple namespace with a specific shape flag. Notes: Merged: https://github.com/ruby/ruby/pull/13411
6 days	Refactor the last references to `rb_shape_t`	Jean Boussier
	The type isn't opaque because Ruby isn't often compiled with LTO, so for optimization purpose it's better to allow as much inlining as possible. However ideally only `shape.c` and `shape.h` should deal with the actual struct, and everything else should just deal with opaque `shape_id_t`. Notes: Merged: https://github.com/ruby/ruby/pull/13586
10 days	Get rid of SHAPE_T_OBJECT	Jean Boussier
	Now that we have the `heap_index` in shape flags we no longer need `T_OBJECT` shapes. Notes: Merged: https://github.com/ruby/ruby/pull/13556
13 days	Get rid of TOO_COMPLEX shape type	Jean Boussier
	Instead it's now a `shape_id` flag. This allows to check if an object is complex without having to chase the `rb_shape_t` pointer. Notes: Merged: https://github.com/ruby/ruby/pull/13511
13 days	Get rid of frozen shapes.	Jean Boussier
	Instead `shape_id_t` higher bits contain flags, and the first one tells whether the shape is frozen. This has multiple benefits: - Can check if a shape is frozen with a single bit check instead of dereferencing a pointer. - Guarantees it is always possible to transition to frozen. - This allow reclaiming `FL_FREEZE` (not done yet). The downside is you have to be careful to preserve these flags when transitioning. Notes: Merged: https://github.com/ruby/ruby/pull/13289
2025-05-27	Get rid of `rb_shape_id(rb_shape_t *)`	Jean Boussier
	We should avoid conversions from `rb_shape_t *` into `shape_id_t` outside of `shape.c` as the short term goal is to have `shape_id_t` contain tags. Notes: Merged: https://github.com/ruby/ruby/pull/13448
2025-05-15	Ensure shape_id is never used on T_IMEMO	Jean Boussier
	It doesn't make sense to set ivars or anything shape related on a T_IMEMO. Co-Authored-By: John Hawthorn <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/13347
2025-05-11	Update common.mk dependencies	Yusuke Endoh

2025-05-11	namespace on read	Satoshi Tagomori

2025-05-09	Rename `RB_OBJ_SHAPE` -> `rb_obj_shape`	Jean Boussier
	As well as `RB_OBJ_SHAPE_ID` -> `rb_obj_shape_id` and `RSHAPE` is now a simple alias for `rb_shape_lookup`. I tried to turn all these into `static inline` but I'm having trouble with `RUBY_EXTERN rb_shape_tree_t *rb_shape_tree_ptr;` not being exposed as I'd expect. Notes: Merged: https://github.com/ruby/ruby/pull/13283
2025-05-09	Rename `rb_shape_get_shape_id` -> `RB_OBJ_SHAPE_ID`	Jean Boussier
	And `rb_shape_get_shape` -> `RB_OBJ_SHAPE`. Notes: Merged: https://github.com/ruby/ruby/pull/13283
2025-05-09	Rename `rb_shape_obj_too_complex` -> `rb_shape_obj_too_complex_p`	Jean Boussier
	Notes: Merged: https://github.com/ruby/ruby/pull/13283
2025-05-09	Refactor `rb_shape_depth` to take an ID rather than a pointer.	Jean Boussier
	As well as `rb_shape_edges_count` and `rb_shape_memsize`. Notes: Merged: https://github.com/ruby/ruby/pull/13283
2025-05-08	Move `object_id` in object fields.	Jean Boussier
	And get rid of the `obj_to_id_tbl` It's no longer needed, the `object_id` is now stored inline in the object alongside instance variables. We still need the inverse table in case `_id2ref` is invoked, but we lazily build it by walking the heap if that happens. The `object_id` concern is also no longer a GC implementation concern, but a generic implementation. Co-Authored-By: Matt Valentine-House <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/13159
2025-05-08	Rename `ivptr` -> `fields`, `next_iv_index` -> `next_field_index`	Jean Boussier
	Ivars will longer be the only thing stored inline via shapes, so keeping the `iv_index` and `ivptr` names would be confusing. Instance variables won't be the only thing stored inline via shapes, so keeping the `ivptr` name would be confusing. `field` encompass anything that can be stored in a VALUE array. Similarly, `gen_ivtbl` becomes `gen_fields_tbl`. Notes: Merged: https://github.com/ruby/ruby/pull/13159
2025-04-27	Use a `set_table` for `rb_vm_struct.unused_block_warning_table`	Jean Boussier
	Now that we have a hash-set implementation we can use that instead of a hash-table with a static value.
2025-03-13	Move object_id to flags for ObjectSpace dumps	Peter Zhu
	Moving object_id dumping from ObjectSpace to the GC flags allows ObjectSpace to not assume the FL_SEEN_OBJ_ID flag and instead move it to the responsibility of the GC. Notes: Merged: https://github.com/ruby/ruby/pull/12915
2025-02-19	Add rb_gc_object_metadata API	Peter Zhu
	This function replaces the internal rb_obj_gc_flags API. rb_gc_object_metadata returns an array of name and value pairs, with the last element having 0 for the name. Notes: Merged: https://github.com/ruby/ruby/pull/12777
2025-01-30	Output object_id in ObjectSpace.dump	Peter Zhu
	Outputs the object ID in the dump for objects that have it seen. Notes: Merged: https://github.com/ruby/ruby/pull/12657
2024-12-23	use `st_update` to prevent table extension	Koichi Sasada
	to prevent the following scenario: 1. `delete_unique_str()` can be called while GC (sweeping) 2. it calls `st_insert()` to decrement the counter 3. `st_insert()` can try to extend the table even if the key exists 4. `xmalloc` while GC and cause BUG Notes: Merged: https://github.com/ruby/ruby/pull/12407
2024-12-19	Prefix asan_poison_object with rb	Peter Zhu
	Notes: Merged: https://github.com/ruby/ruby/pull/12385
2024-12-16	Check whether object is valid in allocation_info_tracer_compact	Peter Zhu
	When reference updating ObjectSpace.trace_object_allocations, we need to check whether the object is valid or not because it does not mark the object so the object may be dead. This can cause a segmentation fault if the object is on a free heap page. For example, the following script crashes: require "objspace" objs = [] ObjectSpace.trace_object_allocations do 1_000_000.times do objs << Object.new end end objs = nil # Free pages that the objs were on GC.start # Run compaction and check that it doesn't crash GC.compact Notes: Merged: https://github.com/ruby/ruby/pull/12360
2024-12-16	Fix ObjectSpace.trace_object_allocations for compaction	Peter Zhu
	We need to reinsert into the ST table when an object moves because it is a numtable that hashes on the object address, so when an object moves we need to reinsert it rather than just updating the key. Notes: Merged: https://github.com/ruby/ruby/pull/12339
2024-12-16	Fix compaction check for ObjectSpace.trace_object_allocations	Peter Zhu
	We should be checking for key for moved objects rather than the value because the key is a Ruby object and the value is malloc'd memory. Notes: Merged: https://github.com/ruby/ruby/pull/12339
2024-12-09	objspace_dump: Use FILE* to avoid crashing in mark functions	Alan Wu
	We observed crashes from rb_io_bufwrite() thread switching (through rb_thread_check_ints()) in the middle of rb_execution_context_mark(). By the time rb_execution_context_mark() gets a timeslice again, it read garbage from a frame that was already popped in another thread, crashing the process in SEGV. Other mark functions probably have their own ways of breaking, but clearly, the usual IO code do too much for this perilous pseudo GC context. Use `FILE*` like before 5001cc47169614ea07d87651c95c2ee185e374e0 ("Optimize ObjectSpace.dump_all"). Also, add type checking for the private _dump methods. Co-authored-by: Peter Zhu <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/12285
2024-11-12	ObjectSpace.dump: handle Module#set_temporary_name	Jean Boussier
	[Bug #20892] Until the introduction of that method, it was impossible for a Module name not to be valid JSON, hence it wasn't going through the slower escaping function. This assumption no longer hold. Notes: Merged: https://github.com/ruby/ruby/pull/12067
2024-07-03	[Feature #20470] Split GC into gc_impl.c	Peter Zhu
	This commit splits gc.c into two files: - gc.c now only contains code not specific to Ruby GC. This includes code to mark objects (which the GC implementation may choose not to use) and wrappers for internal APIs that the implementation may need to use (e.g. locking the VM). - gc_impl.c now contains the implementation of Ruby's GC. This includes marking, sweeping, compaction, and statistics. Most importantly, gc_impl.c only uses public APIs in Ruby and a limited set of functions exposed in gc.c. This allows us to build gc_impl.c independently of Ruby and plug Ruby's GC into itself.
2024-04-27	ruby tool/update-deps --fix	卜部昌平

2024-03-19	Implement chilled strings	Étienne Barrié
	[Feature #20205] As a path toward enabling frozen string literals by default in the future, this commit introduce "chilled strings". From a user perspective chilled strings pretend to be frozen, but on the first attempt to mutate them, they lose their frozen status and emit a warning rather than to raise a `FrozenError`. Implementation wise, `rb_compile_option_struct.frozen_string_literal` is no longer a boolean but a tri-state of `enabled/disabled/unset`. When code is compiled with frozen string literals neither explictly enabled or disabled, string literals are compiled with a new `putchilledstring` instruction. This instruction is identical to `putstring` except it marks the String with the `STR_CHILLED (FL_USER3)` and `FL_FREEZE` flags. Chilled strings have the `FL_FREEZE` flag as to minimize the need to check for chilled strings across the codebase, and to improve compatibility with C extensions. Notes: - `String#freeze`: clears the chilled flag. - `String#-@`: acts as if the string was mutable. - `String#+@`: acts as if the string was mutable. - `String#clone`: copies the chilled flag. Co-authored-by: Jean Boussier <[email protected]>
2024-03-06	Move FL_SINGLETON to FL_USER1	Jean Boussier
	This frees FL_USER0 on both T_MODULE and T_CLASS. Note: prior to this, FL_SINGLETON was never set on T_MODULE, so checking for `FL_SINGLETON` without first checking that `FL_TYPE` was `T_CLASS` was valid. That's no longer the case.
2024-03-05	[DOC] Fix invalid documentation for `reachable_objects_from` (#10172)	Lazarus Lazaridis
	Previous documentation is stating the opposite (that the method won't work for CRuby).
2024-02-23	Use rb_hash_foreach in objspace.c	Peter Zhu
	Using RHASH_TBL_RAW is a private API, so we should use rb_hash_foreach rather than RHASH_TBL_RAW with st_foreach.
2024-01-19	Mark asan fake stacks during machine stack marking	KJ Tsanaktsidis
	ASAN leaves a pointer to the fake frame on the stack; we can use the __asan_addr_is_in_fake_stack API to work out the extent of the fake stack and thus mark any VALUEs contained therein. [Bug #20001]
2024-01-12	Revert "Mark asan fake stacks during machine stack marking"	KJ Tsanaktsidis
	This reverts commit d10bc3a2b8300cffc383e10c3730871e851be24c.
2024-01-12	Mark asan fake stacks during machine stack marking	KJ Tsanaktsidis
	ASAN leaves a pointer to the fake frame on the stack; we can use the __asan_addr_is_in_fake_stack API to work out the extent of the fake stack and thus mark any VALUEs contained therein. [Bug #20001]
2023-11-22	objspace_dump.c: dump call cache ids with dump_append_id	Jean Boussier
	Not all `ID` have an associated string. Fixes a SEGFAULT in ObjectSpace.dump_all spec.
2023-11-21	`ObjectSpace.count_nodes` doesn't count nodes	yui-knk
	Node has not been managed by GC from Ruby 2.5. Therefore these codes are not needed. If ObjectSpace depends on Node, it needs to update the file when node type is updated. Delete node related codes to avoid such update.
2023-11-20	Don't try compacting ivars on Classes that are "too complex"	Aaron Patterson
	Too complex classes use a hash table to store ivs, and should always pin their IVs. We shouldn't touch those classes in compaction.
2023-11-13	Revert "Revert "Remove SHAPE_CAPACITY_CHANGE shapes""	Peter Zhu
	This reverts commit 5f3fb4f4e397735783743fe52a7899b614bece20.
2023-11-13	Record more info from CALLCACHE in heap dumps	John Hawthorn
	This records the called_id and klass from imemo_callcache objects in heap dumps.
2023-11-10	Revert "Remove SHAPE_CAPACITY_CHANGE shapes"	Peter Zhu
	This reverts commit f6910a61122931e4193bcc0fad18d839c319b720. We're seeing crashes in the test suite of Shopify's core monolith after this change.
2023-11-09	Remove SHAPE_CAPACITY_CHANGE shapes	Peter Zhu
	We don't need to create a shape to transition capacity as we can transition the capacity when the capacity of the SHAPE_IVAR changes.
2023-11-02	Make every initial size pool shape a root shape	Peter Zhu
	This commit makes every initial size pool shape a root shape and assigns it a capacity of 0.
2023-10-12	Switch mid dump to dump_append_string_value	John Hawthorn
	I don't think it's possible to create a CI with a mid which would need escaping to be in a JSON string, but I think we might as well not rely on that assumption.
2023-10-12	Fix ObjectSpace.dump with super() callinfo	John Hawthorn
	super() uses 0 as mid for its callinfo, so we need to check for that to avoid a segfault when using dump_all.
2023-10-06	Remove `NODE_VALUES`	Nobuyoshi Nakada
	This node type was added for the multi-value experiment back in 2004. The feature itself was removed after a few years, but this is its remnant.
2023-10-05	Move internal NODE_DEF_TEMP to parse.y	Nobuyoshi Nakada

2023-10-02	Dump name of method for imemo callinfo	Peter Zhu
	This commit dumps the `mid` of the imemo callinfo when calling `ObjectSpace.dump_all`.
2023-09-29	Merge NODE_DEF_TEMP and NODE_DEF_TEMP2	yui-knk

2023-09-28	Change RNode structure from union to struct	yui-knk
	All kind of AST nodes use same struct RNode, which has u1, u2, u3 union members for holding different kind of data. This has two problems. 1. Low flexibility of data structure Some nodes, for example NODE_TRUE, don’t use u1, u2, u3. On the other hand, NODE_OP_ASGN2 needs more than three union members. However they use same structure definition, need to allocate three union members for NODE_TRUE and need to separate NODE_OP_ASGN2 into another node. This change removes the restriction so make it possible to change data structure by each node type. 2. No compile time check for union member access It’s developer’s responsibility for using correct member for each node type when it’s union. This change clarifies which node has which type of fields and enables compile time check. This commit also changes node_buffer_elem_struct buf management to handle different size data with alignment.