ruby.git - The Ruby Programming Language

Age	Commit message (Collapse)	Author
2024-04-04	Revert "hijack SIGCHLD handler for internal use"	Nobuyoshi Nakada
	This reverts commit 054a412d540e7ed2de63d68da753f585ea6616c3. SIGCHLD `waidpid`, `waitpid_lock` and related code, have been removed at ruby/ruby#7527.
2024-04-04	Move shareable_constant_value logic from parse.y to compile.c	yui-knk

2024-04-02	Remove `rb_imemo_tmpbuf_t` from parser	yui-knk
	No parser semantic value types are `VALUE` then no need to use imemo for managing semantic value stack anymore.
2024-04-02	[Feature #20331] Simplify parser warnings for hash keys duplication and when ↵	yui-knk
	clause duplication This commit simplifies warnings for hash keys duplication and when clause duplication, based on the discussion of https://bugs.ruby-lang.org/issues/20331. Warnings are reported only when strings are same to ohters.
2024-04-02	Remove VALUE from `struct rb_strterm_struct`	yui-knk
	In the past, it was imemo. However a075c55 changed it. Therefore no need to use `VALUE` for the first field.
2024-03-27	Fix setting GC stress at boot when objspace not available	Peter Zhu

2024-03-26	Refactor init_copy gc attributes	eileencodes
	This PR moves `rb_copy_wb_protected_attribute` and `rb_gc_copy_finalizer` into a single function called `rb_gc_copy_attributes` to be called by `init_copy`. This reduces the surface area of the GC API. Co-authored-by: Peter Zhu <[email protected]>
2024-03-25	Fix --debug=gc_stress flag	Peter Zhu
	ruby_env_debug_option gets called after Init_gc_stress, so the --debug=gc_stress flag never works.
2024-03-21	Fix Ripper memory allocation size when enabled Universal Parser	S-H-GAMELINKS
	The size of `struct parser_params` is 8 bytes difference in `ripper_s_allocate` and `rb_ruby_parser_allocate` when the universal parser is enabled. This causes a situation where `r->p` is not fully initialized in `ripper_s_allocate` as shown below. ```console (gdb) p r->p $2 = {heap = 0x0, lval = 0x0, yylloc = 0x0, lex = {strterm = 0x0, gets = 0x0, input = 0, string_buffer = {head = 0x0, last = 0x0}, lastlin e = 0x0, nextline = 0x0, pbeg = 0x0, pcur = 0x0, pend = 0x0, ptok = 0x0, gets_ = {ptr = 0, call = 0x0}, state = EXPR_NONE, paren_nest = 0, lpar _seen = 0, debug = 0, has_shebang = 0, token_seen = 0, token_info_enabled = 0, error_p = 0, cr_seen = 0, value = 0, result = 0, parsing_thread = 0, s_value = 0, s_lvalue = 0, s_value_stack = 2097} ```` This seems to cause `double free or corruption (!prev)` and SEGV. So, fixing this by introduce `rb_ripper_parser_params_allocate` and `rb_ruby_parser_config` functions for Ripper, and `struct parser_params` same size is returned.
2024-03-20	Make rb_aligned_malloc private	Peter Zhu
	It is not used anywhere else.
2024-03-19	[DOC] Unify Doxygen formats (#10285)	Takashi Kokubun

2024-03-19	Implement chilled strings	Étienne Barrié
	[Feature #20205] As a path toward enabling frozen string literals by default in the future, this commit introduce "chilled strings". From a user perspective chilled strings pretend to be frozen, but on the first attempt to mutate them, they lose their frozen status and emit a warning rather than to raise a `FrozenError`. Implementation wise, `rb_compile_option_struct.frozen_string_literal` is no longer a boolean but a tri-state of `enabled/disabled/unset`. When code is compiled with frozen string literals neither explictly enabled or disabled, string literals are compiled with a new `putchilledstring` instruction. This instruction is identical to `putstring` except it marks the String with the `STR_CHILLED (FL_USER3)` and `FL_FREEZE` flags. Chilled strings have the `FL_FREEZE` flag as to minimize the need to check for chilled strings across the codebase, and to improve compatibility with C extensions. Notes: - `String#freeze`: clears the chilled flag. - `String#-@`: acts as if the string was mutable. - `String#+@`: acts as if the string was mutable. - `String#clone`: copies the chilled flag. Co-authored-by: Jean Boussier <[email protected]>
2024-03-18	Remove duplicated function prototype rb_gc_disable_no_rest	Peter Zhu

2024-03-18	Remove rb_raw_obj_info_basic	Peter Zhu
	It's not used outside of gc.c.
2024-03-18	Faster Integer.sqrt for large bignum	tompng
	Integer.sqrt uses Newton's method. This pull request reduces the precision which was unnecessarily high in each calculation step.
2024-03-14	[Feature #20265] Remove rb_newobj_of and RB_NEWOBJ_OF	Peter Zhu

2024-03-14	`Exception#set_backtrace` accept arrays of `Backtrace::Location`	Jean Boussier
	[Feature #13557] Setting the backtrace with an array of strings is lossy. The resulting exception will return nil on `#backtrace_locations`. By accepting an array of `Backtrace::Location` instance, we can rebuild a `Backtrace` instance and have a fully functioning Exception. Co-Authored-By: Étienne Barrié <[email protected]>
2024-03-13	Don't directly read the SIZE_POOL_COUNT in shapes	Peter Zhu
	This removes the assumption about SIZE_POOL_COUNT for shapes.
2024-03-13	Simplify NEWOBJ_OF macro	Peter Zhu

2024-03-11	Use NEWOBJ_OF_ec in NEWOBJ_OF_0	Peter Zhu

2024-03-08	Retire RUBY_MARK_UNLESS_NULL	Jean Boussier
	Marking `Qnil` or `Qfalse` works fine, having an extra macro to avoid it isn't needed.
2024-03-06	Refactor VM root modules	Jean Boussier
	This `st_table` is used to both mark and pin classes defined from the C API. But `vm->mark_object_ary` already does both much more efficiently. Currently a Ruby process starts with 252 rooted classes, which uses `7224B` in an `st_table` or `2016B` in an `RArray`. So a baseline of 5kB saved, but since `mark_object_ary` is preallocated with `1024` slots but only use `405` of them, it's a net `7kB` save. `vm->mark_object_ary` is also being refactored. Prior to this changes, `mark_object_ary` was a regular `RArray`, but since this allows for references to be moved, it was marked a second time from `rb_vm_mark()` to pin these objects. This has the detrimental effect of marking these references on every minors even though it's a mostly append only list. But using a custom TypedData we can save from having to mark all the references on minor GC runs. Addtionally, immediate values are now ignored and not appended to `vm->mark_object_ary` as it's just wasted space.
2024-03-06	Move FL_SINGLETON to FL_USER1	Jean Boussier
	This frees FL_USER0 on both T_MODULE and T_CLASS. Note: prior to this, FL_SINGLETON was never set on T_MODULE, so checking for `FL_SINGLETON` without first checking that `FL_TYPE` was `T_CLASS` was valid. That's no longer the case.
2024-03-01	Don't pin named structs defined in Ruby	Jean Boussier
	[Bug #20311] `rb_define_class_under` assumes it's called from C and that the reference might be held in a C global variable, so it adds the class to the VM root. In the case of `Struct.new('Name')` it's wasteful and make the struct immortal.
2024-02-28	Make rb_define_finalizer_no_check private	Peter Zhu

2024-02-28	Remove unused rb_gc_id2ref_obj_tbl	Peter Zhu

2024-02-26	Remove rb_objspace_marked_object_p	Peter Zhu
	rb_objspace_marked_object_p is no longer used in the objspace module, so we can remove it.
2024-02-26	Make rb_objspace_data_type_memsize private	Peter Zhu
	rb_objspace_data_type_memsize is not used in the objspace module, so we can make it private.
2024-02-26	Remove unused rb_objspace_each_objects_without_setup	Peter Zhu

2024-02-22	Extract imemo functions from gc.c into imemo.c	Peter Zhu

2024-02-21	Add IMEMO_NEW	Peter Zhu
	Rather than exposing that an imemo has a flag and four fields, this changes the implementation to only expose one field (the klass) and fills the rest with 0. The type will have to fill in the values themselves.
2024-02-20	De-dup identical callinfo objects	John Hawthorn
	Previously every call to vm_ci_new (when the CI was not packable) would result in a different callinfo being returned this meant that every kwarg callsite had its own CI. When calling, different CIs result in different CCs. These CIs and CCs both end up persisted on the T_CLASS inside cc_tbl. So in an eval loop this resulted in a memory leak of both types of object. This also likely resulted in extra memory used, and extra time searching, in non-eval cases. For simplicity in this commit I always allocate a CI object inside rb_vm_ci_lookup, but ideally we would lazily allocate it only when needed. I hope to do that as a follow up in the future.
2024-02-21	Introduce NODE_REGX to manage regexp literal	yui-knk

2024-02-20	[Feature #20257] Rearchitect Ripper	yui-knk
	Introduce another semantic value stack for Ripper so that Ripper can manage both Node and Ruby Object separately. This rearchitectutre of Ripper solves these issues. Therefore adding test cases for them. * [Bug 10436] https://bugs.ruby-lang.org/issues/10436 * [Bug 18988] https://bugs.ruby-lang.org/issues/18988 * [Bug 20055] https://bugs.ruby-lang.org/issues/20055 Checked the differences of `Ripper.sexp` for files under `/test/ruby` are only on test_pattern_matching.rb. The differences comes from the differences between `new_hash_pattern_tail` functions between parser and Ripper. Ripper `new_hash_pattern_tail` didn’t call `assignable` then `kw_rest_arg` wasn’t marked as local variable. This is also fixed by this commit. ``` --- a/./tmp/before/test_pattern_matching.rb +++ b/./tmp/after/test_pattern_matching.rb @@ -3607,7 +3607,7 @@ [:in, [:hshptn, nil, [], [:var_field, [:@ident, “a”, [984, 13]]]], [[:binary, - [:vcall, [:@ident, “a”, [985, 10]]], + [:var_ref, [:@ident, “a”, [985, 10]]], :==, [:hash, nil]]], nil]]], @@ -3662,7 +3662,7 @@ [:in, [:hshptn, nil, [], [:var_field, [:@ident, “a”, [993, 13]]]], [[:binary, - [:vcall, [:@ident, “a”, [994, 10]]], + [:var_ref, [:@ident, “a”, [994, 10]]], :==, [:hash, [:assoclist_from_args, @@ -3813,7 +3813,7 @@ [:command, [:@ident, “raise”, [1022, 10]], [:args_add_block, - [[:vcall, [:@ident, “b”, [1022, 16]]]], + [[:var_ref, [:@ident, “b”, [1022, 16]]]], false]]], [:else, [[:var_ref, [:@kw, “true”, [1024, 10]]]]]]]], nil, @@ -3876,7 +3876,7 @@ [:@int, “0”, [1033, 15]]], :“&&“, [:binary, - [:vcall, [:@ident, “b”, [1033, 20]]], + [:var_ref, [:@ident, “b”, [1033, 20]]], :==, [:hash, nil]]]], nil]]], @@ -3946,7 +3946,7 @@ [:@int, “0”, [1042, 15]]], :“&&“, [:binary, - [:vcall, [:@ident, “b”, [1042, 20]]], + [:var_ref, [:@ident, “b”, [1042, 20]]], :==, [:hash, [:assoclist_from_args, @@ -5206,7 +5206,7 @@ [[:assoc_new, [:@label, “c:“, [1352, 22]], [:@int, “0”, [1352, 25]]]]]], - [:vcall, [:@ident, “r”, [1352, 29]]]], + [:var_ref, [:@ident, “r”, [1352, 29]]]], false]]], [:binary, [:call, @@ -5299,7 +5299,7 @@ [:assoc_new, [:@label, “c:“, [1367, 34]], [:@int, “0”, [1367, 37]]]]]], - [:vcall, [:@ident, “r”, [1367, 41]]]], + [:var_ref, [:@ident, “r”, [1367, 41]]]], false]]], [:binary, [:call, @@ -5931,7 +5931,7 @@ [:in, [:hshptn, nil, [], [:var_field, [:@ident, “r”, [1533, 11]]]], [[:binary, - [:vcall, [:@ident, “r”, [1534, 8]]], + [:var_ref, [:@ident, “r”, [1534, 8]]], :==, [:hash, [:assoclist_from_args, ```
2024-02-19	[Bug #20280] Check by `rb_parser_enc_str_coderange`	Nobuyoshi Nakada
	Co-authored-by: Yuichiro Kaneko <[email protected]>
2024-02-19	[Bug #20280] Raise SyntaxError on invalid encoding symbol	Nobuyoshi Nakada

2024-02-14	Move rb_class_allocate_instance from gc.c to object.c	Peter Zhu

2024-02-13	Specialize String#byteslice(a, b) (#9939)	Aaron Patterson
	* Specialize String#byteslice(a, b) This adds a specialization for String#byteslice when there are two parameters. This makes our protobuf parser go from 5.84x slower to 5.33x slower ``` Comparison: decode upstream (53738 bytes): 7228.5 i/s decode protobuff (53738 bytes): 1236.8 i/s - 5.84x slower Comparison: decode upstream (53738 bytes): 7024.8 i/s decode protobuff (53738 bytes): 1318.5 i/s - 5.33x slower ``` * Update yjit/src/codegen.rs --------- Co-authored-by: Maxime Chevalier-Boisvert <[email protected]>
2024-02-12	proc.c: get rid of `CLONESETUP`	Jean Boussier
	[Bug #20253] All the way down to Ruby 1.9, `Proc`, `Method`, `UnboundMethod` and `Binding` always had their own specific clone and dup routine. This caused various discrepancies with how other objects behave on `dup` and `clone. [Bug #20250], [Bug #20253]. This commit get rid of `CLONESETUP` and use the the same codepath as all other types, so ensure consistency. NB: It's still not accepting the `freeze` keyword argument on `clone`. Co-Authored-By: Étienne Barrié <[email protected]>
2024-02-09	Remove ruby object from string nodes	yui-knk
	String nodes holds ruby string object on `VALUE nd_lit`. This commit changes it to `struct rb_parser_string *string` to reduce dependency on ruby object. Sometimes these strings are concatenated with other string therefore string concatenate functions are needed.
2024-02-05	Make io_fwrite safe for compaction	Peter Zhu
	[Bug #20169] Embedded strings are not safe for system calls without the GVL because compaction can cause pages to be locked causing the operation to fail with EFAULT. This commit changes io_fwrite to use rb_str_tmp_frozen_no_embed_acquire, which guarantees that the return string is not embedded.
2024-02-01	Parenthesize casted argument	Nobuyoshi Nakada

2024-01-31	Introduced `rb_node_const_decl_val` function	S.H
	Introduce `rb_node_const_decl_val` function to allow `rb_ary_join` and `rb_ary_reverse` functions to be removed from Universal Parser.
2024-01-30	Use `UNDEF_P`	Nobuyoshi Nakada

2024-01-27	Introduce `NODE_ENCODING`	S.H
	`__ENCODING__ `was managed by `NODE_LIT` with Encoding object. Introduce `NODE_ENCODING` for 1. `__ENCODING__` is detectable from AST Node. 2. Reduce dependency Ruby object for parse.y
2024-01-24	Define `IO_WITHOUT_GVL` macro	Nobuyoshi Nakada

2024-01-23	Make lastline and nextline to be rb_parser_string	yui-knk
	This commit changes `struct parser_params` lastline and nextline from `VALUE` (String object) to `rb_parser_string_t *` so that dependency on Ruby Object is reduced. `parser_string_buffer_t string_buffer` is added to `struct parser_params` to manage `rb_parser_string_t` pointers of each line. All allocated line strings are freed in `rb_ruby_parser_free`.
2024-01-19	Mark asan fake stacks during machine stack marking	KJ Tsanaktsidis
	ASAN leaves a pointer to the fake frame on the stack; we can use the __asan_addr_is_in_fake_stack API to work out the extent of the fake stack and thus mark any VALUEs contained therein. [Bug #20001]
2024-01-19	Define special macros for asan/msan being enabled	KJ Tsanaktsidis
	__has_feature is a clang-ism, and GCC has a different way to tell if sanitizers are enabled. For this reason, I don't want to spray __has_feature all over the codebase for other places where conditional compilation based on sanitizers is required. [Bug #20001]
2024-01-19	Make stack bounds detection work with ASAN	KJ Tsanaktsidis
	Where a local variable is used as part of the stack bounds detection, it has to actually be on the stack. ASAN can put local variable on "fake stacks", however, with addresses in different memory mappings. This completely destroys the stack bounds calculation, and can lead to e.g. things not getting GC marked on the machine stack or stackoverflow checks that always fail. The __asan_addr_is_in_fake_stack helper can be used to get the _real_ stack address of such variables, and thus perform the stack size calculation properly [Bug #20001]