summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorAlan Wu <[email protected]>2022-01-06 21:57:43 -0500
committerAlan Wu <[email protected]>2022-01-07 19:29:03 -0500
commit54c91042ed61a869d4a66fc089b21f56d165265f (patch)
tree9d4e7f2acbbbabdd62b1fb1c24082a580ce10094
parentbc643bbe2e2c1afbed18ce2bcf4aed138fece412 (diff)
YJIT: Discard local var type info on routine call
Routines that are called from YJIT's output code can call methods, and calling methods mean they can capture and change the environment of the calling frame. Discard type info whenever we perform routine calls. This is more conservative than strictly necessary as some routines need to perform GC allocation but can never call methods and so should never be able to change local variables. However, manually analyzing C functions for whether they have code paths that call methods is error prone and can go out of date as changes land in the codebase. Closes: shopify/yjit#300
Notes
Notes: Merged: https://github.com/ruby/ruby/pull/5416
-rw-r--r--bootstraptest/test_yjit.rb31
-rw-r--r--yjit_codegen.c4
2 files changed, 35 insertions, 0 deletions
diff --git a/bootstraptest/test_yjit.rb b/bootstraptest/test_yjit.rb
index 30298a820d..47744efb73 100644
--- a/bootstraptest/test_yjit.rb
+++ b/bootstraptest/test_yjit.rb
@@ -1,3 +1,34 @@
+assert_equal '2022', %q{
+ def contrivance(hash, key)
+ # Expect this to compile to an `opt_aref`.
+ hash[key]
+
+ # The [] call above tracks that the `hash` local has a VALUE that
+ # is a heap pointer and the guard for the Kernel#itself call below
+ # doesn't check that it's a heap pointer VALUE.
+ #
+ # As you can see from the crash, the call to rb_hash_aref() can set the
+ # `hash` local, making eliding the heap object guard unsound.
+ hash.itself
+ end
+
+ # This is similar to ->(recv, mid) { send(recv, mid).local_variable_set(...) }.
+ # By composing we avoid creating new Ruby frames and so sending :binding
+ # captures the environment of the frame that does the missing key lookup.
+ # We use it to capture the environment inside of `contrivance`.
+ cap_then_set =
+ Kernel.instance_method(:send).method(:bind_call).to_proc >>
+ ->(binding) { binding.local_variable_set(:hash, 2022) }
+ special_missing = Hash.new(&cap_then_set)
+
+ # Make YJIT speculate that it's a hash and generate code
+ # that calls rb_hash_aref().
+ contrivance({}, :warmup)
+ contrivance({}, :warmup)
+
+ contrivance(special_missing, :binding)
+}
+
assert_equal '18374962167983112447', %q{
# regression test for incorrectly discarding 32 bits of a pointer when it
# comes to default values.
diff --git a/yjit_codegen.c b/yjit_codegen.c
index aa352dac70..25fcfca083 100644
--- a/yjit_codegen.c
+++ b/yjit_codegen.c
@@ -184,6 +184,10 @@ jit_prepare_routine_call(jitstate_t *jit, ctx_t *ctx, x86opnd_t scratch_reg)
jit->record_boundary_patch_point = true;
jit_save_pc(jit, scratch_reg);
jit_save_sp(jit, ctx);
+
+ // In case the routine calls Ruby methods, it can set local variables
+ // through Kernel#binding and other means.
+ ctx_clear_local_types(ctx);
}
// Record the current codeblock write position for rewriting into a jump into