diff options
author | Jean Boussier <[email protected]> | 2024-10-19 12:55:57 +0200 |
---|---|---|
committer | Hiroshi SHIBATA <[email protected]> | 2024-10-26 18:44:15 +0900 |
commit | e52b47680ea35a8cdca918f9853f7cdcb4ced6af (patch) | |
tree | ceb5d3bbcebfdf87be74111a49d04a6d3fa175cd /benchmark | |
parent | 97713ac952cf8d3fa7d9ca07c3aaeef3e8956c85 (diff) |
[ruby/json] Reduce encoding benchmark size
Profiling revealed that we were spending lots of time growing the buffer.
Buffer operations is definitely something we want to optimize, but for
this specific benchmark what we're interested in is UTF-8 scanning performance.
Each iteration of the two scaning benchmark were producing 20MB of JSON,
now they only produce 5MB.
Now:
```
== Encoding mostly utf8 (5001001 bytes)
ruby 3.4.0dev (2024-10-18T19:01:45Z master https://github.com/ruby/json/commit/7be9a333ca) +YJIT +PRISM [arm64-darwin23]
Warming up --------------------------------------
json 35.000 i/100ms
oj 36.000 i/100ms
rapidjson 10.000 i/100ms
Calculating -------------------------------------
json 359.161 (± 1.4%) i/s (2.78 ms/i) - 1.820k in 5.068542s
oj 359.699 (± 0.6%) i/s (2.78 ms/i) - 1.800k in 5.004291s
rapidjson 99.687 (± 2.0%) i/s (10.03 ms/i) - 500.000 in 5.017321s
Comparison:
json: 359.2 i/s
oj: 359.7 i/s - same-ish: difference falls within error
rapidjson: 99.7 i/s - 3.60x slower
```
https://github.com/ruby/json/commit/1a338532d2
Diffstat (limited to 'benchmark')
-rw-r--r-- | benchmark/encoder.rb | 8 |
1 files changed, 3 insertions, 5 deletions
diff --git a/benchmark/encoder.rb b/benchmark/encoder.rb index bd0105ae38..2b02d5d5e9 100644 --- a/benchmark/encoder.rb +++ b/benchmark/encoder.rb @@ -59,11 +59,9 @@ end benchmark_encoding "small nested array", [[1,2,3,4,5]]*10 benchmark_encoding "small hash", { "username" => "jhawthorn", "id" => 123, "event" => "wrote json serializer" } -# On this one we're a bit slower (~25%). -benchmark_encoding "mostly utf8", ([("€" * 3333)] * 2000), except: %i(json_state) - -# On these three benchmarks we perform well. Either on par or very closely faster/slower -benchmark_encoding "mixed utf8", ([("a" * 5000) + "€" + ("a" * 5000)] * 2000), except: %i(json_state) +# On these benchmarks we perform well. Either on par or very closely faster/slower +benchmark_encoding "mixed utf8", ([("a" * 5000) + "€" + ("a" * 5000)] * 500), except: %i(json_state) +benchmark_encoding "mostly utf8", ([("€" * 3333)] * 500), except: %i(json_state) benchmark_encoding "twitter.json", JSON.load_file("#{__dir__}/data/twitter.json"), except: %i(json_state) benchmark_encoding "citm_catalog.json", JSON.load_file("#{__dir__}/data/citm_catalog.json"), except: %i(json_state) |