Make Shared transparent to parent kernels #8576
Performance Regression: -15.92%
⚠️ Unknown Walltime execution environment detected
Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.
For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.
⚡ 2 improved benchmarks
❌ 9 regressed benchmarks
✅ 1584 untouched benchmarks
⏩ 4 skipped benchmarks¹
Warning
Please fix the performance issues or acknowledge them on CodSpeed.
Performance Changes
| | Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|---|
| ❌ | Simulation | `chunked_bool_canonical_into[(1000, 10)]` | 15.9 µs | 26.9 µs | -40.83% |
| ❌ | Simulation | `copy_nullable[65536]` | 1 ms | 1.4 ms | -24.27% |
| ❌ | Simulation | `search_index_below_min_chunked` | 1.3 ms | 1.6 ms | -20.33% |
| ❌ | Simulation | `search_index_mixed_out_of_range_chunked` | 1.3 ms | 1.7 ms | -20.19% |
| ❌ | Simulation | `search_index_full_range_random_chunked` | 1.4 ms | 1.7 ms | -19.76% |
| ❌ | Simulation | `chunked_varbinview_into_canonical[(1000, 10)]` | 168.9 µs | 206.6 µs | -18.22% |
| ❌ | Simulation | `search_index_above_max_chunked` | 1.7 ms | 2 ms | -17.59% |
| ❌ | Simulation | `copy_non_nullable[65536]` | 908.5 µs | 1,089.2 µs | -16.59% |
| ❌ | Simulation | `search_index_in_range_chunked` | 1.9 ms | 2.3 ms | -16.23% |
| ⚡ | Simulation | `chunked_varbinview_canonical_into[(1000, 10)]` | 191.1 µs | 154.7 µs | +23.54% |
| ⚡ | Simulation | `compact_sliced[(4096, 90)]` | 837.5 ns | 750 ns | +11.67% |
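For readers checking the Efficiency column by hand: the reported percentages are consistent with `BASE / HEAD - 1`, so a slower HEAD yields a negative value. This is an inference from the numbers in the table, not a formula taken from CodSpeed's documentation, and the helper name `efficiency_change` is hypothetical. A minimal sketch under that assumption:

```python
def efficiency_change(base: float, head: float) -> float:
    """Relative change in percent, ASSUMING the column is BASE / HEAD - 1."""
    return (base / head - 1.0) * 100.0


# Two rows whose displayed times carry enough digits to check the assumption.
# Both times in a row share a unit, so the unit cancels out.
print(round(efficiency_change(908.5, 1089.2), 2))  # copy_non_nullable[65536]: -16.59
print(round(efficiency_change(837.5, 750.0), 2))   # compact_sliced[(4096, 90)]: 11.67
```

Rows whose times are displayed with only two significant digits (for example 1.3 ms vs 1.6 ms) will not reproduce the reported percentage exactly, since the report presumably computes it from unrounded measurements.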
Tip
Investigate this regression by commenting `@codspeedbot fix this regression` on this PR, or directly use the CodSpeed MCP with your agent.
Comparing ngates/onpair-split-1-shared-parent-kernels (95c4949) with develop (797b650)
Footnotes
1. 4 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, archive them to remove them from the performance reports.