Misc minor optimizations to query optimizer performance by AdamGS · Pull Request #21128 · apache/datafusion

AdamGS · 2026-03-23T19:23:43Z

Which issue does this PR close?

Closes #.

Rationale for this change

Inspired by @blaginin, trying to find more places that might drag the optimizer's performance. On my laptop , this improves many of the sql planner's benchmarks by a fairly consistent 2-5%.

What changes are included in this PR?

A slew of minor optimization in the logical planner, trying to avoid wasted work or repeated allocations

Are these changes tested?

Existing tests.

Are there any user-facing changes?

None

AdamGS · 2026-03-23T19:27:39Z

datafusion/optimizer/src/push_down_limit.rs

        true
    }

+    #[expect(clippy::only_used_in_recursion)]


this is just wasteful, just a lint away.

As in you plan to remove it as a follow on PR?

datafusion/optimizer/src/push_down_filter.rs

AdamGS · 2026-03-23T19:29:07Z

datafusion/optimizer/src/analyzer/type_coercion.rs


    fn analyze(&self, plan: LogicalPlan, config: &ConfigOptions) -> Result<LogicalPlan> {
-        let empty_schema = DFSchema::empty();
+        static EMPTY_SCHEMA: LazyLock<DFSchema> = LazyLock::new(DFSchema::empty);


empty DFSchema isn't free, similar to #20534

Signed-off-by: Adam Gutglick <[email protected]>

alamb · 2026-03-27T17:06:06Z

run benchmark sql_planner

adriangbot · 2026-03-27T17:10:45Z

🤖 Criterion benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4144029417-586-jcd4f 6.12.55+ #1 SMP Sun Feb 1 08:59:41 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing adamg/optimizer-memory-optimizations (532b74e) to 2b986c8 (merge-base) diff
BENCH_NAME=sql_planner
BENCH_COMMAND=cargo bench --features=parquet --bench sql_planner
BENCH_FILTER=
Results will be posted here when complete

File an issue against this benchmark runner

alamb

Thanks @AdamGS -- looks good to me

assuming the benchmarks show reasonable improvements let's merge it in!

alamb · 2026-03-27T17:06:55Z

datafusion/optimizer/src/simplify_expressions/expr_simplifier.rs

        // The dummy column name is unused and doesn't matter as only
        // expressions without column references can be evaluated
-        static DUMMY_COL_NAME: &str = ".";
-        let schema = Arc::new(Schema::new(vec![Field::new(


this is nice to save several callocations for each call to the Const evaluator 👍

alamb · 2026-03-27T17:07:52Z

datafusion/optimizer/src/push_down_limit.rs

        true
    }

+    #[expect(clippy::only_used_in_recursion)]


As in you plan to remove it as a follow on PR?

alamb · 2026-03-27T17:08:39Z

datafusion/optimizer/src/push_down_limit.rs

        plan: LogicalPlan,
        config: &dyn OptimizerConfig,
    ) -> Result<Transformed<LogicalPlan>> {
-        let _ = config.options();


that is weird

to answer your question above, I think the lint has to stay here because this seems worse, and as far as I can tell in this rule the config is just passed along recursively

alamb · 2026-03-27T17:10:56Z

datafusion/optimizer/src/push_down_filter.rs

 fn extract_or_clauses_for_join<'a>(
    filters: &'a [Expr],
-    schema: &'a DFSchema,
+    schema_cols: &'a HashSet<Column>,


Do we need to have owned Columns?

Maybe this could be something like &HashSet<&Column> and avoid copying strings too

I don't think it'll work with &Column here, but I do think its possible to avoid the String allocation and everything here seems like internal functions, I'll try it out

I pushed a change, it widen the scope by a bit but basically just introduces a type that holds two references and passes it around, everything is contained within the module.

AdamGS · 2026-03-28T17:26:13Z

I think the benchmarks results got lost?

Dandandan · 2026-03-28T18:23:52Z

run benchmark sql_planner

adriangbot · 2026-03-28T18:28:24Z

🤖 Criterion benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4148567065-599-8nts6 6.12.55+ #1 SMP Sun Feb 1 08:59:41 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing adamg/optimizer-memory-optimizations (9b9e4f5) to 2b986c8 (merge-base) diff
BENCH_NAME=sql_planner
BENCH_COMMAND=cargo bench --features=parquet --bench sql_planner
BENCH_FILTER=
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-03-28T19:20:54Z

🤖 Criterion benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

group                                                 adamg_optimizer-memory-optimizations    main
-----                                                 ------------------------------------    ----
logical_aggregate_with_join                           1.05    453.7±1.33µs        ? ?/sec     1.00    432.7±2.28µs        ? ?/sec
logical_plan_struct_join_agg_sort                     1.00    170.4±0.90µs        ? ?/sec     1.00    171.2±3.52µs        ? ?/sec
logical_select_all_from_1000                          1.04      8.3±0.03ms        ? ?/sec     1.00      7.9±0.04ms        ? ?/sec
logical_select_one_from_700                           1.02    324.5±1.63µs        ? ?/sec     1.00    317.9±1.89µs        ? ?/sec
logical_trivial_join_high_numbered_columns            1.03    270.1±0.87µs        ? ?/sec     1.00    261.7±0.72µs        ? ?/sec
logical_trivial_join_low_numbered_columns             1.03    257.2±0.79µs        ? ?/sec     1.00    249.4±0.62µs        ? ?/sec
physical_intersection                                 1.01    602.6±1.97µs        ? ?/sec     1.00    596.0±2.24µs        ? ?/sec
physical_join_consider_sort                           1.02   1035.8±4.23µs        ? ?/sec     1.00   1018.8±3.45µs        ? ?/sec
physical_join_distinct                                1.02    250.8±0.74µs        ? ?/sec     1.00    245.3±0.57µs        ? ?/sec
physical_many_self_joins                              1.03      7.7±0.10ms        ? ?/sec     1.00      7.5±0.10ms        ? ?/sec
physical_plan_clickbench_all                          1.00    110.1±4.90ms        ? ?/sec     1.04    114.1±4.29ms        ? ?/sec
physical_plan_clickbench_q1                           1.00  1108.7±21.47µs        ? ?/sec     1.00  1110.1±18.39µs        ? ?/sec
physical_plan_clickbench_q10                          1.00  1821.7±76.67µs        ? ?/sec     1.01  1843.9±76.77µs        ? ?/sec
physical_plan_clickbench_q11                          1.00  1891.8±85.97µs        ? ?/sec     1.01  1906.8±85.10µs        ? ?/sec
physical_plan_clickbench_q12                          1.00  1990.1±105.93µs        ? ?/sec    1.01      2.0±0.10ms        ? ?/sec
physical_plan_clickbench_q13                          1.01  1767.1±71.69µs        ? ?/sec     1.00  1744.3±64.02µs        ? ?/sec
physical_plan_clickbench_q14                          1.00  1924.3±70.86µs        ? ?/sec     1.01  1950.3±48.04µs        ? ?/sec
physical_plan_clickbench_q15                          1.01  1825.6±88.16µs        ? ?/sec     1.00  1807.4±71.81µs        ? ?/sec
physical_plan_clickbench_q16                          1.00  1525.3±47.08µs        ? ?/sec     1.00  1521.0±41.74µs        ? ?/sec
physical_plan_clickbench_q17                          1.00  1572.5±50.46µs        ? ?/sec     1.01  1592.5±54.24µs        ? ?/sec
physical_plan_clickbench_q18                          1.02  1433.9±36.15µs        ? ?/sec     1.00  1403.8±35.27µs        ? ?/sec
physical_plan_clickbench_q19                          1.00  1839.6±74.99µs        ? ?/sec     1.03  1894.3±60.35µs        ? ?/sec
physical_plan_clickbench_q2                           1.00  1446.2±43.74µs        ? ?/sec     1.01  1463.1±45.18µs        ? ?/sec
physical_plan_clickbench_q20                          1.00  1160.8±23.39µs        ? ?/sec     1.00  1159.6±18.96µs        ? ?/sec
physical_plan_clickbench_q21                          1.00  1455.2±40.23µs        ? ?/sec     1.00  1457.5±40.43µs        ? ?/sec
physical_plan_clickbench_q22                          1.00  1894.1±90.76µs        ? ?/sec     1.01  1912.7±88.25µs        ? ?/sec
physical_plan_clickbench_q23                          1.04      2.2±0.06ms        ? ?/sec     1.00      2.1±0.10ms        ? ?/sec
physical_plan_clickbench_q24                          1.00      3.0±0.15ms        ? ?/sec     1.06      3.2±0.04ms        ? ?/sec
physical_plan_clickbench_q25                          1.00  1550.3±45.04µs        ? ?/sec     1.00  1544.1±39.31µs        ? ?/sec
physical_plan_clickbench_q26                          1.02  1422.9±36.53µs        ? ?/sec     1.00  1388.8±34.21µs        ? ?/sec
physical_plan_clickbench_q27                          1.01  1572.6±54.27µs        ? ?/sec     1.00  1563.1±42.34µs        ? ?/sec
physical_plan_clickbench_q28                          1.00      2.1±0.12ms        ? ?/sec     1.02      2.1±0.13ms        ? ?/sec
physical_plan_clickbench_q29                          1.00      2.3±0.12ms        ? ?/sec     1.05      2.4±0.14ms        ? ?/sec
physical_plan_clickbench_q3                           1.00  1410.6±45.54µs        ? ?/sec     1.00  1417.2±32.51µs        ? ?/sec
physical_plan_clickbench_q30                          1.00     16.3±0.31ms        ? ?/sec     1.02     16.7±0.21ms        ? ?/sec
physical_plan_clickbench_q31                          1.00      2.2±0.13ms        ? ?/sec     1.02      2.2±0.14ms        ? ?/sec
physical_plan_clickbench_q32                          1.00      2.2±0.13ms        ? ?/sec     1.03      2.2±0.13ms        ? ?/sec
physical_plan_clickbench_q33                          1.00  1810.8±86.66µs        ? ?/sec     1.02  1854.1±88.58µs        ? ?/sec
physical_plan_clickbench_q34                          1.01  1550.9±41.76µs        ? ?/sec     1.00  1540.8±46.74µs        ? ?/sec
physical_plan_clickbench_q35                          1.00  1605.7±56.21µs        ? ?/sec     1.04  1668.5±33.35µs        ? ?/sec
physical_plan_clickbench_q36                          1.00  1957.1±96.94µs        ? ?/sec     1.01  1976.8±95.52µs        ? ?/sec
physical_plan_clickbench_q37                          1.00      2.3±0.14ms        ? ?/sec     1.00      2.3±0.14ms        ? ?/sec
physical_plan_clickbench_q38                          1.00      2.3±0.15ms        ? ?/sec     1.01      2.3±0.14ms        ? ?/sec
physical_plan_clickbench_q39                          1.03      2.3±0.12ms        ? ?/sec     1.00      2.2±0.13ms        ? ?/sec
physical_plan_clickbench_q4                           1.00  1215.1±26.66µs        ? ?/sec     1.01  1224.8±13.11µs        ? ?/sec
physical_plan_clickbench_q40                          1.00      2.7±0.17ms        ? ?/sec     1.01      2.8±0.15ms        ? ?/sec
physical_plan_clickbench_q41                          1.00      2.3±0.14ms        ? ?/sec     1.04      2.4±0.07ms        ? ?/sec
physical_plan_clickbench_q42                          1.00      2.3±0.14ms        ? ?/sec     1.01      2.3±0.14ms        ? ?/sec
physical_plan_clickbench_q43                          1.00      2.5±0.16ms        ? ?/sec     1.00      2.5±0.16ms        ? ?/sec
physical_plan_clickbench_q44                          1.01  1323.9±36.27µs        ? ?/sec     1.00  1305.5±26.50µs        ? ?/sec
physical_plan_clickbench_q45                          1.04  1358.7±12.47µs        ? ?/sec     1.00  1304.5±30.52µs        ? ?/sec
physical_plan_clickbench_q46                          1.00  1663.9±57.25µs        ? ?/sec     1.04  1725.8±51.16µs        ? ?/sec
physical_plan_clickbench_q47                          1.00      2.4±0.14ms        ? ?/sec     1.03      2.4±0.14ms        ? ?/sec
physical_plan_clickbench_q48                          1.00      2.4±0.15ms        ? ?/sec     1.02      2.5±0.15ms        ? ?/sec
physical_plan_clickbench_q49                          1.00      2.6±0.17ms        ? ?/sec     1.02      2.7±0.18ms        ? ?/sec
physical_plan_clickbench_q5                           1.00  1338.7±31.39µs        ? ?/sec     1.02  1369.7±32.20µs        ? ?/sec
physical_plan_clickbench_q50                          1.02      2.6±0.17ms        ? ?/sec     1.00      2.6±0.17ms        ? ?/sec
physical_plan_clickbench_q51                          1.00  1785.1±53.84µs        ? ?/sec     1.01  1811.4±70.34µs        ? ?/sec
physical_plan_clickbench_q6                           1.00  1352.8±37.88µs        ? ?/sec     1.00  1346.5±30.93µs        ? ?/sec
physical_plan_clickbench_q7                           1.00  1129.2±25.77µs        ? ?/sec     1.00  1130.0±19.44µs        ? ?/sec
physical_plan_clickbench_q8                           1.02  1627.7±62.39µs        ? ?/sec     1.00  1600.9±51.85µs        ? ?/sec
physical_plan_clickbench_q9                           1.00  1694.2±35.38µs        ? ?/sec     1.03  1736.8±43.78µs        ? ?/sec
physical_plan_struct_join_agg_sort                    1.00   1293.1±9.25µs        ? ?/sec     1.03  1335.3±10.88µs        ? ?/sec
physical_plan_tpcds_all                               1.00    745.2±9.45ms        ? ?/sec     1.07   795.9±10.79ms        ? ?/sec
physical_plan_tpch_all                                1.00     47.8±1.08ms        ? ?/sec     1.02     48.9±1.35ms        ? ?/sec
physical_plan_tpch_q1                                 1.00   1529.7±7.78µs        ? ?/sec     1.01   1545.0±7.96µs        ? ?/sec
physical_plan_tpch_q10                                1.00      2.8±0.07ms        ? ?/sec     1.05      2.9±0.08ms        ? ?/sec
physical_plan_tpch_q11                                1.00      2.5±0.05ms        ? ?/sec     1.02      2.6±0.07ms        ? ?/sec
physical_plan_tpch_q12                                1.00   1286.4±5.75µs        ? ?/sec     1.03   1324.9±8.05µs        ? ?/sec
physical_plan_tpch_q13                                1.00    993.1±4.65µs        ? ?/sec     1.02   1012.5±6.00µs        ? ?/sec
physical_plan_tpch_q14                                1.00   1318.5±9.18µs        ? ?/sec     1.01  1335.0±10.46µs        ? ?/sec
physical_plan_tpch_q16                                1.00  1668.8±18.86µs        ? ?/sec     1.04  1734.0±22.51µs        ? ?/sec
physical_plan_tpch_q17                                1.00  1799.3±26.30µs        ? ?/sec     1.04  1877.8±27.67µs        ? ?/sec
physical_plan_tpch_q18                                1.00  1932.1±21.18µs        ? ?/sec     1.07      2.1±0.03ms        ? ?/sec
physical_plan_tpch_q19                                1.00      2.5±0.06ms        ? ?/sec     1.03      2.6±0.06ms        ? ?/sec
physical_plan_tpch_q2                                 1.00      4.3±0.13ms        ? ?/sec     1.05      4.5±0.13ms        ? ?/sec
physical_plan_tpch_q20                                1.00      2.3±0.06ms        ? ?/sec     1.03      2.3±0.06ms        ? ?/sec
physical_plan_tpch_q21                                1.00      3.0±0.08ms        ? ?/sec     1.07      3.2±0.10ms        ? ?/sec
physical_plan_tpch_q22                                1.00      2.1±0.05ms        ? ?/sec     1.02      2.1±0.06ms        ? ?/sec
physical_plan_tpch_q3                                 1.00  1884.0±25.43µs        ? ?/sec     1.05  1970.9±30.69µs        ? ?/sec
physical_plan_tpch_q4                                 1.00   1031.0±5.10µs        ? ?/sec     1.04   1070.1±6.31µs        ? ?/sec
physical_plan_tpch_q5                                 1.00      2.4±0.05ms        ? ?/sec     1.06      2.6±0.06ms        ? ?/sec
physical_plan_tpch_q6                                 1.00    636.8±2.20µs        ? ?/sec     1.02    648.3±5.46µs        ? ?/sec
physical_plan_tpch_q7                                 1.00      3.0±0.09ms        ? ?/sec     1.09      3.3±0.12ms        ? ?/sec
physical_plan_tpch_q8                                 1.00      3.9±0.14ms        ? ?/sec     1.08      4.3±0.16ms        ? ?/sec
physical_plan_tpch_q9                                 1.00      2.9±0.08ms        ? ?/sec     1.06      3.1±0.09ms        ? ?/sec
physical_select_aggregates_from_200                   1.00     14.6±0.15ms        ? ?/sec     1.00     14.6±0.15ms        ? ?/sec
physical_select_all_from_1000                         1.02     18.2±0.10ms        ? ?/sec     1.00     17.9±0.08ms        ? ?/sec
physical_select_one_from_700                          1.04    801.2±2.24µs        ? ?/sec     1.00    773.8±2.46µs        ? ?/sec
physical_sorted_union_order_by_10_int64               1.00      4.6±0.09ms        ? ?/sec     1.02      4.7±0.09ms        ? ?/sec
physical_sorted_union_order_by_10_uint64              1.00     11.4±0.16ms        ? ?/sec     1.04     11.9±0.16ms        ? ?/sec
physical_sorted_union_order_by_50_int64               1.00    111.5±1.99ms        ? ?/sec     1.03    114.5±2.07ms        ? ?/sec
physical_sorted_union_order_by_50_uint64              1.00   623.1±13.03ms        ? ?/sec     1.06   661.4±12.13ms        ? ?/sec
physical_theta_join_consider_sort                     1.00   1083.8±3.34µs        ? ?/sec     1.24   1344.7±4.92µs        ? ?/sec
physical_unnest_to_join                               1.00   1244.3±9.56µs        ? ?/sec     1.10   1371.8±8.17µs        ? ?/sec
physical_window_function_partition_by_12_on_values    1.00    716.6±2.47µs        ? ?/sec     1.02    733.6±2.00µs        ? ?/sec
physical_window_function_partition_by_30_on_values    1.00   1426.6±4.86µs        ? ?/sec     1.02   1453.6±6.86µs        ? ?/sec
physical_window_function_partition_by_4_on_values     1.00    434.9±1.78µs        ? ?/sec     1.04    450.7±5.00µs        ? ?/sec
physical_window_function_partition_by_7_on_values     1.00    542.0±2.01µs        ? ?/sec     1.02    553.6±6.29µs        ? ?/sec
physical_window_function_partition_by_8_on_values     1.00    588.4±3.09µs        ? ?/sec     1.00    588.3±1.68µs        ? ?/sec
with_param_values_many_columns                        1.01    467.7±2.07µs        ? ?/sec     1.00    464.3±3.05µs        ? ?/sec

Resource Usage

base (merge-base)

Metric	Value
Wall time	1264.5s
Peak memory	18.5 GiB
Avg memory	18.5 GiB
CPU user	1501.3s
CPU sys	1.7s
Disk read	0 B
Disk write	574.3 MiB

branch

Metric	Value
Wall time	1271.3s
Peak memory	18.5 GiB
Avg memory	18.5 GiB
CPU user	1510.2s
CPU sys	1.4s
Disk read	0 B
Disk write	25.0 MiB

File an issue against this benchmark runner

alamb · 2026-03-30T11:47:03Z

run benchmark sql_planner

adriangbot · 2026-03-30T11:51:26Z

🤖 Criterion benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4154423584-603-5hh46 6.12.55+ #1 SMP Sun Feb 1 08:59:41 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing adamg/optimizer-memory-optimizations (9b9e4f5) to 2b986c8 (merge-base) diff
BENCH_NAME=sql_planner
BENCH_COMMAND=cargo bench --features=parquet --bench sql_planner
BENCH_FILTER=
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-03-30T12:42:03Z

🤖 Criterion benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

group                                                 adamg_optimizer-memory-optimizations    main
-----                                                 ------------------------------------    ----
logical_aggregate_with_join                           1.04    445.4±5.93µs        ? ?/sec     1.00    429.7±3.41µs        ? ?/sec
logical_plan_struct_join_agg_sort                     1.06    168.3±1.03µs        ? ?/sec     1.00    159.3±0.49µs        ? ?/sec
logical_select_all_from_1000                          1.04      8.2±0.05ms        ? ?/sec     1.00      7.9±0.04ms        ? ?/sec
logical_select_one_from_700                           1.05    330.9±8.08µs        ? ?/sec     1.00    316.6±1.58µs        ? ?/sec
logical_trivial_join_high_numbered_columns            1.05    273.9±5.66µs        ? ?/sec     1.00    261.0±0.61µs        ? ?/sec
logical_trivial_join_low_numbered_columns             1.06    264.5±6.17µs        ? ?/sec     1.00    248.3±0.50µs        ? ?/sec
physical_intersection                                 1.04    603.3±1.30µs        ? ?/sec     1.00    579.0±1.56µs        ? ?/sec
physical_join_consider_sort                           1.03   1029.8±1.53µs        ? ?/sec     1.00   1000.1±2.04µs        ? ?/sec
physical_join_distinct                                1.05    255.9±5.68µs        ? ?/sec     1.00    243.0±0.66µs        ? ?/sec
physical_many_self_joins                              1.02      7.6±0.01ms        ? ?/sec     1.00      7.5±0.05ms        ? ?/sec
physical_plan_clickbench_all                          1.00    107.0±0.20ms        ? ?/sec     1.01    107.5±1.30ms        ? ?/sec
physical_plan_clickbench_q1                           1.00  1094.0±14.05µs        ? ?/sec     1.00   1089.6±4.66µs        ? ?/sec
physical_plan_clickbench_q10                          1.00   1752.2±5.23µs        ? ?/sec     1.02   1783.0±7.05µs        ? ?/sec
physical_plan_clickbench_q11                          1.00  1838.3±11.55µs        ? ?/sec     1.03   1893.1±6.90µs        ? ?/sec
physical_plan_clickbench_q12                          1.00   1922.9±8.37µs        ? ?/sec     1.02   1965.8±7.11µs        ? ?/sec
physical_plan_clickbench_q13                          1.00   1687.9±9.47µs        ? ?/sec     1.01  1700.3±11.61µs        ? ?/sec
physical_plan_clickbench_q14                          1.00   1838.8±5.42µs        ? ?/sec     1.02   1867.3±5.03µs        ? ?/sec
physical_plan_clickbench_q15                          1.00  1739.0±11.33µs        ? ?/sec     1.01   1757.7±8.07µs        ? ?/sec
physical_plan_clickbench_q16                          1.02   1501.1±6.65µs        ? ?/sec     1.00   1476.7±6.63µs        ? ?/sec
physical_plan_clickbench_q17                          1.01   1547.8±6.08µs        ? ?/sec     1.00   1528.2±6.60µs        ? ?/sec
physical_plan_clickbench_q18                          1.02   1391.9±5.63µs        ? ?/sec     1.00   1369.4±7.59µs        ? ?/sec
physical_plan_clickbench_q19                          1.00   1775.0±5.87µs        ? ?/sec     1.00  1777.8±52.76µs        ? ?/sec
physical_plan_clickbench_q2                           1.00   1436.4±5.41µs        ? ?/sec     1.00   1431.7±4.63µs        ? ?/sec
physical_plan_clickbench_q20                          1.00   1143.2±5.72µs        ? ?/sec     1.01   1155.9±5.11µs        ? ?/sec
physical_plan_clickbench_q21                          1.00   1416.0±4.85µs        ? ?/sec     1.02   1440.4±4.69µs        ? ?/sec
physical_plan_clickbench_q22                          1.00   1810.5±5.06µs        ? ?/sec     1.02   1849.9±5.74µs        ? ?/sec
physical_plan_clickbench_q23                          1.00   1999.5±3.60µs        ? ?/sec     1.07      2.1±0.10ms        ? ?/sec
physical_plan_clickbench_q24                          1.00      2.9±0.01ms        ? ?/sec     1.16      3.3±0.11ms        ? ?/sec
physical_plan_clickbench_q25                          1.00  1506.4±15.08µs        ? ?/sec     1.11  1667.9±83.51µs        ? ?/sec
physical_plan_clickbench_q26                          1.00   1360.6±4.37µs        ? ?/sec     1.04  1417.0±29.69µs        ? ?/sec
physical_plan_clickbench_q27                          1.00   1528.5±5.67µs        ? ?/sec     1.07  1630.8±29.53µs        ? ?/sec
physical_plan_clickbench_q28                          1.00      2.0±0.01ms        ? ?/sec     1.01      2.0±0.02ms        ? ?/sec
physical_plan_clickbench_q29                          1.00      2.2±0.00ms        ? ?/sec     1.01      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q3                           1.00   1374.2±5.44µs        ? ?/sec     1.01   1386.6±8.82µs        ? ?/sec
physical_plan_clickbench_q30                          1.00     15.9±0.04ms        ? ?/sec     1.03     16.4±0.14ms        ? ?/sec
physical_plan_clickbench_q31                          1.00      2.1±0.00ms        ? ?/sec     1.03      2.1±0.01ms        ? ?/sec
physical_plan_clickbench_q32                          1.00      2.1±0.00ms        ? ?/sec     1.04      2.2±0.03ms        ? ?/sec
physical_plan_clickbench_q33                          1.00   1742.4±6.01µs        ? ?/sec     1.03  1795.7±12.78µs        ? ?/sec
physical_plan_clickbench_q34                          1.00   1500.0±4.36µs        ? ?/sec     1.02  1532.5±12.37µs        ? ?/sec
physical_plan_clickbench_q35                          1.00   1563.3±4.32µs        ? ?/sec     1.02   1600.6±9.89µs        ? ?/sec
physical_plan_clickbench_q36                          1.00   1877.8±4.70µs        ? ?/sec     1.02   1913.4±5.14µs        ? ?/sec
physical_plan_clickbench_q37                          1.00      2.2±0.00ms        ? ?/sec     1.02      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q38                          1.00      2.2±0.00ms        ? ?/sec     1.02      2.2±0.00ms        ? ?/sec
physical_plan_clickbench_q39                          1.00      2.1±0.01ms        ? ?/sec     1.02      2.1±0.01ms        ? ?/sec
physical_plan_clickbench_q4                           1.00   1182.0±4.66µs        ? ?/sec     1.02  1206.2±11.46µs        ? ?/sec
physical_plan_clickbench_q40                          1.00      2.6±0.01ms        ? ?/sec     1.02      2.6±0.01ms        ? ?/sec
physical_plan_clickbench_q41                          1.00      2.2±0.00ms        ? ?/sec     1.02      2.3±0.00ms        ? ?/sec
physical_plan_clickbench_q42                          1.00      2.2±0.00ms        ? ?/sec     1.00      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q43                          1.00      2.4±0.01ms        ? ?/sec     1.00      2.4±0.01ms        ? ?/sec
physical_plan_clickbench_q44                          1.01   1310.9±4.17µs        ? ?/sec     1.00   1296.3±8.26µs        ? ?/sec
physical_plan_clickbench_q45                          1.01   1317.6±4.93µs        ? ?/sec     1.00   1299.8±5.34µs        ? ?/sec
physical_plan_clickbench_q46                          1.01   1633.4±4.95µs        ? ?/sec     1.00   1617.6±5.58µs        ? ?/sec
physical_plan_clickbench_q47                          1.00      2.3±0.00ms        ? ?/sec     1.00      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q48                          1.00      2.3±0.00ms        ? ?/sec     1.01      2.4±0.01ms        ? ?/sec
physical_plan_clickbench_q49                          1.00      2.5±0.00ms        ? ?/sec     1.01      2.5±0.02ms        ? ?/sec
physical_plan_clickbench_q5                           1.00   1323.0±4.70µs        ? ?/sec     1.01   1339.7±5.60µs        ? ?/sec
physical_plan_clickbench_q50                          1.00      2.4±0.00ms        ? ?/sec     1.00      2.4±0.02ms        ? ?/sec
physical_plan_clickbench_q51                          1.00   1721.6±5.35µs        ? ?/sec     1.00  1723.6±15.95µs        ? ?/sec
physical_plan_clickbench_q6                           1.00   1333.5±5.16µs        ? ?/sec     1.00   1339.9±5.10µs        ? ?/sec
physical_plan_clickbench_q7                           1.00   1116.1±5.11µs        ? ?/sec     1.01   1127.6±5.00µs        ? ?/sec
physical_plan_clickbench_q8                           1.00   1565.6±6.38µs        ? ?/sec     1.02   1593.1±7.49µs        ? ?/sec
physical_plan_clickbench_q9                           1.00   1630.3±5.74µs        ? ?/sec     1.02   1661.2±5.00µs        ? ?/sec
physical_plan_struct_join_agg_sort                    1.00   1302.8±2.52µs        ? ?/sec     1.00   1305.3±2.19µs        ? ?/sec
physical_plan_tpcds_all                               1.00    734.6±4.30ms        ? ?/sec     1.05    773.1±5.04ms        ? ?/sec
physical_plan_tpch_all                                1.00     46.7±0.37ms        ? ?/sec     1.02     47.8±0.45ms        ? ?/sec
physical_plan_tpch_q1                                 1.01   1504.0±2.41µs        ? ?/sec     1.00   1495.7±3.66µs        ? ?/sec
physical_plan_tpch_q10                                1.00      2.8±0.00ms        ? ?/sec     1.03      2.8±0.00ms        ? ?/sec
physical_plan_tpch_q11                                1.00      2.5±0.00ms        ? ?/sec     1.01      2.5±0.00ms        ? ?/sec
physical_plan_tpch_q12                                1.00   1270.5±3.22µs        ? ?/sec     1.01   1289.4±2.00µs        ? ?/sec
physical_plan_tpch_q13                                1.00   1000.3±2.05µs        ? ?/sec     1.00    999.7±1.97µs        ? ?/sec
physical_plan_tpch_q14                                1.02   1314.7±5.34µs        ? ?/sec     1.00   1292.8±2.42µs        ? ?/sec
physical_plan_tpch_q16                                1.00   1656.6±2.36µs        ? ?/sec     1.02   1686.3±2.88µs        ? ?/sec
physical_plan_tpch_q17                                1.00   1773.5±6.21µs        ? ?/sec     1.02   1807.0±4.27µs        ? ?/sec
physical_plan_tpch_q18                                1.00  1891.2±10.04µs        ? ?/sec     1.04  1966.7±12.47µs        ? ?/sec
physical_plan_tpch_q19                                1.00      2.5±0.06ms        ? ?/sec     1.00      2.5±0.02ms        ? ?/sec
physical_plan_tpch_q2                                 1.00      4.2±0.00ms        ? ?/sec     1.04      4.3±0.01ms        ? ?/sec
physical_plan_tpch_q20                                1.04      2.4±0.06ms        ? ?/sec     1.00      2.3±0.00ms        ? ?/sec
physical_plan_tpch_q21                                1.00      3.0±0.06ms        ? ?/sec     1.03      3.1±0.02ms        ? ?/sec
physical_plan_tpch_q22                                1.01      2.1±0.03ms        ? ?/sec     1.00      2.1±0.02ms        ? ?/sec
physical_plan_tpch_q3                                 1.00  1881.5±12.53µs        ? ?/sec     1.02   1911.6±3.58µs        ? ?/sec
physical_plan_tpch_q4                                 1.00   1023.2±2.52µs        ? ?/sec     1.02   1040.3±2.25µs        ? ?/sec
physical_plan_tpch_q5                                 1.00      2.3±0.00ms        ? ?/sec     1.09      2.5±0.00ms        ? ?/sec
physical_plan_tpch_q6                                 1.03    643.2±1.35µs        ? ?/sec     1.00    622.7±2.43µs        ? ?/sec
physical_plan_tpch_q7                                 1.00      2.9±0.00ms        ? ?/sec     1.05      3.1±0.00ms        ? ?/sec
physical_plan_tpch_q8                                 1.00      3.9±0.01ms        ? ?/sec     1.05      4.1±0.01ms        ? ?/sec
physical_plan_tpch_q9                                 1.00      2.8±0.00ms        ? ?/sec     1.04      3.0±0.00ms        ? ?/sec
physical_select_aggregates_from_200                   1.00     14.5±0.03ms        ? ?/sec     1.02     14.8±0.08ms        ? ?/sec
physical_select_all_from_1000                         1.03     18.2±0.08ms        ? ?/sec     1.00     17.7±0.05ms        ? ?/sec
physical_select_one_from_700                          1.04    798.7±1.94µs        ? ?/sec     1.00    768.8±2.36µs        ? ?/sec
physical_sorted_union_order_by_10_int64               1.00      4.5±0.01ms        ? ?/sec     1.02      4.6±0.04ms        ? ?/sec
physical_sorted_union_order_by_10_uint64              1.00     11.3±0.03ms        ? ?/sec     1.05     11.9±0.11ms        ? ?/sec
physical_sorted_union_order_by_50_int64               1.00    109.6±0.37ms        ? ?/sec     1.04    113.9±0.54ms        ? ?/sec
physical_sorted_union_order_by_50_uint64              1.00    599.1±2.34ms        ? ?/sec     1.09    652.3±5.58ms        ? ?/sec
physical_theta_join_consider_sort                     1.00   1070.4±2.48µs        ? ?/sec     1.23   1311.4±5.80µs        ? ?/sec
physical_unnest_to_join                               1.00   1238.6±2.75µs        ? ?/sec     1.09   1346.6±6.05µs        ? ?/sec
physical_window_function_partition_by_12_on_values    1.01    715.8±1.74µs        ? ?/sec     1.00    707.9±1.99µs        ? ?/sec
physical_window_function_partition_by_30_on_values    1.00   1422.5±3.55µs        ? ?/sec     1.00   1425.4±3.58µs        ? ?/sec
physical_window_function_partition_by_4_on_values     1.04    430.7±1.60µs        ? ?/sec     1.00    415.3±1.16µs        ? ?/sec
physical_window_function_partition_by_7_on_values     1.03    536.7±1.24µs        ? ?/sec     1.00    522.5±2.45µs        ? ?/sec
physical_window_function_partition_by_8_on_values     1.02    573.2±1.37µs        ? ?/sec     1.00    562.1±1.89µs        ? ?/sec
with_param_values_many_columns                        1.02    468.5±2.03µs        ? ?/sec     1.00    458.9±2.62µs        ? ?/sec

Resource Usage

base (merge-base)

Metric	Value
Wall time	1268.1s
Peak memory	18.5 GiB
Avg memory	18.5 GiB
CPU user	1513.6s
CPU sys	1.7s
Disk read	0 B
Disk write	633.5 MiB

branch

Metric	Value
Wall time	1257.1s
Peak memory	18.5 GiB
Avg memory	18.5 GiB
CPU user	1503.5s
CPU sys	1.3s
Disk read	0 B
Disk write	23.8 MiB

File an issue against this benchmark runner

## Which issue does this PR close? - Closes #. ## Rationale for this change Similar to #21128, just trying to shave time off the optimizer. Locally, it improves some sql-planner benchmarks by up to 10% but they seem relatively noisy on my laptop. ## What changes are included in this PR? 1. Avoid allocation `plan.children()` in a loop in `sort_pushdown.rs`. 2. Try and avoid some expensive tree rewrites in `join_selection.rs` 3. Avoid deep clones of exec limit nodes in `limit_pushdown.rs`, and only mutate the plan if it was actually changed. 4. Use cheaper code path to change the limit on an `AggregateExec` in `limited_distinct_aggregation.rs`. 5. Use a read-only traversal in `sanity_checker.rs`. Its read only and `transform_up` is always more expensive. I've considered extending `TreeNode` but this seems to be basically the only place in the codebase that does something like this. There are a few places where we unconditionally return `Transformed::yes` which might unintended downstream consequences because it breaks pointer equality and I think it also just end up allocating more memory, but they are harder to untangle so I'll try and do them in followups. ## Are these changes tested? One new test for limits, otherwise the existing tests. ## Are there any user-facing changes? Removes the `LimitExec` type, I can't imagine why someone would use it, and its only used in one place. Happy to bring it back as a deprecated type. --------- Signed-off-by: Adam Gutglick <[email protected]>

alamb · 2026-03-30T19:53:47Z

🤔 some of the logical plan changes seem to get slower repeatably.

I will rerun to see if I can reproduce

alamb · 2026-03-30T19:53:51Z

run benchmark sql_planner

alamb · 2026-03-30T19:54:12Z

run benchmark sql_planner

adriangbot · 2026-03-30T19:58:10Z

🤖 Criterion benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4157765566-613-8p8hd 6.12.55+ #1 SMP Sun Feb 1 08:59:41 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing adamg/optimizer-memory-optimizations (e53a07f) to 5ff80e4 (merge-base) diff
BENCH_NAME=sql_planner
BENCH_COMMAND=cargo bench --features=parquet --bench sql_planner
BENCH_FILTER=
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-03-30T19:58:15Z

🤖 Criterion benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4157767849-614-xxx99 6.12.55+ #1 SMP Sun Feb 1 08:59:41 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing adamg/optimizer-memory-optimizations (e53a07f) to 5ff80e4 (merge-base) diff
BENCH_NAME=sql_planner
BENCH_COMMAND=cargo bench --features=parquet --bench sql_planner
BENCH_FILTER=
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-03-30T20:52:52Z

🤖 Criterion benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

group                                                 adamg_optimizer-memory-optimizations    main
-----                                                 ------------------------------------    ----
logical_aggregate_with_join                           1.00    439.5±0.79µs        ? ?/sec     1.00    438.6±0.96µs        ? ?/sec
logical_plan_struct_join_agg_sort                     1.04    170.6±0.70µs        ? ?/sec     1.00    164.3±0.71µs        ? ?/sec
logical_select_all_from_1000                          1.00      8.2±0.02ms        ? ?/sec     1.01      8.2±0.05ms        ? ?/sec
logical_select_one_from_700                           1.00    318.3±1.09µs        ? ?/sec     1.01    321.6±1.59µs        ? ?/sec
logical_trivial_join_high_numbered_columns            1.00    264.9±0.65µs        ? ?/sec     1.01    267.2±0.92µs        ? ?/sec
logical_trivial_join_low_numbered_columns             1.00    252.2±0.68µs        ? ?/sec     1.01    254.5±0.86µs        ? ?/sec
physical_intersection                                 1.00    586.5±1.40µs        ? ?/sec     1.03    605.7±1.60µs        ? ?/sec
physical_join_consider_sort                           1.00   1020.6±3.29µs        ? ?/sec     1.02   1043.3±2.67µs        ? ?/sec
physical_join_distinct                                1.00    245.1±0.65µs        ? ?/sec     1.02    250.3±0.72µs        ? ?/sec
physical_many_self_joins                              1.00      7.5±0.02ms        ? ?/sec     1.02      7.6±0.01ms        ? ?/sec
physical_plan_clickbench_all                          1.00    109.0±0.23ms        ? ?/sec     1.01    109.9±0.36ms        ? ?/sec
physical_plan_clickbench_q1                           1.00   1106.2±4.78µs        ? ?/sec     1.02   1133.2±7.65µs        ? ?/sec
physical_plan_clickbench_q10                          1.00   1779.3±5.06µs        ? ?/sec     1.00  1787.7±22.59µs        ? ?/sec
physical_plan_clickbench_q11                          1.00   1893.3±5.71µs        ? ?/sec     1.00   1893.2±5.43µs        ? ?/sec
physical_plan_clickbench_q12                          1.00  1993.4±14.30µs        ? ?/sec     1.00   1984.0±5.65µs        ? ?/sec
physical_plan_clickbench_q13                          1.00   1738.5±7.93µs        ? ?/sec     1.00   1734.3±4.56µs        ? ?/sec
physical_plan_clickbench_q14                          1.00   1902.1±6.59µs        ? ?/sec     1.00  1899.5±10.04µs        ? ?/sec
physical_plan_clickbench_q15                          1.00   1794.8±5.92µs        ? ?/sec     1.00   1794.5±6.76µs        ? ?/sec
physical_plan_clickbench_q16                          1.00   1497.2±5.07µs        ? ?/sec     1.01   1509.2±8.14µs        ? ?/sec
physical_plan_clickbench_q17                          1.00   1541.3±5.64µs        ? ?/sec     1.02  1570.0±16.20µs        ? ?/sec
physical_plan_clickbench_q18                          1.00   1383.3±5.30µs        ? ?/sec     1.00   1382.5±5.74µs        ? ?/sec
physical_plan_clickbench_q19                          1.00  1780.3±14.58µs        ? ?/sec     1.00   1775.1±4.93µs        ? ?/sec
physical_plan_clickbench_q2                           1.02  1483.6±17.31µs        ? ?/sec     1.00   1461.0±5.63µs        ? ?/sec
physical_plan_clickbench_q20                          1.00   1213.2±6.58µs        ? ?/sec     1.00   1208.5±4.74µs        ? ?/sec
physical_plan_clickbench_q21                          1.00   1472.1±3.94µs        ? ?/sec     1.02  1499.3±32.81µs        ? ?/sec
physical_plan_clickbench_q22                          1.00  1892.1±20.71µs        ? ?/sec     1.01  1901.8±23.75µs        ? ?/sec
physical_plan_clickbench_q23                          1.00      2.1±0.01ms        ? ?/sec     1.02      2.1±0.06ms        ? ?/sec
physical_plan_clickbench_q24                          1.00      3.0±0.01ms        ? ?/sec     1.02      3.1±0.06ms        ? ?/sec
physical_plan_clickbench_q25                          1.00   1597.4±4.74µs        ? ?/sec     1.00  1592.7±22.74µs        ? ?/sec
physical_plan_clickbench_q26                          1.01   1442.7±3.30µs        ? ?/sec     1.00  1422.9±10.38µs        ? ?/sec
physical_plan_clickbench_q27                          1.01   1610.0±4.88µs        ? ?/sec     1.00   1597.6±7.38µs        ? ?/sec
physical_plan_clickbench_q28                          1.00      2.1±0.00ms        ? ?/sec     1.00      2.1±0.01ms        ? ?/sec
physical_plan_clickbench_q29                          1.00      2.2±0.00ms        ? ?/sec     1.01      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q3                           1.01   1406.2±7.11µs        ? ?/sec     1.00   1388.6±5.01µs        ? ?/sec
physical_plan_clickbench_q30                          1.00     16.0±0.03ms        ? ?/sec     1.02     16.4±0.17ms        ? ?/sec
physical_plan_clickbench_q31                          1.00      2.2±0.00ms        ? ?/sec     1.00      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q32                          1.02      2.2±0.01ms        ? ?/sec     1.00      2.2±0.00ms        ? ?/sec
physical_plan_clickbench_q33                          1.02  1834.7±20.22µs        ? ?/sec     1.00   1793.4±5.38µs        ? ?/sec
physical_plan_clickbench_q34                          1.00  1544.3±17.08µs        ? ?/sec     1.00  1550.5±10.68µs        ? ?/sec
physical_plan_clickbench_q35                          1.00   1611.2±9.96µs        ? ?/sec     1.00  1611.1±14.11µs        ? ?/sec
physical_plan_clickbench_q36                          1.00   1894.0±4.62µs        ? ?/sec     1.03  1944.5±19.12µs        ? ?/sec
physical_plan_clickbench_q37                          1.00      2.3±0.02ms        ? ?/sec     1.01      2.3±0.00ms        ? ?/sec
physical_plan_clickbench_q38                          1.00      2.3±0.01ms        ? ?/sec     1.02      2.3±0.02ms        ? ?/sec
physical_plan_clickbench_q39                          1.00      2.2±0.01ms        ? ?/sec     1.02      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q4                           1.01   1207.6±7.88µs        ? ?/sec     1.00   1200.8±8.40µs        ? ?/sec
physical_plan_clickbench_q40                          1.00      2.7±0.01ms        ? ?/sec     1.01      2.7±0.01ms        ? ?/sec
physical_plan_clickbench_q41                          1.00      2.3±0.01ms        ? ?/sec     1.01      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q42                          1.00      2.3±0.01ms        ? ?/sec     1.01      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q43                          1.00      2.5±0.03ms        ? ?/sec     1.00      2.5±0.00ms        ? ?/sec
physical_plan_clickbench_q44                          1.00   1292.4±4.35µs        ? ?/sec     1.01  1299.1±10.32µs        ? ?/sec
physical_plan_clickbench_q45                          1.00   1300.3±3.62µs        ? ?/sec     1.00   1299.6±5.06µs        ? ?/sec
physical_plan_clickbench_q46                          1.00   1619.3±4.20µs        ? ?/sec     1.00  1626.6±17.36µs        ? ?/sec
physical_plan_clickbench_q47                          1.00      2.3±0.00ms        ? ?/sec     1.01      2.3±0.02ms        ? ?/sec
physical_plan_clickbench_q48                          1.00      2.4±0.00ms        ? ?/sec     1.00      2.4±0.01ms        ? ?/sec
physical_plan_clickbench_q49                          1.00      2.6±0.00ms        ? ?/sec     1.01      2.6±0.03ms        ? ?/sec
physical_plan_clickbench_q5                           1.00   1334.9±3.12µs        ? ?/sec     1.00  1333.6±10.56µs        ? ?/sec
physical_plan_clickbench_q50                          1.00      2.5±0.01ms        ? ?/sec     1.01      2.5±0.02ms        ? ?/sec
physical_plan_clickbench_q51                          1.00  1741.8±10.10µs        ? ?/sec     1.00   1740.3±4.98µs        ? ?/sec
physical_plan_clickbench_q6                           1.03   1366.4±3.84µs        ? ?/sec     1.00   1329.2±3.91µs        ? ?/sec
physical_plan_clickbench_q7                           1.00   1143.1±3.91µs        ? ?/sec     1.00  1142.4±13.09µs        ? ?/sec
physical_plan_clickbench_q8                           1.00   1624.9±6.43µs        ? ?/sec     1.01  1641.2±16.90µs        ? ?/sec
physical_plan_clickbench_q9                           1.00   1655.8±3.95µs        ? ?/sec     1.00  1663.4±11.00µs        ? ?/sec
physical_plan_struct_join_agg_sort                    1.00   1387.3±2.38µs        ? ?/sec     1.00   1388.0±2.34µs        ? ?/sec
physical_plan_tpcds_all                               1.00   773.1±11.24ms        ? ?/sec     1.05   811.0±10.06ms        ? ?/sec
physical_plan_tpch_all                                1.00     47.9±0.10ms        ? ?/sec     1.05     50.3±0.15ms        ? ?/sec
physical_plan_tpch_q1                                 1.00   1542.6±2.38µs        ? ?/sec     1.02   1573.6±3.33µs        ? ?/sec
physical_plan_tpch_q10                                1.00      3.0±0.00ms        ? ?/sec     1.03      3.0±0.00ms        ? ?/sec
physical_plan_tpch_q11                                1.00      2.6±0.01ms        ? ?/sec     1.02      2.6±0.00ms        ? ?/sec
physical_plan_tpch_q12                                1.00   1325.6±4.01µs        ? ?/sec     1.01   1340.3±4.46µs        ? ?/sec
physical_plan_tpch_q13                                1.00   1026.9±2.80µs        ? ?/sec     1.00   1030.1±2.87µs        ? ?/sec
physical_plan_tpch_q14                                1.00   1371.5±2.26µs        ? ?/sec     1.00   1376.3±2.69µs        ? ?/sec
physical_plan_tpch_q16                                1.00   1701.9±1.98µs        ? ?/sec     1.02   1733.5±2.94µs        ? ?/sec
physical_plan_tpch_q17                                1.00   1857.2±3.27µs        ? ?/sec     1.04   1939.9±8.26µs        ? ?/sec
physical_plan_tpch_q18                                1.00   1946.0±2.75µs        ? ?/sec     1.07      2.1±0.01ms        ? ?/sec
physical_plan_tpch_q19                                1.00      2.5±0.08ms        ? ?/sec     1.04      2.6±0.00ms        ? ?/sec
physical_plan_tpch_q2                                 1.00      4.5±0.03ms        ? ?/sec     1.04      4.7±0.02ms        ? ?/sec
physical_plan_tpch_q20                                1.00      2.3±0.01ms        ? ?/sec     1.04      2.4±0.01ms        ? ?/sec
physical_plan_tpch_q21                                1.00      3.0±0.01ms        ? ?/sec     1.08      3.3±0.01ms        ? ?/sec
physical_plan_tpch_q22                                1.00      2.1±0.00ms        ? ?/sec     1.03      2.1±0.01ms        ? ?/sec
physical_plan_tpch_q3                                 1.00   1936.2±2.54µs        ? ?/sec     1.04      2.0±0.00ms        ? ?/sec
physical_plan_tpch_q4                                 1.00   1050.9±2.78µs        ? ?/sec     1.02   1072.0±2.13µs        ? ?/sec
physical_plan_tpch_q5                                 1.00      2.5±0.00ms        ? ?/sec     1.05      2.6±0.00ms        ? ?/sec
physical_plan_tpch_q6                                 1.01    663.6±1.57µs        ? ?/sec     1.00    653.9±1.43µs        ? ?/sec
physical_plan_tpch_q7                                 1.00      3.2±0.02ms        ? ?/sec     1.04      3.3±0.00ms        ? ?/sec
physical_plan_tpch_q8                                 1.00      4.2±0.03ms        ? ?/sec     1.05      4.4±0.01ms        ? ?/sec
physical_plan_tpch_q9                                 1.00      3.1±0.00ms        ? ?/sec     1.06      3.2±0.00ms        ? ?/sec
physical_select_aggregates_from_200                   1.00     14.8±0.04ms        ? ?/sec     1.00     14.8±0.05ms        ? ?/sec
physical_select_all_from_1000                         1.00     18.0±0.04ms        ? ?/sec     1.03     18.6±0.11ms        ? ?/sec
physical_select_one_from_700                          1.00    777.6±3.34µs        ? ?/sec     1.01    788.1±2.21µs        ? ?/sec
physical_sorted_union_order_by_10_int64               1.00      4.8±0.01ms        ? ?/sec     1.02      4.9±0.01ms        ? ?/sec
physical_sorted_union_order_by_10_uint64              1.00     11.7±0.03ms        ? ?/sec     1.04     12.2±0.04ms        ? ?/sec
physical_sorted_union_order_by_50_int64               1.00    116.5±0.55ms        ? ?/sec     1.02    118.8±0.51ms        ? ?/sec
physical_sorted_union_order_by_50_uint64              1.00    604.1±3.14ms        ? ?/sec     1.07    646.4±8.27ms        ? ?/sec
physical_theta_join_consider_sort                     1.00   1058.3±2.49µs        ? ?/sec     1.27   1342.1±2.30µs        ? ?/sec
physical_unnest_to_join                               1.00   1245.0±6.49µs        ? ?/sec     1.10   1374.7±2.17µs        ? ?/sec
physical_window_function_partition_by_12_on_values    1.01    758.8±1.95µs        ? ?/sec     1.00    753.2±1.96µs        ? ?/sec
physical_window_function_partition_by_30_on_values    1.00   1511.1±5.92µs        ? ?/sec     1.01   1524.5±2.96µs        ? ?/sec
physical_window_function_partition_by_4_on_values     1.01    450.8±4.24µs        ? ?/sec     1.00    446.7±1.70µs        ? ?/sec
physical_window_function_partition_by_7_on_values     1.00    559.2±4.54µs        ? ?/sec     1.00    558.7±1.33µs        ? ?/sec
physical_window_function_partition_by_8_on_values     1.00    605.4±1.43µs        ? ?/sec     1.01    613.0±2.67µs        ? ?/sec
with_param_values_many_columns                        1.04    483.7±4.40µs        ? ?/sec     1.00    463.6±2.55µs        ? ?/sec

Resource Usage

base (merge-base)

Metric	Value
Wall time	1260.3s
Peak memory	18.5 GiB
Avg memory	18.4 GiB
CPU user	1504.4s
CPU sys	1.8s
Disk read	0 B
Disk write	629.4 MiB

branch

Metric	Value
Wall time	1256.5s
Peak memory	18.5 GiB
Avg memory	18.5 GiB
CPU user	1502.5s
CPU sys	1.2s
Disk read	0 B
Disk write	23.9 MiB

File an issue against this benchmark runner

adriangbot · 2026-03-30T20:53:08Z

🤖 Criterion benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

group                                                 adamg_optimizer-memory-optimizations    main
-----                                                 ------------------------------------    ----
logical_aggregate_with_join                           1.00    433.2±0.89µs        ? ?/sec     1.01    437.0±3.14µs        ? ?/sec
logical_plan_struct_join_agg_sort                     1.03    167.0±1.40µs        ? ?/sec     1.00    162.9±0.85µs        ? ?/sec
logical_select_all_from_1000                          1.00      8.2±0.01ms        ? ?/sec     1.00      8.2±0.04ms        ? ?/sec
logical_select_one_from_700                           1.00    318.4±1.93µs        ? ?/sec     1.02    325.4±5.30µs        ? ?/sec
logical_trivial_join_high_numbered_columns            1.00    265.7±0.86µs        ? ?/sec     1.01    268.0±2.50µs        ? ?/sec
logical_trivial_join_low_numbered_columns             1.00    251.8±1.00µs        ? ?/sec     1.01    255.0±2.63µs        ? ?/sec
physical_intersection                                 1.00    588.4±3.59µs        ? ?/sec     1.02    599.1±3.27µs        ? ?/sec
physical_join_consider_sort                           1.00   1022.9±2.16µs        ? ?/sec     1.01   1035.9±2.76µs        ? ?/sec
physical_join_distinct                                1.00    245.7±1.22µs        ? ?/sec     1.01    248.8±2.46µs        ? ?/sec
physical_many_self_joins                              1.00      7.6±0.05ms        ? ?/sec     1.01      7.7±0.02ms        ? ?/sec
physical_plan_clickbench_all                          1.00    109.0±0.22ms        ? ?/sec     1.02    111.7±0.62ms        ? ?/sec
physical_plan_clickbench_q1                           1.01   1117.5±5.94µs        ? ?/sec     1.00  1107.2±12.44µs        ? ?/sec
physical_plan_clickbench_q10                          1.00   1773.9±3.94µs        ? ?/sec     1.01  1796.6±12.47µs        ? ?/sec
physical_plan_clickbench_q11                          1.00   1900.2±3.59µs        ? ?/sec     1.01  1917.5±11.81µs        ? ?/sec
physical_plan_clickbench_q12                          1.00      2.0±0.01ms        ? ?/sec     1.00      2.0±0.01ms        ? ?/sec
physical_plan_clickbench_q13                          1.03  1808.6±20.58µs        ? ?/sec     1.00  1756.4±10.80µs        ? ?/sec
physical_plan_clickbench_q14                          1.00   1943.2±7.78µs        ? ?/sec     1.01  1955.4±17.68µs        ? ?/sec
physical_plan_clickbench_q15                          1.00   1794.7±4.74µs        ? ?/sec     1.02  1826.5±10.35µs        ? ?/sec
physical_plan_clickbench_q16                          1.00   1493.7±4.04µs        ? ?/sec     1.02  1525.0±11.43µs        ? ?/sec
physical_plan_clickbench_q17                          1.00   1541.5±3.80µs        ? ?/sec     1.00  1548.7±10.63µs        ? ?/sec
physical_plan_clickbench_q18                          1.00   1384.0±3.84µs        ? ?/sec     1.02  1418.2±10.59µs        ? ?/sec
physical_plan_clickbench_q19                          1.00   1770.6±4.14µs        ? ?/sec     1.04  1838.8±15.60µs        ? ?/sec
physical_plan_clickbench_q2                           1.01   1481.4±5.26µs        ? ?/sec     1.00  1472.2±11.18µs        ? ?/sec
physical_plan_clickbench_q20                          1.00   1208.1±3.99µs        ? ?/sec     1.02  1235.5±10.78µs        ? ?/sec
physical_plan_clickbench_q21                          1.00   1491.8±3.65µs        ? ?/sec     1.01  1503.1±11.03µs        ? ?/sec
physical_plan_clickbench_q22                          1.00   1897.2±6.06µs        ? ?/sec     1.01  1918.0±12.94µs        ? ?/sec
physical_plan_clickbench_q23                          1.00      2.1±0.02ms        ? ?/sec     1.00      2.1±0.01ms        ? ?/sec
physical_plan_clickbench_q24                          1.00      3.1±0.03ms        ? ?/sec     1.01      3.1±0.01ms        ? ?/sec
physical_plan_clickbench_q25                          1.00   1568.4±8.51µs        ? ?/sec     1.02  1593.2±11.50µs        ? ?/sec
physical_plan_clickbench_q26                          1.00   1411.9±6.72µs        ? ?/sec     1.01  1428.9±10.90µs        ? ?/sec
physical_plan_clickbench_q27                          1.00  1585.8±13.28µs        ? ?/sec     1.02   1612.4±9.65µs        ? ?/sec
physical_plan_clickbench_q28                          1.00      2.0±0.01ms        ? ?/sec     1.02      2.1±0.01ms        ? ?/sec
physical_plan_clickbench_q29                          1.00      2.2±0.00ms        ? ?/sec     1.02      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q3                           1.01   1405.3±8.67µs        ? ?/sec     1.00  1394.0±11.30µs        ? ?/sec
physical_plan_clickbench_q30                          1.00     16.2±0.05ms        ? ?/sec     1.02     16.5±0.06ms        ? ?/sec
physical_plan_clickbench_q31                          1.00      2.2±0.00ms        ? ?/sec     1.02      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q32                          1.00      2.2±0.00ms        ? ?/sec     1.01      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q33                          1.00   1760.2±4.78µs        ? ?/sec     1.01  1785.9±11.87µs        ? ?/sec
physical_plan_clickbench_q34                          1.00   1506.4±3.99µs        ? ?/sec     1.01  1526.8±10.92µs        ? ?/sec
physical_plan_clickbench_q35                          1.00   1579.0±7.43µs        ? ?/sec     1.01  1593.8±10.77µs        ? ?/sec
physical_plan_clickbench_q36                          1.00  1915.9±15.20µs        ? ?/sec     1.00  1915.6±10.67µs        ? ?/sec
physical_plan_clickbench_q37                          1.00      2.3±0.00ms        ? ?/sec     1.01      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q38                          1.00      2.3±0.01ms        ? ?/sec     1.00      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q39                          1.00      2.2±0.02ms        ? ?/sec     1.00      2.2±0.01ms        ? ?/sec
physical_plan_clickbench_q4                           1.00   1203.0±5.05µs        ? ?/sec     1.02  1227.9±11.00µs        ? ?/sec
physical_plan_clickbench_q40                          1.00      2.7±0.02ms        ? ?/sec     1.00      2.7±0.01ms        ? ?/sec
physical_plan_clickbench_q41                          1.00      2.3±0.01ms        ? ?/sec     1.02      2.4±0.01ms        ? ?/sec
physical_plan_clickbench_q42                          1.00      2.3±0.01ms        ? ?/sec     1.01      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q43                          1.00      2.5±0.01ms        ? ?/sec     1.03      2.5±0.02ms        ? ?/sec
physical_plan_clickbench_q44                          1.00   1288.3±3.63µs        ? ?/sec     1.01  1302.6±11.69µs        ? ?/sec
physical_plan_clickbench_q45                          1.00   1292.1±3.14µs        ? ?/sec     1.01  1306.6±12.89µs        ? ?/sec
physical_plan_clickbench_q46                          1.00   1608.3±4.99µs        ? ?/sec     1.02  1637.4±11.39µs        ? ?/sec
physical_plan_clickbench_q47                          1.00      2.3±0.01ms        ? ?/sec     1.00      2.3±0.01ms        ? ?/sec
physical_plan_clickbench_q48                          1.00      2.4±0.01ms        ? ?/sec     1.01      2.4±0.01ms        ? ?/sec
physical_plan_clickbench_q49                          1.00      2.6±0.01ms        ? ?/sec     1.00      2.7±0.01ms        ? ?/sec
physical_plan_clickbench_q5                           1.00   1333.5±2.90µs        ? ?/sec     1.02  1362.8±12.09µs        ? ?/sec
physical_plan_clickbench_q50                          1.00      2.6±0.01ms        ? ?/sec     1.00      2.6±0.02ms        ? ?/sec
physical_plan_clickbench_q51                          1.00   1714.0±4.80µs        ? ?/sec     1.03  1760.7±10.41µs        ? ?/sec
physical_plan_clickbench_q6                           1.00   1336.8±3.28µs        ? ?/sec     1.02  1362.4±11.30µs        ? ?/sec
physical_plan_clickbench_q7                           1.00   1138.9±3.33µs        ? ?/sec     1.02  1156.5±11.38µs        ? ?/sec
physical_plan_clickbench_q8                           1.00   1617.4±4.65µs        ? ?/sec     1.02  1642.7±11.56µs        ? ?/sec
physical_plan_clickbench_q9                           1.00   1646.5±4.28µs        ? ?/sec     1.01  1664.2±11.41µs        ? ?/sec
physical_plan_struct_join_agg_sort                    1.00   1365.4±7.78µs        ? ?/sec     1.00   1368.0±2.99µs        ? ?/sec
physical_plan_tpcds_all                               1.00    762.3±1.62ms        ? ?/sec     1.07    812.3±5.71ms        ? ?/sec
physical_plan_tpch_all                                1.00     48.2±0.11ms        ? ?/sec     1.03     49.6±0.13ms        ? ?/sec
physical_plan_tpch_q1                                 1.00   1564.1±3.91µs        ? ?/sec     1.00   1567.0±3.32µs        ? ?/sec
physical_plan_tpch_q10                                1.00      2.9±0.00ms        ? ?/sec     1.03      3.0±0.01ms        ? ?/sec
physical_plan_tpch_q11                                1.00      2.6±0.00ms        ? ?/sec     1.01      2.6±0.01ms        ? ?/sec
physical_plan_tpch_q12                                1.00   1283.5±2.79µs        ? ?/sec     1.03   1318.2±2.64µs        ? ?/sec
physical_plan_tpch_q13                                1.00   1000.0±1.52µs        ? ?/sec     1.02   1015.0±3.82µs        ? ?/sec
physical_plan_tpch_q14                                1.01   1377.5±6.89µs        ? ?/sec     1.00   1359.1±3.59µs        ? ?/sec
physical_plan_tpch_q16                                1.00  1706.1±13.25µs        ? ?/sec     1.00   1709.2±3.93µs        ? ?/sec
physical_plan_tpch_q17                                1.00   1857.5±4.19µs        ? ?/sec     1.01  1882.1±15.45µs        ? ?/sec
physical_plan_tpch_q18                                1.00   1950.6±2.92µs        ? ?/sec     1.03      2.0±0.00ms        ? ?/sec
physical_plan_tpch_q19                                1.00      2.5±0.00ms        ? ?/sec     1.01      2.5±0.00ms        ? ?/sec
physical_plan_tpch_q2                                 1.00      4.5±0.01ms        ? ?/sec     1.04      4.6±0.02ms        ? ?/sec
physical_plan_tpch_q20                                1.00      2.3±0.01ms        ? ?/sec     1.01      2.3±0.00ms        ? ?/sec
physical_plan_tpch_q21                                1.00      3.0±0.01ms        ? ?/sec     1.06      3.2±0.01ms        ? ?/sec
physical_plan_tpch_q22                                1.00      2.1±0.01ms        ? ?/sec     1.01      2.1±0.00ms        ? ?/sec
physical_plan_tpch_q3                                 1.00   1964.0±3.16µs        ? ?/sec     1.02      2.0±0.00ms        ? ?/sec
physical_plan_tpch_q4                                 1.00   1046.6±2.63µs        ? ?/sec     1.02   1071.5±7.59µs        ? ?/sec
physical_plan_tpch_q5                                 1.00      2.5±0.00ms        ? ?/sec     1.05      2.6±0.00ms        ? ?/sec
physical_plan_tpch_q6                                 1.04    660.3±1.90µs        ? ?/sec     1.00    634.1±2.57µs        ? ?/sec
physical_plan_tpch_q7                                 1.00      3.1±0.00ms        ? ?/sec     1.05      3.3±0.01ms        ? ?/sec
physical_plan_tpch_q8                                 1.00      4.2±0.01ms        ? ?/sec     1.04      4.4±0.03ms        ? ?/sec
physical_plan_tpch_q9                                 1.00      3.0±0.00ms        ? ?/sec     1.05      3.2±0.01ms        ? ?/sec
physical_select_aggregates_from_200                   1.00     14.9±0.04ms        ? ?/sec     1.00     15.0±0.03ms        ? ?/sec
physical_select_all_from_1000                         1.00     18.1±0.05ms        ? ?/sec     1.03     18.5±0.08ms        ? ?/sec
physical_select_one_from_700                          1.00    773.2±3.01µs        ? ?/sec     1.02    787.0±2.95µs        ? ?/sec
physical_sorted_union_order_by_10_int64               1.00      4.8±0.02ms        ? ?/sec     1.02      4.9±0.01ms        ? ?/sec
physical_sorted_union_order_by_10_uint64              1.00     11.7±0.04ms        ? ?/sec     1.04     12.2±0.04ms        ? ?/sec
physical_sorted_union_order_by_50_int64               1.00    117.9±0.40ms        ? ?/sec     1.02    120.0±0.40ms        ? ?/sec
physical_sorted_union_order_by_50_uint64              1.00    623.4±5.05ms        ? ?/sec     1.05    657.5±2.76ms        ? ?/sec
physical_theta_join_consider_sort                     1.00   1057.1±1.81µs        ? ?/sec     1.26   1330.2±2.42µs        ? ?/sec
physical_unnest_to_join                               1.00  1244.1±12.31µs        ? ?/sec     1.09   1361.5±2.53µs        ? ?/sec
physical_window_function_partition_by_12_on_values    1.00    751.1±1.85µs        ? ?/sec     1.00    750.8±1.78µs        ? ?/sec
physical_window_function_partition_by_30_on_values    1.00   1507.5±3.63µs        ? ?/sec     1.01   1518.9±3.95µs        ? ?/sec
physical_window_function_partition_by_4_on_values     1.01    443.3±1.49µs        ? ?/sec     1.00    439.5±1.15µs        ? ?/sec
physical_window_function_partition_by_7_on_values     1.00    554.3±1.72µs        ? ?/sec     1.00    553.7±1.74µs        ? ?/sec
physical_window_function_partition_by_8_on_values     1.00    597.1±2.25µs        ? ?/sec     1.00    597.2±3.45µs        ? ?/sec
with_param_values_many_columns                        1.03    477.9±3.49µs        ? ?/sec     1.00    464.3±2.22µs        ? ?/sec

Resource Usage

base (merge-base)

Metric	Value
Wall time	1265.6s
Peak memory	18.5 GiB
Avg memory	18.4 GiB
CPU user	1509.7s
CPU sys	1.8s
Disk read	0 B
Disk write	648.0 MiB

branch

Metric	Value
Wall time	1255.9s
Peak memory	18.5 GiB
Avg memory	18.5 GiB
CPU user	1502.0s
CPU sys	1.2s
Disk read	0 B
Disk write	23.9 MiB

File an issue against this benchmark runner

Misc minor optimizations to query optimizer performance

eec93a2

AdamGS force-pushed the adamg/optimizer-memory-optimizations branch from d71431d to eec93a2 Compare March 23, 2026 19:25

github-actions bot added the optimizer Optimizer rules label Mar 23, 2026

AdamGS commented Mar 23, 2026

View reviewed changes

datafusion/optimizer/src/push_down_filter.rs Show resolved Hide resolved

AdamGS commented Mar 23, 2026

View reviewed changes

AdamGS added 2 commits March 23, 2026 20:03

another thing

f23a621

Signed-off-by: Adam Gutglick <[email protected]>

Fix lints

532b74e

Signed-off-by: Adam Gutglick <[email protected]>

AdamGS marked this pull request as ready for review March 27, 2026 17:05

alamb approved these changes Mar 27, 2026

View reviewed changes

AdamGS force-pushed the adamg/optimizer-memory-optimizations branch from b78496f to c26da85 Compare March 27, 2026 18:01

Avoid allocating column names

9b9e4f5

AdamGS force-pushed the adamg/optimizer-memory-optimizations branch from c26da85 to 9b9e4f5 Compare March 27, 2026 18:06

AdamGS mentioned this pull request Mar 28, 2026

Misc minor optimization in the Physical Optimizer #21216

Merged

alamb approved these changes Mar 30, 2026

View reviewed changes

Merge branch 'main' into adamg/optimizer-memory-optimizations

e53a07f

Conversation

AdamGS commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alamb commented Mar 27, 2026

Uh oh!

adriangbot commented Mar 27, 2026

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AdamGS Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AdamGS commented Mar 28, 2026

Uh oh!

Dandandan commented Mar 28, 2026

Uh oh!

adriangbot commented Mar 28, 2026

Uh oh!

adriangbot commented Mar 28, 2026

Uh oh!

alamb commented Mar 30, 2026

Uh oh!

adriangbot commented Mar 30, 2026

Uh oh!

adriangbot commented Mar 30, 2026

Uh oh!

alamb commented Mar 30, 2026

Uh oh!

alamb commented Mar 30, 2026

Uh oh!

alamb commented Mar 30, 2026

Uh oh!

adriangbot commented Mar 30, 2026

Uh oh!

adriangbot commented Mar 30, 2026

Uh oh!

adriangbot commented Mar 30, 2026

Uh oh!

adriangbot commented Mar 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

AdamGS commented Mar 23, 2026 •

edited

Loading

AdamGS Mar 27, 2026 •

edited

Loading