perf(core): use simpler checks for table existence by michael-johnston · Pull Request #670 · IBM/ado

michael-johnston · 2026-03-07T10:07:25Z

Previously, SQLSampleStore.init called _create_source_table() unconditionally on every construction. _create_source_table() calls meta.create_all(checkfirst=True), which issues a separate table-existence query for each of the four backing tables even when the tables already exist — which they always do on read-only CLI commands. Locally this cost was ~1.45s. The changes reduce this to 0.24s.

The main fix is to probe for table existence instead of blinding issuing _create_source_tables(). This removes the 1.45 and adds the probe. We use a direct SQL probe (taking 240ms locally measured) instead of sqlalchemy.inspect() which took 800ms locally.

The PR also introduces a process-level _source_tables_verified that records tablename prefixes whose backing tables have already been confirmed to exist.
On the second and subsequent SQLSampleStore constructions for the same store within the same process the probe is skipped entirely (0 round-trips).
The main impact of this is in the tests.

Previously, SQLSampleStore.__init__ called _create_source_table() unconditionally on every construction. _create_source_table() calls meta.create_all(checkfirst=True), which issues a separate table-existence query for each of the four backing tables even when the tables already exist — which they always do on read-only CLI commands. Locally this cost was ~1.45s. The changes reduce this to 0.24s. The main fix is to probe for table existence instead of blinding issuing _create_source_tables(). This removes the 1.45 and adds the probe. We use a direct SQL probe (taking 240ms locally measured) instead of sqlalchemy.inspect() which took 800ms locally. The PR also introduces a process-level _source_tables_verified that records tablename prefixes whose backing tables have already been confirmed to exist. On the second and subsequent SQLSampleStore constructions for the same store within the same process the probe is skipped entirely (0 round-trips). The main impact of this is in the tests.

AlessandroPomponio

I'd be careful with this: the use of inspect is dialect independent and is the suggested way to check for table existence. Where do you see the 15 round trips? It should be a DESCRIBE of the 4 tables releated to the sqlsource

I'm all for caching the existence of the tables (like we added in SQLResourceStore) but I'd keep what we have in term of inspection.

Signed-off-by: Michael Johnston <[email protected]>

michael-johnston · 2026-03-09T10:52:36Z

Where do you see the 15 round trips?

That was a typo. Fixed

I'd be careful with this: the use of inspect is dialect independent and is the suggested way to check for table existence.

This is the typical optimisation trade off. If we keep the generic method, inspect, we have to pay the ~1.2 secs for it.

AlessandroPomponio · 2026-03-10T09:22:38Z

This is the typical optimisation trade off. If we keep the generic method, inspect, we have to pay the ~1.2 secs for it.

By not checking what the table structure is, though, we will not be able to perform migrations down the line

michael-johnston · 2026-03-11T19:31:14Z

By not checking what the table structure is, though, we will not be able to perform migrations down the line

Could we not add it back if necessary?

AlessandroPomponio · 2026-03-16T14:15:58Z

This should be updated now that #672 has been merged

…l_checks

DRL-NextGen · 2026-03-18T09:53:52Z

No vulnerabilities found.

DRL-NextGen · 2026-03-24T20:48:17Z

No vulnerabilities found.

The _source_tables_verified cache used only the table name as its key, so two SQLSampleStore instances with the same identifier but pointing to different databases would incorrectly share a cache entry. The second store would skip the table-existence check and attempt to query tables that don't exist in its database. Fix by keying the cache on (db_url, tablename), consistent with how _tables_exist_cache is already scoped in SQLStore (sqlstore.py).

Uses fast-path if db is sqlite or mysql or falls back to inspect/has_table if not.

Co-authored-by: Alessandro Pomponio <[email protected]> Co-authored-by: Michael Johnston <[email protected]> Signed-off-by: Michael Johnston <[email protected]>

AlessandroPomponio

just to make it uniform

Co-authored-by: Alessandro Pomponio <[email protected]> Signed-off-by: Michael Johnston <[email protected]>

AlessandroPomponio

LGTM thanks

michael-johnston requested a review from AlessandroPomponio March 7, 2026 10:07

AlessandroPomponio reviewed Mar 9, 2026

View reviewed changes

michael-johnston commented Mar 9, 2026

View reviewed changes

Comment thread orchestrator/core/samplestore/sql.py Outdated

Apply suggestion from @michael-johnston

dc824bf

Signed-off-by: Michael Johnston <[email protected]>

chore(black): formatting

43442c7

Merge remote-tracking branch 'origin/main' into maj_skip_redundant_dd…

630ed2f

…l_checks

AlessandroPomponio reviewed Mar 18, 2026

View reviewed changes

Comment thread orchestrator/core/samplestore/sql.py Outdated

michael-johnston added 2 commits March 21, 2026 11:00

Merge branch 'main' into maj_skip_redundant_ddl_checks

5f2eac7

Merge branch 'main' into maj_skip_redundant_ddl_checks

b8328dd

michael-johnston added 2 commits March 30, 2026 19:13

feat(sql): add function for checking table existence.

27d0bc3

michael-johnston requested a review from AlessandroPomponio March 30, 2026 18:21

AlessandroPomponio reviewed Mar 31, 2026

View reviewed changes

Comment thread orchestrator/core/samplestore/sql.py Outdated

feat(sql): create check_table_exists utility function.

fa53aaf

Uses fast-path if db is sqlite or mysql or falls back to inspect/has_table if not.

michael-johnston requested a review from AlessandroPomponio March 31, 2026 14:12

michael-johnston mentioned this pull request Mar 31, 2026

feat(cplex-mip): add cplex-mip solver custom experiment #773

Open

michael-johnston enabled auto-merge March 31, 2026 19:05

AlessandroPomponio reviewed Apr 1, 2026

View reviewed changes

Comment thread orchestrator/metastore/sql/statements.py Outdated

Comment thread orchestrator/metastore/sql/statements.py Outdated

Comment thread orchestrator/metastore/sql/statements.py Outdated

Comment thread orchestrator/metastore/sql/statements.py Outdated

AlessandroPomponio changed the title ~~feat(perf): skip redundant DDL checks in SQLSampleStore.__init__~~ perf(core): use simpler checks for table existence Apr 1, 2026

michael-johnston commented Apr 1, 2026

View reviewed changes

Comment thread orchestrator/metastore/sql/statements.py Outdated

michael-johnston commented Apr 1, 2026

View reviewed changes

Comment thread orchestrator/metastore/sql/statements.py Outdated

Apply suggestions from code review

b1d9cb8

Co-authored-by: Alessandro Pomponio <[email protected]> Co-authored-by: Michael Johnston <[email protected]> Signed-off-by: Michael Johnston <[email protected]>

AlessandroPomponio reviewed Apr 1, 2026

View reviewed changes

Comment thread orchestrator/metastore/sql/statements.py Outdated

Update orchestrator/metastore/sql/statements.py

e0b4b17

Co-authored-by: Alessandro Pomponio <[email protected]> Signed-off-by: Michael Johnston <[email protected]>

michael-johnston requested a review from AlessandroPomponio April 1, 2026 15:44

chore(lint): ruff

006cf2a

AlessandroPomponio approved these changes Apr 2, 2026

View reviewed changes

michael-johnston added this pull request to the merge queue Apr 2, 2026

Merged via the queue into main with commit 6113495 Apr 2, 2026
19 checks passed

michael-johnston deleted the maj_skip_redundant_ddl_checks branch April 2, 2026 08:28

Conversation

michael-johnston commented Mar 7, 2026

Uh oh!

AlessandroPomponio left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

michael-johnston commented Mar 9, 2026

Uh oh!

AlessandroPomponio commented Mar 10, 2026

Uh oh!

michael-johnston commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlessandroPomponio commented Mar 16, 2026

Uh oh!

DRL-NextGen commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

DRL-NextGen commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AlessandroPomponio left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AlessandroPomponio left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

michael-johnston commented Mar 11, 2026 •

edited

Loading

DRL-NextGen commented Mar 18, 2026 •

edited

Loading

DRL-NextGen commented Mar 24, 2026 •

edited

Loading