[MDEV-35855] Make it possible to enable BNL-H join without hitting regressions - Jira

Details

Type: New Feature
Status: Open (View Workflow)
Priority: Major
Resolution: Unresolved
Fix Version/s: 12.1
Component/s: Optimizer
Labels:
- hash-join
- optimizer-12.1-sprint

Description

If one enables BNL-H joins, this may change efficient plans with ref access into BNL-H joins with full table scans. See MDEV-16337.

This makes it impossible to recommend to enable BNL-H joins except in some restricted scenarios.

This MDEV is about changing the optimizer code in a way that enabling BNL-H joins does not introduce regressions.

Limitations

Note that join buffering code is very inefficient in case of VARCHAR columns. A quote from https://jira.mariadb.org/browse/MDEV-34352?focusedCommentId=286897&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-286897

Indeed, the BNL-H join has "non-embedded keys" (check JOIN_CACHE::check_emb_key_usage()). If we are joining a varchar(100) column, the join buffer record for it WILL include space for 100 characters (lookup tuple in KeyTupleFormat), even if the joined values are much shorter

Some notes about the code

Starting from 11.0, best_access_path does this:

Sets pos->type= JT_HASH for BNL-H plans (also sets pos->use_join_buffer).
Sets pos->use_join_buffer for full table scan plans that intend to use BNL.

After get_best_combination(), for BNL-H plans we have:

tab->is_ref_for_hash_join()==TRUE.

tab->type == JT_REF  (NOT JT_HASH)

Attachments

Issue Links

relates to

MDEV-16337 Setting join_cache_level=4 changes efficient ref access plan to an inefficient hash join

Open

MDEV-34232 Check performance of BNL-H vs MySQL's hash join

Open

MDEV-34427 BNL-H has not optimal implementation for varchar type

Open

Activity

People

Assignee:: Sergei Petrunia

Reporter:: Sergei Petrunia

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 2025-01-15 11:21

Updated:: 2025-04-10 12:52

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.

MariaDB Server