Elena Stepanova
added a comment - According to the commit comment, it might be intentional:
commit 37f5569909d2b5a80e7f55b7b5d38d25ee2f0b5e
Author: Sergei Golubchik <serg@mariadb.org>
Date: Sat Nov 4 19:14:34 2017 +0100
@@in_predicate_conversion_threshold
* rename in_subquery_conversion_threshold to in_predicate_conversion_threshold
* make it debug-only, hide from users
* change from ulong to uint - same type and range on all architectures
Assigning to serg to clarify.
Andrew Hutchings (Inactive)
added a comment - - edited If there is a way for an engine to access the conversion (I think it is a derived temp table), or some kind of flag an engine can set to turn this off, then I would also be happy with that.
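For context, the conversion being discussed rewrites a long literal IN list into an IN subquery over a materialized value list (the derived temp table mentioned above). A rough, hypothetical illustration with a table t1 and a deliberately short list; the real rewrite only kicks in once the list reaches the threshold (1,000 elements by default):
-- original query (imagine 1,000+ literals in the list):
SELECT * FROM t1 WHERE t1.col IN (1, 2, 3);
-- is treated roughly as:
SELECT * FROM t1
WHERE t1.col IN (SELECT * FROM (VALUES (1), (2), (3)) AS tvc_0);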
Sergei Golubchik
added a comment - As long as you have your own server source tree, you can change IN_SUBQUERY_CONVERSION_THRESHOLD or set global_system_variables.in_subquery_conversion_threshold when your engine is initialized.
But we'll have to think of something better for 10.4.
jocelyn fournier
added a comment - - edited Hi Sergei
Why did you move this variable as debug-only? In some cases the original optimization is not as efficient as expected, and it would really be great to be able to control this variable.
See MDEV-17795
Thanks!
Sergei Golubchik
added a comment - joce,
I think MariaDB has more than enough variables, so I'm generally against adding new variables without a good reason.
In this case I asked several of our optimizer developers why a user might want to tune @@in_predicate_conversion_threshold. When should it be 1,000 and when 10,000?
Nobody was able to offer any explanation of why it should be configurable, besides "it's useful in tests". So we've made it debug-only, for testing purposes.
Of course, I was asking the wrong question. Even if a user will never need to tune the conversion threshold, there can still be a good reason to be able to disable it. I agree it would be good to have a way to disable this optimization.
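For reference, in builds where the variable is still visible (debug builds, per the commit above), disabling the conversion at the SQL level would look roughly like this; treating 0 as "off" is an assumption here, and raising the threshold above any IN-list size the application uses has the same practical effect:
-- assumed to disable the IN-list-to-subquery conversion entirely:
SET SESSION in_predicate_conversion_threshold = 0;
-- alternatively, push the threshold above any IN-list size actually used:
SET GLOBAL in_predicate_conversion_threshold = 1000000;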
Slawomir Pryczek
added a comment - - edited I think this is needed urgently, because this feature is so broken, buggy and random that it can turn a tiny SELECT into a nested full table scan which requires scanning billions of rows to get a small dataset from 100k-row tables. So it'll randomly turn 2-3s SELECTs into something which could take 5-10 minutes at best. I also saw queries which after this "optimization" took so long that they needed to be killed, so I'm not even sure how bad it can get! Before, it was a couple of seconds!
https://jira.mariadb.org/browse/MDEV-17795
https://jira.mariadb.org/browse/MDEV-20083
Basically I think the feature provides no benefit, and the implementation is probably so complicated that it should be removed altogether ASAP, given how many problems it causes, which are in addition very hard to diagnose due to how random and unexpected they are.
Really, if an IN (..) list is so large that memory is a concern, then in such rare cases a temporary table can be created manually, properly INDEXED and added to a proper join, instead of automatically producing a totally suboptimal subquery. For sure, if the IN (..) size is a concern, the resulting query could probably never finish, due to how badly it works... so this optimization has no purpose because of how randomly it works, and you're risking totally killing your server if it ever gets applied!
Probably not many people are noticing this, just because larger IN sets are not very common, and even if it hits them - the query plan could be totally different next time, even when the query is the same...
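A minimal sketch of the manual rewrite described above, with all table, column and index names hypothetical:
-- build an indexed temporary table from the former IN (..) values:
CREATE TEMPORARY TABLE tmp_ids (id INT NOT NULL, PRIMARY KEY (id)) ENGINE=MEMORY;
INSERT INTO tmp_ids (id) VALUES (1), (2), (3);
-- join it explicitly, forcing the join order and index:
SELECT t.*
FROM tmp_ids ids
STRAIGHT_JOIN big_table t FORCE INDEX (idx_big_table_id)
ON t.id = ids.id;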
Slawomir Pryczek
added a comment - - edited @Sergei Golubchik Please speak with people on the Maria team, show them all these 3 reports, and try to get this whole "optimization" removed altogether as soon as possible. Even when it works as expected, there's no performance benefit. Moreover, these are the simplest cases, with a simple IN on one column. How bad can it get on complex queries?
And to make things worse, it makes many real and needed optimizations simply impossible, because it forces suboptimal subqueries!
We have some servers in production which serve hundreds of thousands of requests per minute. 10.4 is actually unusable for us because of this "optimization", and we'd need to rewrite each query to manually create a temporary table, insert into it, add an index and do a straight join with a forced index. And it'd probably work much worse than the implementation in 10.1 anyway...
The reason I'm so sure it needs to be removed, and that it's so important, is that people with a lot of traffic will see the problem straight away and will be able to diagnose it instantly, but people with less loaded servers could be getting random lockups or crashes every couple of months. And they will never know what the problem is... having something so bad and so random in production code makes for a very bad user experience. And some people would probably even need to switch to other servers because they won't be able to make their code stable.
Igor Babaev (Inactive)
added a comment - Slawomir,
In your comments you assume only one pattern of using IN predicates. For this pattern the conversion into an IN subquery cannot be beneficial, especially when the hash join is turned off.
At the same time you claim that "this feature is so broken, buggy and random". I need examples where it is:
1. broken
2. buggy
3. random.
It would be helpful.
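For reference, the hash join mentioned here is MariaDB's block hash join, which is only considered at higher join cache levels; a rough sketch of enabling it (exact level semantics may vary by version):
SET SESSION join_cache_level = 6;   -- levels above the default 2 allow hashed join caches
SET SESSION optimizer_switch = 'join_cache_hashed=on';   -- typically on by default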
Slawomir Pryczek
added a comment - OK, I oversimplified this, looking at it from the perspective of this pattern which I see a lot and for which we're using IN most of the time. It also happened that the second pattern we use for optimization was degraded as well, however that seems irrelevant because the differences weren't significant (<2x). I realized this was designed for different scenarios, so I shouldn't call for it being removed.
For the 3 points mentioned, I meant that even with the highest join level enabled, very small differences in table structure or in the query could lead to totally different and unexpected execution times which are very counter-intuitive (e.g. when you duplicate the same WHERE condition twice, the query gets executed 40 times faster).
Sent some ANALYZE output for these 2 patterns by email; hopefully there's some usable info.