A query with a large IN () section will be stuck in statistics and use up all RAM. Multiple such queries will crash the server regardless of memory limitations specified in my.cnf
I noticed this in 10.2.35. Downgrading to 10.2.34 fixes the problem
An example is the WordPress "select post_id,meta_value from wp_postmeta where meta_id in (1,2,3 etc ...)" query.
Same issue here. Server that was previously fine with 12G RAM had to be upgraded to 40G RAM to be "stable" with 12.2.35 on Debian Stretch.
RAM wildly fluctuates.
Has anyone checked by any chance if 10.3/10.4/10.5 are also affected? Considering whether we should downgrade or upgrade to the next major release to fix it.
EDIT: Can confirm memory is fine again in 10.2.34.
Niels Hendriks
added a comment - - edited Same issue here. Server that was previously fine with 12G RAM had to be upgraded to 40G RAM to be "stable" with 12.2.35 on Debian Stretch.
RAM wildly fluctuates.
Has anyone checked by any chance if 10.3/10.4/10.5 are also affected? Considering whether we should downgrade or upgrade to the next major release to fix it.
EDIT: Can confirm memory is fine again in 10.2.34.
I ran into this issue on 10.5. I tested both on Debian and Windows.
10.5.6 is fine, 10.5.7 has the bug. I attached `mariadbbug.sql` with an example table and query.
Pavel
added a comment - - edited I ran into this issue on 10.5. I tested both on Debian and Windows.
10.5.6 is fine, 10.5.7 has the bug. I attached `mariadbbug.sql` with an example table and query.
I can confirm this issue. I wondered why I see queries in the process list with "Status" "statistics". Never saw this state before. Then I had problems with memory usage up to the server was killed by the kernel OOM killer.
MariaDB 10.2.35 / Ubuntu repositories
Frank Sagurna
added a comment - I can confirm this issue. I wondered why I see queries in the process list with "Status" "statistics". Never saw this state before. Then I had problems with memory usage up to the server was killed by the kernel OOM killer.
MariaDB 10.2.35 / Ubuntu repositories
Thanks a lot! The regression appeared after 291be49474 commit (MDEV-23811). (I used 1_my.test while reproducing, the same as mariadbbug.sql, but slightly shorter, for debug server)
Alice Sherepa
added a comment - - edited Thanks a lot! The regression appeared after 291be49474 commit ( MDEV-23811 ). (I used 1_my.test while reproducing, the same as mariadbbug.sql, but slightly shorter, for debug server)
Counting SEL_ARG::SEL_ARG allocations. Indeed, before the patch for MDEV-23811, it had: total_sel_args=3,294 (this is the number of elements in the IN-list). And after, it has total_sel_args=5,433,451. Too high.
Another odd thing in key_or(): key2_shared==true. This doesn't seem correct - the testcase has just one index and we use only 1 key part from it. How can a SEL_ARG graph be shared in this case?
Sergei Petrunia
added a comment - Debugging 1_my.test.
Counting SEL_ARG::SEL_ARG allocations. Indeed, before the patch for MDEV-23811 , it had: total_sel_args=3,294 (this is the number of elements in the IN-list). And after, it has total_sel_args=5,433,451 . Too high.
Another odd thing in key_or(): key2_shared==true . This doesn't seem correct - the testcase has just one index and we use only 1 key part from it. How can a SEL_ARG graph be shared in this case?
if (no_imerge_from_ranges && no_merges1 && no_merges2)
{
Unfortunately, this doesn't work, it creates some a test failure in main.range_vs_index_merge. Not sure for which reason.
Sergei Petrunia
added a comment - The reference counters are increased here in tree_or:
if (no_merges1)
rt1= new SEL_TREE(tree1, TRUE, param);
if (no_merges2)
rt2= new SEL_TREE(tree2, TRUE, param);
(denote this as LOC1)
The only place where rt1 and t2 are used is this one:
if (!(imerge_from_ranges= new SEL_IMERGE()))
result= NULL;
else if (!no_ranges1 && !no_ranges2 && !no_imerge_from_ranges)
{
/* Build the imerge part of the tree for the formula (1) */
if (!rt1 || !rt2 ||
...
}
Denote the above as LOC2.
Note that LOC2 has if (...&& !no_imerge_from_ranges)
which gives an idea - if rt1 and and rt2 is only used when no_imerge_from_ranges==FALSE,
maube we can create them only in that case?
@@ -8877,12 +8877,15 @@ tree_or(RANGE_OPT_PARAM *param,SEL_TREE *tree1,SEL_TREE *tree2)
/* Build the range part of the tree for the formula (1) */
if (sel_trees_can_be_ored(param, tree1, tree2, &ored_keys))
{
- if (no_merges1)
- rt1= new SEL_TREE(tree1, TRUE, param);
- if (no_merges2)
- rt2= new SEL_TREE(tree2, TRUE, param);
bool must_be_ored= sel_trees_must_be_ored(param, tree1, tree2, ored_keys);
no_imerge_from_ranges= must_be_ored;
+ if (!no_imerge_from_ranges)
+ {
+ if (no_merges1)
+ rt1= new SEL_TREE(tree1, TRUE, param);
+ if (no_merges2)
+ rt2= new SEL_TREE(tree2, TRUE, param);
+ }
if (no_imerge_from_ranges && no_merges1 && no_merges2)
{
Unfortunately, this doesn't work, it creates some a test failure in main.range_vs_index_merge. Not sure for which reason.
But then due to some error, wrong patch was pushed.
Sergei Petrunia
added a comment - - edited Ok, the issue is caused by the wrong patch for MDEV-23811 .
The patch that was pushed was this:
https://github.com/MariaDB/server/commit/291be494744abe90f4bdf6b5a35c4c26ee8ddda5
But it is a preliminary version.
During the review, I had requested to remove the problematic part of the patch :
https://jira.mariadb.org/browse/MDEV-23811?focusedCommentId=167592&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-167592
which was accepted and done:
http://lists.askmonty.org/pipermail/commits/2020-October/014342.html
But then due to some error, wrong patch was pushed.
Sergei Petrunia
added a comment - A testcase for this bug (thanks serg 's for the idea) :
https://github.com/MariaDB/server/commit/1404f3bea796c8479cf401cb36d518658600ddca
I have same issue. Server was had to be upgraded to 32G RAM with 10.2.35 on Debian 9 (Stretch). RAM wildly fluctuates. The server was killed by the kernel OOM killer.
Any quick fix?
Javier Carvajal
added a comment - - edited I have same issue. Server was had to be upgraded to 32G RAM with 10.2.35 on Debian 9 (Stretch). RAM wildly fluctuates. The server was killed by the kernel OOM killer.
Any quick fix?
I'd recommend to downgrade to 10.2.34 and apt-mark hold it until there is a fix available. We've seen servers downgraded to 10.2.34 to return to normal RAM usage.
Ferdi van der Werf
added a comment - I'd recommend to downgrade to 10.2.34 and apt-mark hold it until there is a fix available. We've seen servers downgraded to 10.2.34 to return to normal RAM usage.
A set of release is about to be made (10.2.36, 10.3.27, 10.4.17, 10.5.8) which will contain the fix for this.
(EDIT: fix version#)
Sergei Petrunia
added a comment - - edited Hello,
A set of release is about to be made (10.2.36, 10.3.27, 10.4.17, 10.5.8) which will contain the fix for this.
(EDIT: fix version#)
Sergei Petrunia
added a comment - The commits that fix this are (all 3 were made in a row):
https://github.com/MariaDB/server/commit/f81eef62e7742806e5e74b5f37f35b7cd2f82291
https://github.com/MariaDB/server/commit/1404f3bea796c8479cf401cb36d518658600ddca
https://github.com/MariaDB/server/commit/bea84aefb0563a10a310ea81d46c372919345c10
Daniel Black
added a comment - For all those watching patiently, there have been packages released https://mariadb.org/mariadb-10-5-8-10-4-17-10-3-27-and-10-2-36-now-available/
People
Sergei Petrunia
Geza Lucz
Votes:
11Vote for this issue
Watchers:
18Start watching this issue
Dates
Created:
Updated:
Resolved:
Git Integration
Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.
{"report":{"fcp":1794.8999996185303,"ttfb":452.09999990463257,"pageVisibility":"visible","entityId":93497,"key":"jira.project.issue.view-issue","isInitial":true,"threshold":1000,"elementTimings":{},"userDeviceMemory":8,"userDeviceProcessors":64,"apdex":0.5,"journeyId":"421ee3e7-4af2-4b07-a808-cabe94a53904","navigationType":0,"readyForUser":1887.5,"redirectCount":0,"resourceLoadedEnd":1484.5999999046326,"resourceLoadedStart":465.7999997138977,"resourceTiming":[{"duration":203.7000002861023,"initiatorType":"link","name":"https://jira.mariadb.org/s/2c21342762a6a02add1c328bed317ffd-CDN/lu2cib/820016/12ta74/0a8bac35585be7fc6c9cc5a0464cd4cf/_/download/contextbatch/css/_super/batch.css","startTime":465.7999997138977,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":465.7999997138977,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":669.5,"responseStart":0,"secureConnectionStart":0},{"duration":203.7999997138977,"initiatorType":"link","name":"https://jira.mariadb.org/s/7ebd35e77e471bc30ff0eba799ebc151-CDN/lu2cib/820016/12ta74/494e4c556ecbb29f90a3d3b4f09cb99c/_/download/contextbatch/css/jira.browse.project,project.issue.navigator,jira.view.issue,jira.general,jira.global,atl.general,-_super/batch.css?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":466.09999990463257,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":466.09999990463257,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":669.8999996185303,"responseStart":0,"secureConnectionStart":0},{"duration":741.3000001907349,"initiatorType":"script","name":"https://jira.mariadb.org/s/0917945aaa57108d00c5076fea35e069-CDN/lu2cib/820016/12ta74/0a8bac35585be7fc6c9cc5a0464cd4cf/_/download/contextbatch/js/_super/batch.js?locale=en","startTime":466.2999997138977,"connectEnd":466.2999997138977,"connectStart":466.2999997138977,"domainLookupEnd":466.2999997138977,"domainLookupStart":466.2999997138977,"fetchStart":466.2999997138977,"redirectEnd":0,"redirectStart":0,"requestStart":673.5,"responseEnd":1207.5999999046326,"responseStart":727.5,"secureConnectionStart":466.2999997138977},{"duration":994.2999997138977,"initiatorType":"script","name":"https://jira.mariadb.org/s/2d8175ec2fa4c816e8023260bd8c1786-CDN/lu2cib/820016/12ta74/494e4c556ecbb29f90a3d3b4f09cb99c/_/download/contextbatch/js/jira.browse.project,project.issue.navigator,jira.view.issue,jira.general,jira.global,atl.general,-_super/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&locale=en&slack-enabled=true&whisper-enabled=true","startTime":466.5,"connectEnd":466.5,"connectStart":466.5,"domainLookupEnd":466.5,"domainLookupStart":466.5,"fetchStart":466.5,"redirectEnd":0,"redirectStart":0,"requestStart":673.1999998092651,"responseEnd":1460.7999997138977,"responseStart":721.5999999046326,"secureConnectionStart":466.5},{"duration":274.69999980926514,"initiatorType":"script","name":"https://jira.mariadb.org/s/a9324d6758d385eb45c462685ad88f1d-CDN/lu2cib/820016/12ta74/c92c0caa9a024ae85b0ebdbed7fb4bd7/_/download/contextbatch/js/atl.global,-_super/batch.js?locale=en","startTime":466.69999980926514,"connectEnd":466.69999980926514,"connectStart":466.69999980926514,"domainLookupEnd":466.69999980926514,"domainLookupStart":466.69999980926514,"fetchStart":466.69999980926514,"redirectEnd":0,"redirectStart":0,"requestStart":695.2999997138977,"responseEnd":741.3999996185303,"responseStart":733.1999998092651,"secureConnectionStart":466.69999980926514},{"duration":276.59999990463257,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:calendar-en/jira.webresources:calendar-en.js","startTime":466.7999997138977,"connectEnd":466.7999997138977,"connectStart":466.7999997138977,"domainLookupEnd":466.7999997138977,"domainLookupStart":466.7999997138977,"fetchStart":466.7999997138977,"redirectEnd":0,"redirectStart":0,"requestStart":695.5999999046326,"responseEnd":743.3999996185303,"responseStart":734.3999996185303,"secureConnectionStart":466.7999997138977},{"duration":277.09999990463257,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:calendar-localisation-moment/jira.webresources:calendar-localisation-moment.js","startTime":467.09999990463257,"connectEnd":467.09999990463257,"connectStart":467.09999990463257,"domainLookupEnd":467.09999990463257,"domainLookupStart":467.09999990463257,"fetchStart":467.09999990463257,"redirectEnd":0,"redirectStart":0,"requestStart":695.7999997138977,"responseEnd":744.1999998092651,"responseStart":735.5999999046326,"secureConnectionStart":467.09999990463257},{"duration":203.5,"initiatorType":"link","name":"https://jira.mariadb.org/s/b04b06a02d1959df322d9cded3aeecc1-CDN/lu2cib/820016/12ta74/a2ff6aa845ffc9a1d22fe23d9ee791fc/_/download/contextbatch/css/jira.global.look-and-feel,-_super/batch.css","startTime":467.2999997138977,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":467.2999997138977,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":670.7999997138977,"responseStart":0,"secureConnectionStart":0},{"duration":276.90000009536743,"initiatorType":"script","name":"https://jira.mariadb.org/rest/api/1.0/shortcuts/820016/47140b6e0a9bc2e4913da06536125810/shortcuts.js?context=issuenavigation&context=issueaction","startTime":467.3999996185303,"connectEnd":467.3999996185303,"connectStart":467.3999996185303,"domainLookupEnd":467.3999996185303,"domainLookupStart":467.3999996185303,"fetchStart":467.3999996185303,"redirectEnd":0,"redirectStart":0,"requestStart":701.7999997138977,"responseEnd":744.2999997138977,"responseStart":737.2999997138977,"secureConnectionStart":467.3999996185303},{"duration":204.09999990463257,"initiatorType":"link","name":"https://jira.mariadb.org/s/3ac36323ba5e4eb0af2aa7ac7211b4bb-CDN/lu2cib/820016/12ta74/d176f0986478cc64f24226b3d20c140d/_/download/contextbatch/css/com.atlassian.jira.projects.sidebar.init,-_super,-project.issue.navigator,-jira.view.issue/batch.css?jira.create.linked.issue=true","startTime":467.5,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":467.5,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":671.5999999046326,"responseStart":0,"secureConnectionStart":0},{"duration":282.30000019073486,"initiatorType":"script","name":"https://jira.mariadb.org/s/5d5e8fe91fbc506585e83ea3b62ccc4b-CDN/lu2cib/820016/12ta74/d176f0986478cc64f24226b3d20c140d/_/download/contextbatch/js/com.atlassian.jira.projects.sidebar.init,-_super,-project.issue.navigator,-jira.view.issue/batch.js?jira.create.linked.issue=true&locale=en","startTime":467.69999980926514,"connectEnd":467.69999980926514,"connectStart":467.69999980926514,"domainLookupEnd":467.69999980926514,"domainLookupStart":467.69999980926514,"fetchStart":467.69999980926514,"redirectEnd":0,"redirectStart":0,"requestStart":702,"responseEnd":750,"responseStart":738.5,"secureConnectionStart":467.69999980926514},{"duration":1008.1999998092651,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:bigpipe-js/jira.webresources:bigpipe-js.js","startTime":475.19999980926514,"connectEnd":475.19999980926514,"connectStart":475.19999980926514,"domainLookupEnd":475.19999980926514,"domainLookupStart":475.19999980926514,"fetchStart":475.19999980926514,"redirectEnd":0,"redirectStart":0,"requestStart":1064.1999998092651,"responseEnd":1483.3999996185303,"responseStart":1474.5,"secureConnectionStart":475.19999980926514},{"duration":1006.3000001907349,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:bigpipe-init/jira.webresources:bigpipe-init.js","startTime":478.2999997138977,"connectEnd":478.2999997138977,"connectStart":478.2999997138977,"domainLookupEnd":478.2999997138977,"domainLookupStart":478.2999997138977,"fetchStart":478.2999997138977,"redirectEnd":0,"redirectStart":0,"requestStart":1064.6999998092651,"responseEnd":1484.5999999046326,"responseStart":1476.6999998092651,"secureConnectionStart":478.2999997138977},{"duration":166.80000019073486,"initiatorType":"xmlhttprequest","name":"https://jira.mariadb.org/rest/webResources/1.0/resources","startTime":1350.6999998092651,"connectEnd":1350.6999998092651,"connectStart":1350.6999998092651,"domainLookupEnd":1350.6999998092651,"domainLookupStart":1350.6999998092651,"fetchStart":1350.6999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":1485.5999999046326,"responseEnd":1517.5,"responseStart":1516.7999997138977,"secureConnectionStart":1350.6999998092651},{"duration":328.80000019073486,"initiatorType":"xmlhttprequest","name":"https://jira.mariadb.org/rest/webResources/1.0/resources","startTime":1707.7999997138977,"connectEnd":1707.7999997138977,"connectStart":1707.7999997138977,"domainLookupEnd":1707.7999997138977,"domainLookupStart":1707.7999997138977,"fetchStart":1707.7999997138977,"redirectEnd":0,"redirectStart":0,"requestStart":2005.8999996185303,"responseEnd":2036.5999999046326,"responseStart":2035.5,"secureConnectionStart":1707.7999997138977}],"fetchStart":0,"domainLookupStart":236,"domainLookupEnd":244,"connectStart":244,"connectEnd":263,"secureConnectionStart":252,"requestStart":264,"responseStart":452,"responseEnd":474,"domLoading":457,"domInteractive":1981,"domContentLoadedEventStart":1981,"domContentLoadedEventEnd":2061,"domComplete":3002,"loadEventStart":3002,"loadEventEnd":3002,"userAgent":"Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)","marks":[{"name":"bigPipe.sidebar-id.start","time":1940.0999999046326},{"name":"bigPipe.sidebar-id.end","time":1940.8999996185303},{"name":"bigPipe.activity-panel-pipe-id.start","time":1941.0999999046326},{"name":"bigPipe.activity-panel-pipe-id.end","time":1946.0999999046326},{"name":"activityTabFullyLoaded","time":2152.3999996185303}],"measures":[],"correlationId":"108bcfc51c4d04","effectiveType":"4g","downlink":10,"rtt":0,"serverDuration":120,"dbReadsTimeInMs":14,"dbConnsTimeInMs":22,"applicationHash":"9d11dbea5f4be3d4cc21f03a88dd11d8c8687422","experiments":[]}}
Same issue here. Server that was previously fine with 12G RAM had to be upgraded to 40G RAM to be "stable" with 12.2.35 on Debian Stretch.
RAM wildly fluctuates.
Has anyone checked by any chance if 10.3/10.4/10.5 are also affected? Considering whether we should downgrade or upgrade to the next major release to fix it.
EDIT: Can confirm memory is fine again in 10.2.34.