The same IO-bound scenario was run on 2 identical machines, one with RocksDB and the other with TokuDB. In a Zope application (ERP5), using NEO/MariaDB as storage, we filled the disks with a few data streams. The memory leak happened when reading a single whole data stream.
The result is that ERP5/NEO does 2 queries in loop. Here is an example from general log:
SELECT tid, compression, data.hash, value, value_tid FROM obj FORCE INDEX(`partition`) LEFT JOIN data ON (obj.data_id = data.id) WHERE `partition` = 6 AND oid = 94294686 AND tid < 271213171275484455 ORDER BY tid DESC LIMIT 1
|
SELECT tid FROM obj FORCE INDEX(`partition`) WHERE `partition`=6 AND oid=94294686 AND tid>271177614835290848 ORDER BY tid LIMIT 1
|
More about NEO's architecture is described at https://www.nexedi.com/blog/NXD-Document.Blog.Optimising.MariaDB.Big.Data
The data stream is 1.5TB uncompressed, taking about 650MB on-disk. RocksDB got OOM-killed when 1TB was read. TokuDB respected its memory constraints until the end.
Each machine has 2 disks of 2TB, each one with a MariaDB DB that holds half of the Zope DB. IOW, we have 2 RocksDB databases of about 2TB. 8GB of RAM for the 2 mysqld (rocksdb_block_cache_size = 2G).
- relates to
-
MDEV-13739
Huge slowness at inserting rows, CPU-bound
-
-
Confirmed
{"report":{"fcp":1220.2999992370605,"ttfb":592.3999996185303,"pageVisibility":"visible","entityId":64409,"key":"jira.project.issue.view-issue","isInitial":true,"threshold":1000,"elementTimings":{},"userDeviceMemory":8,"userDeviceProcessors":64,"apdex":0.5,"journeyId":"30453191-a9d8-4053-95e3-2d18b2fd3cd8","navigationType":0,"readyForUser":1348.6999998092651,"redirectCount":0,"resourceLoadedEnd":1378.7999992370605,"resourceLoadedStart":599.2999992370605,"resourceTiming":[{"duration":26.5,"initiatorType":"link","name":"https://jira.mariadb.org/s/2c21342762a6a02add1c328bed317ffd-CDN/lu2cib/820016/12ta74/0a8bac35585be7fc6c9cc5a0464cd4cf/_/download/contextbatch/css/_super/batch.css","startTime":599.2999992370605,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":599.2999992370605,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":625.7999992370605,"responseStart":0,"secureConnectionStart":0},{"duration":26.5,"initiatorType":"link","name":"https://jira.mariadb.org/s/7ebd35e77e471bc30ff0eba799ebc151-CDN/lu2cib/820016/12ta74/494e4c556ecbb29f90a3d3b4f09cb99c/_/download/contextbatch/css/jira.browse.project,project.issue.navigator,jira.view.issue,jira.general,jira.global,atl.general,-_super/batch.css?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":599.6999998092651,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":599.6999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":626.1999998092651,"responseStart":0,"secureConnectionStart":0},{"duration":89.80000019073486,"initiatorType":"script","name":"https://jira.mariadb.org/s/0917945aaa57108d00c5076fea35e069-CDN/lu2cib/820016/12ta74/0a8bac35585be7fc6c9cc5a0464cd4cf/_/download/contextbatch/js/_super/batch.js?locale=en","startTime":599.8999996185303,"connectEnd":599.8999996185303,"connectStart":599.8999996185303,"domainLookupEnd":599.8999996185303,"domainLookupStart":599.8999996185303,"fetchStart":599.8999996185303,"redirectEnd":0,"redirectStart":0,"requestStart":599.8999996185303,"responseEnd":689.6999998092651,"responseStart":689.6999998092651,"secureConnectionStart":599.8999996185303},{"duration":153,"initiatorType":"script","name":"https://jira.mariadb.org/s/2d8175ec2fa4c816e8023260bd8c1786-CDN/lu2cib/820016/12ta74/494e4c556ecbb29f90a3d3b4f09cb99c/_/download/contextbatch/js/jira.browse.project,project.issue.navigator,jira.view.issue,jira.general,jira.global,atl.general,-_super/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&locale=en&slack-enabled=true&whisper-enabled=true","startTime":600,"connectEnd":600,"connectStart":600,"domainLookupEnd":600,"domainLookupStart":600,"fetchStart":600,"redirectEnd":0,"redirectStart":0,"requestStart":600,"responseEnd":753,"responseStart":753,"secureConnectionStart":600},{"duration":157,"initiatorType":"script","name":"https://jira.mariadb.org/s/a9324d6758d385eb45c462685ad88f1d-CDN/lu2cib/820016/12ta74/c92c0caa9a024ae85b0ebdbed7fb4bd7/_/download/contextbatch/js/atl.global,-_super/batch.js?locale=en","startTime":600.2999992370605,"connectEnd":600.2999992370605,"connectStart":600.2999992370605,"domainLookupEnd":600.2999992370605,"domainLookupStart":600.2999992370605,"fetchStart":600.2999992370605,"redirectEnd":0,"redirectStart":0,"requestStart":600.2999992370605,"responseEnd":757.2999992370605,"responseStart":757.2999992370605,"secureConnectionStart":600.2999992370605},{"duration":157.39999961853027,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:calendar-en/jira.webresources:calendar-en.js","startTime":600.5,"connectEnd":600.5,"connectStart":600.5,"domainLookupEnd":600.5,"domainLookupStart":600.5,"fetchStart":600.5,"redirectEnd":0,"redirectStart":0,"requestStart":600.5,"responseEnd":757.8999996185303,"responseStart":757.7999992370605,"secureConnectionStart":600.5},{"duration":157.69999980926514,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:calendar-localisation-moment/jira.webresources:calendar-localisation-moment.js","startTime":600.5999994277954,"connectEnd":600.5999994277954,"connectStart":600.5999994277954,"domainLookupEnd":600.5999994277954,"domainLookupStart":600.5999994277954,"fetchStart":600.5999994277954,"redirectEnd":0,"redirectStart":0,"requestStart":600.5999994277954,"responseEnd":758.2999992370605,"responseStart":758.2999992370605,"secureConnectionStart":600.5999994277954},{"duration":266.30000019073486,"initiatorType":"link","name":"https://jira.mariadb.org/s/b04b06a02d1959df322d9cded3aeecc1-CDN/lu2cib/820016/12ta74/a2ff6aa845ffc9a1d22fe23d9ee791fc/_/download/contextbatch/css/jira.global.look-and-feel,-_super/batch.css","startTime":600.7999992370605,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":600.7999992370605,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":867.0999994277954,"responseStart":0,"secureConnectionStart":0},{"duration":157.69999980926514,"initiatorType":"script","name":"https://jira.mariadb.org/rest/api/1.0/shortcuts/820016/47140b6e0a9bc2e4913da06536125810/shortcuts.js?context=issuenavigation&context=issueaction","startTime":601,"connectEnd":601,"connectStart":601,"domainLookupEnd":601,"domainLookupStart":601,"fetchStart":601,"redirectEnd":0,"redirectStart":0,"requestStart":601,"responseEnd":758.6999998092651,"responseStart":758.6999998092651,"secureConnectionStart":601},{"duration":266.0999994277954,"initiatorType":"link","name":"https://jira.mariadb.org/s/3ac36323ba5e4eb0af2aa7ac7211b4bb-CDN/lu2cib/820016/12ta74/d176f0986478cc64f24226b3d20c140d/_/download/contextbatch/css/com.atlassian.jira.projects.sidebar.init,-_super,-project.issue.navigator,-jira.view.issue/batch.css?jira.create.linked.issue=true","startTime":601.1999998092651,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":601.1999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":867.2999992370605,"responseStart":0,"secureConnectionStart":0},{"duration":157.89999961853027,"initiatorType":"script","name":"https://jira.mariadb.org/s/5d5e8fe91fbc506585e83ea3b62ccc4b-CDN/lu2cib/820016/12ta74/d176f0986478cc64f24226b3d20c140d/_/download/contextbatch/js/com.atlassian.jira.projects.sidebar.init,-_super,-project.issue.navigator,-jira.view.issue/batch.js?jira.create.linked.issue=true&locale=en","startTime":601.3999996185303,"connectEnd":601.3999996185303,"connectStart":601.3999996185303,"domainLookupEnd":601.3999996185303,"domainLookupStart":601.3999996185303,"fetchStart":601.3999996185303,"redirectEnd":0,"redirectStart":0,"requestStart":601.3999996185303,"responseEnd":759.2999992370605,"responseStart":759.2999992370605,"secureConnectionStart":601.3999996185303},{"duration":378.30000019073486,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:bigpipe-js/jira.webresources:bigpipe-js.js","startTime":602.5999994277954,"connectEnd":602.5999994277954,"connectStart":602.5999994277954,"domainLookupEnd":602.5999994277954,"domainLookupStart":602.5999994277954,"fetchStart":602.5999994277954,"redirectEnd":0,"redirectStart":0,"requestStart":602.5999994277954,"responseEnd":980.8999996185303,"responseStart":980.8999996185303,"secureConnectionStart":602.5999994277954},{"duration":665.6999998092651,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:bigpipe-init/jira.webresources:bigpipe-init.js","startTime":606.8999996185303,"connectEnd":606.8999996185303,"connectStart":606.8999996185303,"domainLookupEnd":606.8999996185303,"domainLookupStart":606.8999996185303,"fetchStart":606.8999996185303,"redirectEnd":0,"redirectStart":0,"requestStart":606.8999996185303,"responseEnd":1272.5999994277954,"responseStart":1272.5999994277954,"secureConnectionStart":606.8999996185303},{"duration":97.80000019073486,"initiatorType":"xmlhttprequest","name":"https://jira.mariadb.org/rest/webResources/1.0/resources","startTime":883.5999994277954,"connectEnd":883.5999994277954,"connectStart":883.5999994277954,"domainLookupEnd":883.5999994277954,"domainLookupStart":883.5999994277954,"fetchStart":883.5999994277954,"redirectEnd":0,"redirectStart":0,"requestStart":883.5999994277954,"responseEnd":981.3999996185303,"responseStart":981.3999996185303,"secureConnectionStart":883.5999994277954},{"duration":227.5999994277954,"initiatorType":"link","name":"https://jira.mariadb.org/s/d5715adaadd168a9002b108b2b039b50-CDN/lu2cib/820016/12ta74/be4b45e9cec53099498fa61c8b7acba4/_/download/contextbatch/css/jira.project.sidebar,-_super,-project.issue.navigator,-jira.general,-jira.browse.project,-jira.view.issue,-jira.global,-atl.general,-com.atlassian.jira.projects.sidebar.init/batch.css?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":1151.1999998092651,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":1151.1999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":1378.7999992370605,"responseStart":0,"secureConnectionStart":0},{"duration":181.60000038146973,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/e65b778d185daf5aee24936755b43da6/_/download/contextbatch/js/browser-metrics-plugin.contrib,-_super,-project.issue.navigator,-jira.view.issue,-atl.general/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":1152.0999994277954,"connectEnd":1152.0999994277954,"connectStart":1152.0999994277954,"domainLookupEnd":1152.0999994277954,"domainLookupStart":1152.0999994277954,"fetchStart":1152.0999994277954,"redirectEnd":0,"redirectStart":0,"requestStart":1152.0999994277954,"responseEnd":1333.6999998092651,"responseStart":1333.6999998092651,"secureConnectionStart":1152.0999994277954},{"duration":189.5999994277954,"initiatorType":"script","name":"https://jira.mariadb.org/s/097ae97cb8fbec7d6ea4bbb1f26955b9-CDN/lu2cib/820016/12ta74/be4b45e9cec53099498fa61c8b7acba4/_/download/contextbatch/js/jira.project.sidebar,-_super,-project.issue.navigator,-jira.general,-jira.browse.project,-jira.view.issue,-jira.global,-atl.general,-com.atlassian.jira.projects.sidebar.init/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&locale=en&slack-enabled=true&whisper-enabled=true","startTime":1152.5,"connectEnd":1152.5,"connectStart":1152.5,"domainLookupEnd":1152.5,"domainLookupStart":1152.5,"fetchStart":1152.5,"redirectEnd":0,"redirectStart":0,"requestStart":1152.5,"responseEnd":1342.0999994277954,"responseStart":1342.0999994277954,"secureConnectionStart":1152.5},{"duration":264.80000019073486,"initiatorType":"script","name":"https://www.google-analytics.com/analytics.js","startTime":1213.6999998092651,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":1213.6999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":1478.5,"responseStart":0,"secureConnectionStart":0}],"fetchStart":0,"domainLookupStart":0,"domainLookupEnd":0,"connectStart":0,"connectEnd":0,"requestStart":413,"responseStart":593,"responseEnd":599,"domLoading":597,"domInteractive":1420,"domContentLoadedEventStart":1420,"domContentLoadedEventEnd":1478,"domComplete":1682,"loadEventStart":1682,"loadEventEnd":1682,"userAgent":"Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)","marks":[{"name":"bigPipe.sidebar-id.start","time":1395.5999994277954},{"name":"bigPipe.sidebar-id.end","time":1396.5},{"name":"bigPipe.activity-panel-pipe-id.start","time":1396.5999994277954},{"name":"bigPipe.activity-panel-pipe-id.end","time":1399.5},{"name":"activityTabFullyLoaded","time":1524.7999992370605}],"measures":[],"correlationId":"f01631d2d2e2bb","effectiveType":"4g","downlink":9.8,"rtt":0,"serverDuration":119,"dbReadsTimeInMs":13,"dbConnsTimeInMs":22,"applicationHash":"9d11dbea5f4be3d4cc21f03a88dd11d8c8687422","experiments":[]}}
Can you please make a full test case that we can reproduce?
This includes creating the needed tables, having a script that fills them with data and running the queries that causes the memory overrun.
Alternatively is to link MariaDB with tcmalloc and run the test on your machines. This would give us
a log of the memory usage that could help us locate the problem
https://mariadb.com/kb/en/library/debugging-a-running-server-on-linux/