I started to look at PMP and perf profiles while running ann-benchmarks and when ef_search is large I see much time in memory allocation functions, most from VisitedSet code, during search_layer.
I am curious about two things:
1) Why is such a large value for est_size passed to the VisitedSet ctor here? This creates huge bloom filters as each uses about 2*est_size bytes, and with ef_search=400 (see example below) that is ~32MB. Such large alloc / dealloc cycles are slow when frequent.
2) Why does "bv" use longlong which is 8 bytes while (see here) while the num_blocks math (see here) appears to assume the bv vector is 4 bytes per entry. For one example:
- n = 15,044,334
- m = 143,931,441 (~= 10 * n)
- log_num_blocks = 22
- num_blocks = 4,194,304
- bv.resize(4194304) is called, which provides about 33,554,432 bits
The est_size variable is: est_heuristic * (ef ^ ef_power), from here. This ends up creating huge bloom filters when used by the VisitedSet ctor.
All of these are from queries with LIMIT 10
With ef_search=120 the math is: 39.x * (120 ^ 2) = 1135647 and visited.count is <= 2000 at the end of search_layer
With ef_search=200 the math is: 39.x * (200 ^ 2) = 3398947 and visited.count is <= 2000 at the end of search_layer()
With ef_search=300 the math is: 39.x * (300 ^ 2) = 8114222 and visited.count is <= 3000 at the end of search_layer()
With ef_search=400 the math is: 39.x * (400 ^ 2) = 15044334 and visited.count is <= 3000 at the end of search_layer()
- is duplicated by
-
MDEV-36338
vector search with Cosine Distance is slow
-
-
Closed
{"report":{"fcp":1113.1000003814697,"ttfb":225.5,"pageVisibility":"visible","entityId":132478,"key":"jira.project.issue.view-issue","isInitial":true,"threshold":1000,"elementTimings":{},"userDeviceMemory":8,"userDeviceProcessors":64,"apdex":0.5,"journeyId":"037edfba-1799-41a5-a971-b73f7e79fb67","navigationType":0,"readyForUser":1198.3000001907349,"redirectCount":0,"resourceLoadedEnd":1336.8000001907349,"resourceLoadedStart":231.69999980926514,"resourceTiming":[{"duration":148.80000019073486,"initiatorType":"link","name":"https://jira.mariadb.org/s/2c21342762a6a02add1c328bed317ffd-CDN/lu2cib/820016/12ta74/0a8bac35585be7fc6c9cc5a0464cd4cf/_/download/contextbatch/css/_super/batch.css","startTime":231.69999980926514,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":231.69999980926514,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":380.5,"responseStart":0,"secureConnectionStart":0},{"duration":148.79999923706055,"initiatorType":"link","name":"https://jira.mariadb.org/s/7ebd35e77e471bc30ff0eba799ebc151-CDN/lu2cib/820016/12ta74/494e4c556ecbb29f90a3d3b4f09cb99c/_/download/contextbatch/css/jira.browse.project,project.issue.navigator,jira.view.issue,jira.general,jira.global,atl.general,-_super/batch.css?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":232.10000038146973,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":232.10000038146973,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":380.8999996185303,"responseStart":0,"secureConnectionStart":0},{"duration":161.19999980926514,"initiatorType":"script","name":"https://jira.mariadb.org/s/0917945aaa57108d00c5076fea35e069-CDN/lu2cib/820016/12ta74/0a8bac35585be7fc6c9cc5a0464cd4cf/_/download/contextbatch/js/_super/batch.js?locale=en","startTime":232.30000019073486,"connectEnd":232.30000019073486,"connectStart":232.30000019073486,"domainLookupEnd":232.30000019073486,"domainLookupStart":232.30000019073486,"fetchStart":232.30000019073486,"redirectEnd":0,"redirectStart":0,"requestStart":232.30000019073486,"responseEnd":393.5,"responseStart":393.5,"secureConnectionStart":232.30000019073486},{"duration":245.89999961853027,"initiatorType":"script","name":"https://jira.mariadb.org/s/2d8175ec2fa4c816e8023260bd8c1786-CDN/lu2cib/820016/12ta74/494e4c556ecbb29f90a3d3b4f09cb99c/_/download/contextbatch/js/jira.browse.project,project.issue.navigator,jira.view.issue,jira.general,jira.global,atl.general,-_super/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&locale=en&slack-enabled=true&whisper-enabled=true","startTime":232.5,"connectEnd":232.5,"connectStart":232.5,"domainLookupEnd":232.5,"domainLookupStart":232.5,"fetchStart":232.5,"redirectEnd":0,"redirectStart":0,"requestStart":232.5,"responseEnd":478.3999996185303,"responseStart":478.3999996185303,"secureConnectionStart":232.5},{"duration":253,"initiatorType":"script","name":"https://jira.mariadb.org/s/a9324d6758d385eb45c462685ad88f1d-CDN/lu2cib/820016/12ta74/c92c0caa9a024ae85b0ebdbed7fb4bd7/_/download/contextbatch/js/atl.global,-_super/batch.js?locale=en","startTime":232.69999980926514,"connectEnd":232.69999980926514,"connectStart":232.69999980926514,"domainLookupEnd":232.69999980926514,"domainLookupStart":232.69999980926514,"fetchStart":232.69999980926514,"redirectEnd":0,"redirectStart":0,"requestStart":232.69999980926514,"responseEnd":485.69999980926514,"responseStart":485.69999980926514,"secureConnectionStart":232.69999980926514},{"duration":253.5,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:calendar-localisation-moment/jira.webresources:calendar-localisation-moment.js","startTime":233,"connectEnd":233,"connectStart":233,"domainLookupEnd":233,"domainLookupStart":233,"fetchStart":233,"redirectEnd":0,"redirectStart":0,"requestStart":233,"responseEnd":486.5,"responseStart":486.5,"secureConnectionStart":233},{"duration":253.10000038146973,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:calendar-en/jira.webresources:calendar-en.js","startTime":233,"connectEnd":233,"connectStart":233,"domainLookupEnd":233,"domainLookupStart":233,"fetchStart":233,"redirectEnd":0,"redirectStart":0,"requestStart":233,"responseEnd":486.1000003814697,"responseStart":486.1000003814697,"secureConnectionStart":233},{"duration":341.80000019073486,"initiatorType":"link","name":"https://jira.mariadb.org/s/b04b06a02d1959df322d9cded3aeecc1-CDN/lu2cib/820016/12ta74/a2ff6aa845ffc9a1d22fe23d9ee791fc/_/download/contextbatch/css/jira.global.look-and-feel,-_super/batch.css","startTime":233.19999980926514,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":233.19999980926514,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":575,"responseStart":0,"secureConnectionStart":0},{"duration":253.5,"initiatorType":"script","name":"https://jira.mariadb.org/rest/api/1.0/shortcuts/820016/47140b6e0a9bc2e4913da06536125810/shortcuts.js?context=issuenavigation&context=issueaction","startTime":233.39999961853027,"connectEnd":233.39999961853027,"connectStart":233.39999961853027,"domainLookupEnd":233.39999961853027,"domainLookupStart":233.39999961853027,"fetchStart":233.39999961853027,"redirectEnd":0,"redirectStart":0,"requestStart":233.39999961853027,"responseEnd":486.8999996185303,"responseStart":486.8999996185303,"secureConnectionStart":233.39999961853027},{"duration":341.5999994277954,"initiatorType":"link","name":"https://jira.mariadb.org/s/3ac36323ba5e4eb0af2aa7ac7211b4bb-CDN/lu2cib/820016/12ta74/d176f0986478cc64f24226b3d20c140d/_/download/contextbatch/css/com.atlassian.jira.projects.sidebar.init,-_super,-project.issue.navigator,-jira.view.issue/batch.css?jira.create.linked.issue=true","startTime":233.60000038146973,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":233.60000038146973,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":575.1999998092651,"responseStart":0,"secureConnectionStart":0},{"duration":253.5999994277954,"initiatorType":"script","name":"https://jira.mariadb.org/s/5d5e8fe91fbc506585e83ea3b62ccc4b-CDN/lu2cib/820016/12ta74/d176f0986478cc64f24226b3d20c140d/_/download/contextbatch/js/com.atlassian.jira.projects.sidebar.init,-_super,-project.issue.navigator,-jira.view.issue/batch.js?jira.create.linked.issue=true&locale=en","startTime":233.80000019073486,"connectEnd":233.80000019073486,"connectStart":233.80000019073486,"domainLookupEnd":233.80000019073486,"domainLookupStart":233.80000019073486,"fetchStart":233.80000019073486,"redirectEnd":0,"redirectStart":0,"requestStart":233.80000019073486,"responseEnd":487.3999996185303,"responseStart":487.3999996185303,"secureConnectionStart":233.80000019073486},{"duration":599,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:bigpipe-js/jira.webresources:bigpipe-js.js","startTime":234.80000019073486,"connectEnd":234.80000019073486,"connectStart":234.80000019073486,"domainLookupEnd":234.80000019073486,"domainLookupStart":234.80000019073486,"fetchStart":234.80000019073486,"redirectEnd":0,"redirectStart":0,"requestStart":234.80000019073486,"responseEnd":833.8000001907349,"responseStart":833.8000001907349,"secureConnectionStart":234.80000019073486},{"duration":851.8999996185303,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/1.0/_/download/batch/jira.webresources:bigpipe-init/jira.webresources:bigpipe-init.js","startTime":234.80000019073486,"connectEnd":234.80000019073486,"connectStart":234.80000019073486,"domainLookupEnd":234.80000019073486,"domainLookupStart":234.80000019073486,"fetchStart":234.80000019073486,"redirectEnd":0,"redirectStart":0,"requestStart":234.80000019073486,"responseEnd":1086.6999998092651,"responseStart":1086.6999998092651,"secureConnectionStart":234.80000019073486},{"duration":245,"initiatorType":"xmlhttprequest","name":"https://jira.mariadb.org/rest/webResources/1.0/resources","startTime":590.1999998092651,"connectEnd":590.1999998092651,"connectStart":590.1999998092651,"domainLookupEnd":590.1999998092651,"domainLookupStart":590.1999998092651,"fetchStart":590.1999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":590.1999998092651,"responseEnd":835.1999998092651,"responseStart":835.1999998092651,"secureConnectionStart":590.1999998092651},{"duration":276.80000019073486,"initiatorType":"link","name":"https://jira.mariadb.org/s/d5715adaadd168a9002b108b2b039b50-CDN/lu2cib/820016/12ta74/be4b45e9cec53099498fa61c8b7acba4/_/download/contextbatch/css/jira.project.sidebar,-_super,-project.issue.navigator,-jira.general,-jira.browse.project,-jira.view.issue,-jira.global,-atl.general,-com.atlassian.jira.projects.sidebar.init/batch.css?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":1053,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":1053,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":1329.8000001907349,"responseStart":0,"secureConnectionStart":0},{"duration":276.19999980926514,"initiatorType":"link","name":"https://jira.mariadb.org/s/50bc9be5bfead1a25e72c1a9338c94f6-CDN/lu2cib/820016/12ta74/e108c7645258ccb43280ed3404e3e949/_/download/contextbatch/css/com.atlassian.jira.plugins.jira-development-integration-plugin:0,-_super,-jira.view.issue,-jira.global,-jira.general,-jira.browse.project,-project.issue.navigator,-atl.general/batch.css?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":1053.8000001907349,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":1053.8000001907349,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":1330,"responseStart":0,"secureConnectionStart":0},{"duration":274.6000003814697,"initiatorType":"script","name":"https://jira.mariadb.org/s/d41d8cd98f00b204e9800998ecf8427e-CDN/lu2cib/820016/12ta74/e65b778d185daf5aee24936755b43da6/_/download/contextbatch/js/browser-metrics-plugin.contrib,-_super,-project.issue.navigator,-jira.view.issue,-atl.general/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&slack-enabled=true&whisper-enabled=true","startTime":1054.6999998092651,"connectEnd":1054.6999998092651,"connectStart":1054.6999998092651,"domainLookupEnd":1054.6999998092651,"domainLookupStart":1054.6999998092651,"fetchStart":1054.6999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":1054.6999998092651,"responseEnd":1329.3000001907349,"responseStart":1329.3000001907349,"secureConnectionStart":1054.6999998092651},{"duration":280.0999994277954,"initiatorType":"script","name":"https://jira.mariadb.org/s/097ae97cb8fbec7d6ea4bbb1f26955b9-CDN/lu2cib/820016/12ta74/be4b45e9cec53099498fa61c8b7acba4/_/download/contextbatch/js/jira.project.sidebar,-_super,-project.issue.navigator,-jira.general,-jira.browse.project,-jira.view.issue,-jira.global,-atl.general,-com.atlassian.jira.projects.sidebar.init/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&locale=en&slack-enabled=true&whisper-enabled=true","startTime":1055.3000001907349,"connectEnd":1055.3000001907349,"connectStart":1055.3000001907349,"domainLookupEnd":1055.3000001907349,"domainLookupStart":1055.3000001907349,"fetchStart":1055.3000001907349,"redirectEnd":0,"redirectStart":0,"requestStart":1055.3000001907349,"responseEnd":1335.3999996185303,"responseStart":1335.3999996185303,"secureConnectionStart":1055.3000001907349},{"duration":281.1000003814697,"initiatorType":"script","name":"https://jira.mariadb.org/s/e0bf5781d46ea69fb123572974cf39de-CDN/lu2cib/820016/12ta74/e108c7645258ccb43280ed3404e3e949/_/download/contextbatch/js/com.atlassian.jira.plugins.jira-development-integration-plugin:0,-_super,-jira.view.issue,-jira.global,-jira.general,-jira.browse.project,-project.issue.navigator,-atl.general/batch.js?agile_global_admin_condition=true&jag=true&jira.create.linked.issue=true&locale=en&slack-enabled=true&whisper-enabled=true","startTime":1055.6999998092651,"connectEnd":1055.6999998092651,"connectStart":1055.6999998092651,"domainLookupEnd":1055.6999998092651,"domainLookupStart":1055.6999998092651,"fetchStart":1055.6999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":1055.6999998092651,"responseEnd":1336.8000001907349,"responseStart":1336.8000001907349,"secureConnectionStart":1055.6999998092651},{"duration":296.5,"initiatorType":"script","name":"https://www.google-analytics.com/analytics.js","startTime":1101.1999998092651,"connectEnd":0,"connectStart":0,"domainLookupEnd":0,"domainLookupStart":0,"fetchStart":1101.1999998092651,"redirectEnd":0,"redirectStart":0,"requestStart":0,"responseEnd":1397.6999998092651,"responseStart":0,"secureConnectionStart":0}],"fetchStart":0,"domainLookupStart":0,"domainLookupEnd":0,"connectStart":0,"connectEnd":0,"requestStart":56,"responseStart":226,"responseEnd":228,"domLoading":229,"domInteractive":1398,"domContentLoadedEventStart":1398,"domContentLoadedEventEnd":1461,"domComplete":2034,"loadEventStart":2034,"loadEventEnd":2035,"userAgent":"Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)","marks":[{"name":"bigPipe.sidebar-id.start","time":1345.1999998092651},{"name":"bigPipe.sidebar-id.end","time":1346},{"name":"bigPipe.activity-panel-pipe-id.start","time":1346.1999998092651},{"name":"bigPipe.activity-panel-pipe-id.end","time":1348.8999996185303},{"name":"activityTabFullyLoaded","time":1485.5}],"measures":[],"correlationId":"69af9815607dc","effectiveType":"4g","downlink":9,"rtt":0,"serverDuration":114,"dbReadsTimeInMs":15,"dbConnsTimeInMs":23,"applicationHash":"9d11dbea5f4be3d4cc21f03a88dd11d8c8687422","experiments":[]}}
After hacking MariaDB so that est_size is always <= 64000 when the VisitedSet constructor is called, I repeated tests and the hack makes QPS for ef_search=200, =300 and =400 much faster.
The numbers I pasted below are also here in case I can't get fixed-width fonts
recall QPS
PGVector(m=16, ef_construction=96, ef_search=20) 0.990 2126
PGVector(m=16, ef_construction=96, ef_search=10) 0.970 2662
PGVector(m=16, ef_construction=96, ef_search=30) 0.995 1803
PGVector(m=16, ef_construction=96, ef_search=40) 0.997 1581
PGVector(m=16, ef_construction=96, ef_search=80) 0.999 1109
PGVector(m=16, ef_construction=96, ef_search=120) 0.999 874
PGVector(m=16, ef_construction=96, ef_search=200) 1.000 606
PGVector(m=16, ef_construction=96, ef_search=300) 1.000 462
PGVector(m=16, ef_construction=96, ef_search=400) 1.000 383
PGVector(m=16, ef_construction=96, ef_search=800) 1.000 236
recall QPS
PGVector_halfvec(m=16, ef_construction=96, ef_search=10) 0.968 2803
PGVector_halfvec(m=16, ef_construction=96, ef_search=20) 0.989 2294
PGVector_halfvec(m=16, ef_construction=96, ef_search=30) 0.994 1991
PGVector_halfvec(m=16, ef_construction=96, ef_search=40) 0.996 1767
PGVector_halfvec(m=16, ef_construction=96, ef_search=80) 0.999 1260
PGVector_halfvec(m=16, ef_construction=96, ef_search=120) 0.999 998
PGVector_halfvec(m=16, ef_construction=96, ef_search=200) 1.000 731
PGVector_halfvec(m=16, ef_construction=96, ef_search=300) 1.000 527
PGVector_halfvec(m=16, ef_construction=96, ef_search=400) 1.000 439
PGVector_halfvec(m=16, ef_construction=96, ef_search=800) 1.000 269
MariaDB 11.7.1, as-is
recall QPS
MariaDB(m=8, ef_search=10) 0.984 4400
MariaDB(m=8, ef_search=20) 0.995 3791
MariaDB(m=8, ef_search=30) 0.997 3391
MariaDB(m=8, ef_search=40) 0.998 3105
MariaDB(m=8, ef_search=80) 0.999 2321
MariaDB(m=8, ef_search=120) 0.999 1913
MariaDB(m=8, ef_search=200) 0.999 1356
MariaDB(m=8, ef_search=300) 1.000 189
MariaDB(m=8, ef_search=400) 1.000 100
MariaDB(m=8, ef_search=800) 1.000 13
MariaDB 11.7.1, hacked so that est_size is <= 64000 in call to VisitedSet
recall QPS
MariaDB(m=8, ef_search=10) 0.984 4391
MariaDB(m=8, ef_search=20) 0.995 3797
MariaDB(m=8, ef_search=30) 0.997 3378
MariaDB(m=8, ef_search=40) 0.998 3094
MariaDB(m=8, ef_search=80) 0.999 2383
MariaDB(m=8, ef_search=120) 0.999 1977
MariaDB(m=8, ef_search=200) 0.999 1510 -> vs 1356 above for MariaDB without fix
MariaDB(m=8, ef_search=300) 1.000 1190 -> vs 189 above for MariaDB without fix
MariaDB(m=8, ef_search=400) 1.000 993 -> vs 100 above for MariaDB without fix
MariaDB(m=8, ef_search=800) 1.000 625 -> vs 13 above for MariaDB without fix