Details
-
Task
-
Status: Closed (View Workflow)
-
Critical
-
Resolution: Fixed
-
None
Description
Use some real-life million-size dataset
Benchmark goals:
- Index creation time
- Index update time
- Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times.
- This tests speed of graph lookup with hopefully most data in cache.
- Index lookup for million different query vectors (top 1, 10, 100, 1000) results, no repetition.
- This tests speed of graph lookup when data may not all fit in cache.
- Compute average recall of the algorithm for such queries.
Compare to papers using the same algorithm.
Attachments
Issue Links
- blocks
-
MDEV-33413 cache k-ANN graph in memory
-
- Closed
-
-
MDEV-33415 graph index search: heuristical edge choice
-
- Closed
-
-
MDEV-33416 graph index: use smaller floating point numbers
-
- Closed
-
-
MDEV-33418 graph index insert: stronger selection of neighbors
-
- Closed
-
-
MDEV-33419 graph index insert: consider more neighbors
-
- Open
-
- is blocked by
-
MDEV-33406 basic optimizer support for k-NN searches
-
- Closed
-
-
MDEV-33407 Parser support for vector indexes
-
- Closed
-
-
MDEV-33408 HNSW for k-ANN vector searches
-
- Closed
-
- is part of
-
MDEV-34939 vector search in 11.7
-
- Closed
-
- relates to
-
MDEV-33404 Engine-independent indexes: subtable method
-
- Closed
-
-
MDEV-33405 Engine-independent indexes: low-level API method
-
- Closed
-
-
MDEV-32887 vector search
-
- Stalled
-
Activity
Field | Original Value | New Value |
---|---|---|
Link | This issue relates to MDEV-32887 [ MDEV-32887 ] |
Link |
This issue relates to |
Link |
This issue relates to |
Link |
This issue is blocked by |
Link |
This issue is blocked by |
Link |
This issue is blocked by |
Link |
This issue blocks |
Link |
This issue relates to |
Link |
This issue blocks |
Link |
This issue relates to |
Link |
This issue blocks |
Link |
This issue blocks |
Link | This issue blocks MDEV-33419 [ MDEV-33419 ] |
Description | use some real-life million-size dataset |
use some real-life million-size dataset
Benchmark goals: * Index creation time * Index update time * Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times. ** This tests speed of graph lookup with hopefully most data in cache. * Index lookup for million different query entries (top 1, 10, 100, 1000) results, no repetition. ** Thos tests speed of graph lookup when data may not all fit in cache. * Compute average recall of the algorithm for such queries. Compare to papers using the same algorithm. |
Description |
use some real-life million-size dataset
Benchmark goals: * Index creation time * Index update time * Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times. ** This tests speed of graph lookup with hopefully most data in cache. * Index lookup for million different query entries (top 1, 10, 100, 1000) results, no repetition. ** Thos tests speed of graph lookup when data may not all fit in cache. * Compute average recall of the algorithm for such queries. Compare to papers using the same algorithm. |
use some real-life million-size dataset
Benchmark goals: * Index creation time * Index update time * Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times. ** This tests speed of graph lookup with hopefully most data in cache. * Index lookup for million different query vectors (top 1, 10, 100, 1000) results, no repetition. ** Thos tests speed of graph lookup when data may not all fit in cache. * Compute average recall of the algorithm for such queries. Compare to papers using the same algorithm. |
Description |
use some real-life million-size dataset
Benchmark goals: * Index creation time * Index update time * Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times. ** This tests speed of graph lookup with hopefully most data in cache. * Index lookup for million different query vectors (top 1, 10, 100, 1000) results, no repetition. ** Thos tests speed of graph lookup when data may not all fit in cache. * Compute average recall of the algorithm for such queries. Compare to papers using the same algorithm. |
use some real-life million-size dataset
Benchmark goals: * Index creation time * Index update time * Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times. ** This tests speed of graph lookup with hopefully most data in cache. * Index lookup for million different query vectors (top 1, 10, 100, 1000) results, no repetition. ** This tests speed of graph lookup when data may not all fit in cache. * Compute average recall of the algorithm for such queries. Compare to papers using the same algorithm. |
Description |
use some real-life million-size dataset
Benchmark goals: * Index creation time * Index update time * Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times. ** This tests speed of graph lookup with hopefully most data in cache. * Index lookup for million different query vectors (top 1, 10, 100, 1000) results, no repetition. ** This tests speed of graph lookup when data may not all fit in cache. * Compute average recall of the algorithm for such queries. Compare to papers using the same algorithm. |
Use some real-life million-size dataset
Benchmark goals: * Index creation time * Index update time * Index lookup for 1, 10, 100, 1000 entries for the same "query" vector repeated N times. ** This tests speed of graph lookup with hopefully most data in cache. * Index lookup for million different query vectors (top 1, 10, 100, 1000) results, no repetition. ** This tests speed of graph lookup when data may not all fit in cache. * Compute average recall of the algorithm for such queries. Compare to papers using the same algorithm. |
Priority | Major [ 3 ] | Critical [ 2 ] |
Fix Version/s | 11.6 [ 29515 ] |
Assignee | Vicențiu Ciorbaru [ cvicentiu ] |
Status | Open [ 1 ] | In Progress [ 3 ] |
Status | In Progress [ 3 ] | Stalled [ 10000 ] |
Status | Stalled [ 10000 ] | In Progress [ 3 ] |
Status | In Progress [ 3 ] | Needs Feedback [ 10501 ] |
Status | Needs Feedback [ 10501 ] | Open [ 1 ] |
Status | Open [ 1 ] | In Progress [ 3 ] |
Status | In Progress [ 3 ] | In Testing [ 10301 ] |
Assignee | Vicențiu Ciorbaru [ cvicentiu ] | Sergei Golubchik [ serg ] |
Fix Version/s | 11.7 [ 29815 ] | |
Fix Version/s | 11.6 [ 29515 ] |
Fix Version/s | N/A [ 14700 ] | |
Fix Version/s | 11.7 [ 29815 ] |
Link |
This issue is part of |
Component/s | Vector search [ 20205 ] | |
Resolution | Fixed [ 1 ] | |
Status | In Testing [ 10301 ] | Closed [ 6 ] |
Fix Version/s | 11.7.1 [ 29913 ] | |
Fix Version/s | N/A [ 14700 ] |
I'm looking into the vector benchmarking.