[MDEV-33408] HNSW for k-ANN vector searches - Jira

XML

Word

Printable

Details

Type: New Feature
Status: Closed (View Workflow)
Priority: Critical
Resolution: Fixed
Fix Version/s: 11.7.1
Component/s: Vector search
Labels:
None

Description

For the first iteration of Vector search, we will implement HNSW algorithm.

The implementation will only support Euclidean distance initially.

Basic plan:
Graph construction will be done according to HNSW paper.

Storage wise, we'll store the graph as part of a subtable (~~MDEV-33404~~).

The table's definition will be something along these lines:

  CREATE TABLE i (

    level int unsigned not null,

    src varbinary(255) not null,

    dst varbinary(255) not null,

    index (level,src),

    index (level,dst));

For each link in the graph, there will be a corresponding entry in the table.

src and dst will store handler::position, a quick link to the actual vector blob in the main table.

The index (level,src) will allow for quick jumping between nodes.
To go deeper in search, one just needs to decrement the level and search using the same "src" value.

If src is found on level n, then it is also found on level n - 1 and so on. Level 0 is the base level with all the nodes.

Performance considerations:

Storing the vector in the subtable might be required. Looking up the blob value in the base table might be too costly.

Attachments

Issue Links

blocks

MDEV-33411 OPTIMIZE for graph indexes

Open

MDEV-33414 benchmark vector indexes

Closed

MDEV-33415 graph index search: heuristical edge choice

Closed

MDEV-33416 graph index: use smaller floating point numbers

Closed

MDEV-33418 graph index insert: stronger selection of neighbors

Closed

MDEV-33419 graph index insert: consider more neighbors

Open

is blocked by

MDEV-33404 Engine-independent indexes: subtable method

Closed

MDEV-36317 vector search with Cosine Distance, the recall rate of the returned results is very low

Closed

MDEV-36338 vector search with Cosine Distance is slow

Closed

is part of

MDEV-34939 vector search in 11.7

Closed

relates to

MDEV-32887 Vector Search

Closed

(1 blocks, 3 is blocked by, 1 is part of, 1 relates to)

Activity

People

Assignee:: Vicențiu Ciorbaru

Reporter:: Sergei Golubchik

Votes:: 3 Vote for this issue

Watchers:: 12 Start watching this issue

Dates

Created:: 2024-02-07 11:21

Updated:: 2025-09-01 15:02

Resolved:: 2024-11-10 13:52