Details

    • Epic
    • Status: Stalled (View Workflow)
    • Critical
    • Resolution: Unresolved
    • N/A
    • Server
    • vector search

    Description

      A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
      This task is about implementing indexes that allow to do it fast.

      • ideally they'll be engine independent
      • indexes should be update-able
      • in this task we'll only do Euclidean distance
      • we'll benchmark it on real multi-million-rows data sets
      • what algorithm, exactly, to use is still unclear

      Attachments

        Issue Links

          Activity

            serg Sergei Golubchik created issue -
            serg Sergei Golubchik made changes -
            Field Original Value New Value
            serg Sergei Golubchik made changes -
            Description A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets

            * what algorithm, exactly, to use is still unclear
            A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear
            serg Sergei Golubchik made changes -
            Fix Version/s 11.5 [ 29506 ]
            serg Sergei Golubchik made changes -
            Status Open [ 1 ] In Progress [ 3 ]
            serg Sergei Golubchik made changes -
            Comment [ (/) index storage: first version. inside the table row, in the internal (not visible to a user) column

            won't be performant, but allows to implement and debug the algorithm

            the engine doesn't see the key at all, so enable/disable keys likely won't work. but as we'll probably change the storage in the future, it doesn't matter now. ]
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Description A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear
            A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- in the linked gdoc
            serg Sergei Golubchik made changes -
            Description A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- in the linked gdoc
            A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. *this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- in the linked gdoc*
            serg Sergei Golubchik made changes -
            Description A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. *this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- in the linked gdoc*
            A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. *this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- *in the linked gdoc*
            serg Sergei Golubchik made changes -
            Description A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. *this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- *in the linked gdoc*
            A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. *this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- in the linked gdoc*
            serg Sergei Golubchik made changes -
            Description A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear

            h1. *this task is at the moment in the _early design phase_ the ideas are recorded, added, changed, and -removed- in the linked gdoc*
            A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Description A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets
            * optimizer-wise we'll do like with fulltext search

            * what algorithm, exactly, to use is still unclear
            A common operation for multi-dimensional vectors is to find k nearest vectors to the given one.
            This task is about implementing indexes that allow to do it fast.

            * ideally they'll be engine independent
            * indexes should be update-able
            * in this task we'll only do Euclidean distance
            * we'll benchmark it on real multi-million-rows data sets

            * what algorithm, exactly, to use is still unclear
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Fix Version/s 11.6 [ 29515 ]
            Fix Version/s 11.5 [ 29506 ]
            JIraAutomate JiraAutomate made changes -
            Status In Progress [ 3 ] Stalled [ 10000 ]
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            julien.fritsch Julien Fritsch made changes -
            Priority Major [ 3 ] Critical [ 2 ]
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Status Stalled [ 10000 ] In Progress [ 3 ]
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Fix Version/s 11.7 [ 29815 ]
            Fix Version/s 11.6 [ 29515 ]
            serg Sergei Golubchik made changes -
            Fix Version/s 11.8 [ 29921 ]
            Fix Version/s 11.7 [ 29815 ]
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Fix Version/s 11.8 [ 29921 ]
            serg Sergei Golubchik made changes -
            Issue Type New Feature [ 2 ] Epic [ 5 ]
            serg Sergei Golubchik made changes -
            Status In Progress [ 3 ] Stalled [ 10000 ]
            serg Sergei Golubchik made changes -
            Fix Version/s N/A [ 14700 ]
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            Summary k-ANN indexes for vectors vector search
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            svoj Sergey Vojtovich made changes -
            julien.fritsch Julien Fritsch made changes -
            Epic Name vectore search
            julien.fritsch Julien Fritsch made changes -
            Epic Name vectore search vector search

            People

              serg Sergei Golubchik
              serg Sergei Golubchik
              Votes:
              2 Vote for this issue
              Watchers:
              19 Start watching this issue

              Dates

                Created:
                Updated:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.