[MCOL-4809] Vectorize column scanning/filtering - Jira

XML

Word

Printable

Details

Type: New Feature
Status: Closed (View Workflow)
Priority: Critical
Resolution: Fixed
Affects Version/s: 6.1.1
Fix Version/s: 6.3.1
Component/s: PrimProc
Labels:
None

Epic Link:
Performance
Sprint:
2021-9, 2021-10, 2021-11, 2021-12, 2021-13, 2021-14, 2021-15, 2021-16, 2021-17
Epic/Theme:
- Performance

Description

As of now there is no way to vectorize the loops of the scanning/filtering code that resides in primitives/linux-port/column.*

The basic logic is that for the column the mentioned code traverses the block of values:

skiping empty values
filtering the values using related filters from SQL statement
saving the values that satisfies into the output buffer
The code optionally traverses the column block and touch only those values with specific RIDs sent from upper layers.

The data processing is scalar here with lots of conditions that slows down execution.
The suggested way is to refactor the code to leverage data prefetch and batch processing using SIMD instructions. The available CPU command set should be detected in runtime on PP startup or at least once per column block.

Attachments

Issue Links

includes

MCOL-4815 Refactor ColumnCommand to have multiple derived classes specified by column width

Closed

relates to

MCOL-4876 Separate values and RID vectors sent b/w filtering and PrimitiveProcessor runtimes

Closed

MCOL-4818 Vectorize in-memory data representation

Closed

Activity

People

Assignee:: Aleksei Antipovskii

Reporter:: Roman

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 2021-07-09 19:03

Updated:: 2024-10-03 15:53

Resolved:: 2022-02-27 07:53

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.