Details
- New Feature
- Status: Closed
- Major
- Resolution: Fixed
- 1.2.3
- None
- 2019-04
Description
Given a data set of 800 000 000 records with a couple of Dictionary columns containing many equal-length strings, it took 4 167 seconds to ingest the data set into ColumnStore. After the patch it takes only 467 seconds.
There were two main sources of latency:
- Dctnry::getTokenFromArray represented the de-dup buffer as a flat array and called memcpy for every equal-sized string, so each lookup had to touch all previously seen strings of that length (see the sketch below)
- COND_WAIT_SECONDS defaulted to 3 seconds
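The cost of the flat-array lookup is easiest to see in a simplified sketch. The C++ below is not the ColumnStore code and is not the actual patch; ArrayDedup and HashDedup are hypothetical types that only illustrate why doing per-string byte work in a linear de-dup buffer dominates ingest time, and how a hash-based index reduces each lookup to amortised O(1).

```cpp
#include <cstring>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical illustration: a de-dup buffer kept as a flat array forces a
// byte-level compare against every previously seen string of the same
// length, so each lookup costs O(number of entries).
struct ArrayDedup {
    std::vector<std::string> entries;  // previously ingested strings

    // Returns the token (index) of `s`, inserting it if it is new.
    std::size_t getToken(const char* s, std::size_t len) {
        for (std::size_t i = 0; i < entries.size(); ++i) {
            // One byte-level comparison per equal-length candidate; with many
            // equal-length strings this dominates ingest time.
            if (entries[i].size() == len &&
                std::memcmp(entries[i].data(), s, len) == 0)
                return i;
        }
        entries.emplace_back(s, len);
        return entries.size() - 1;
    }
};

// Hash-based variant: one hash lookup per string, amortised O(1) per row,
// which is the general idea behind replacing the linear scan.
struct HashDedup {
    std::unordered_map<std::string, std::size_t> index;  // string -> token
    std::size_t next = 0;

    std::size_t getToken(const char* s, std::size_t len) {
        auto [it, inserted] = index.emplace(std::string(s, len), next);
        if (inserted) ++next;
        return it->second;
    }
};
```

At hundreds of millions of rows, the gap between O(entries) and O(1) per lookup is the dominant factor, which is consistent with the ingest-time drop reported above.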
Issue Links
- causes MCOL-3395: regression: dictionary de-duplication cache bleeding between columns (Closed)