[MCOL-5043] Reduce a number of pre-spawned ExeMgr threads - Jira

XML

Word

Printable

Details

Type: New Feature
Status: Stalled (View Workflow)
Priority: Major
Resolution: Unresolved
Affects Version/s: 6.2.3
Fix Version/s: 23.10
Component/s: ExeMgr, PrimProc
Labels:
- rm_perf

Sprint:
2021-17, 2022-22, 2022-23, 2023-4, 2023-5, 2023-6, 2023-7, 2023-8, 2023-10, 2023-11, 2023-12, 2024-1, 2024-2
PM Planning:
- PM_BACKLOG

Description

MCS spawns lots of idle thread pool jobs for parallel query execution, e.g. every 2nd phase of a parallel 2-step aggregation spawns 24 threads and parallel sorting spawns 16 threads by default. The pool threads are just idle until data starts to flow from the lower parts of the executed query. Every thread uses sync primitives, e.g. mutex-es or cond_variable. When multiple queries are processed by an MCS cluster the concurrency sync primitives overhead is enormous and can reach 25% of non-virtualized CPU horsepower.
The suggested solution is to reduce a number of threads on the start down to one. EM adds more parallel threads if needed only.
Consider above mentioned 2nd step of a parallel aggregation. It pre-spawns of threads that reads data from an input queue and puts records(RowPointers to be exact) into buckets(bucket number = hash % buckets number). The thread later populates hash map with the calculated bucket number with the RowPointers and the hash calculated. The suggestion is to enable the code to detect if the input queue is filled up to a certain limit for a period of time and to add a new processing thread at this point. If it is the code must spawn another thread/-s.

Attachments

Issue Links

blocks

MCOL-4593 Multiple concurrent queries with aggregates are bottlenecked, result in lack of user scalability

Stalled

relates to

MCOL-5044 Improve PP thread pool with a fair scheduler

Closed

MCOL-5045 Computational resources and Workload aware primitive job scheduler in EM.

Open

MCOL-4691 Major Regression: Selects with aggregates 2x slower in 5.x than in 1.2 (due to collation support)

Closed

Activity

People

Assignee:: Roman

Reporter:: Roman

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 2022-04-06 11:25

Updated:: 2025-02-21 08:58

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.