[MCOL-5105] Reduced systemd timeouts results in corrupted EM Created: 2022-05-27 Updated: 2024-01-26 Resolved: 2022-06-21 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | DMLProc |
| Affects Version/s: | 5.6.5, 6.3.1 |
| Fix Version/s: | 6.4.1 |
| Type: | Bug | Priority: | Major |
| Reporter: | Roman | Assignee: | Roman |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||||||||||||||||||||||||||||||
| Sprint: | 2021-17 | ||||||||||||||||||||||||||||||||
| Description |
|
There were multiple cases when low default startup/shutdown systemd timeouts, namely mcs-dmlproc, mcs-workernode, result in a lost or corrupted extent map. |
| Comments |
| Comment by Roman [ 2022-05-27 ] | ||||||||||||||
|
Plz review. | ||||||||||||||
| Comment by Daniel Lee (Inactive) [ 2022-06-07 ] | ||||||||||||||
|
Build tested: 6.4.1-1 (PR #4500, cron #4590) 1. set autocommit=0 ColumnStore tried to rollback during startup. From the debug.log, it kept trying the rollback every 2 minutes. I waited for 16 iteration before stopping the test. When I manually issued a rollback after update, it look 10 minutes to complete.
| ||||||||||||||
| Comment by Daniel Lee (Inactive) [ 2022-06-07 ] | ||||||||||||||
|
Reopen per my last comment | ||||||||||||||
| Comment by Roman [ 2022-06-09 ] | ||||||||||||||
|
The issue caused by a hardcoded pipe read operation limit that implicitly limits the effective timeout for startup operations. | ||||||||||||||
| Comment by alexey vorovich (Inactive) [ 2022-06-09 ] | ||||||||||||||
|
drrtuy | ||||||||||||||
| Comment by Roman [ 2022-06-10 ] | ||||||||||||||
|
This issue is about systemd irrespective whether it is a container installation or not. | ||||||||||||||
| Comment by Daniel Lee (Inactive) [ 2022-06-10 ] | ||||||||||||||
|
Build tested: 6.4.1-1 (#4623) The test was done on VM, not Docker container. Repeated the last test case. It took 10 minutes to rollback the transaction successfully.
| ||||||||||||||
| Comment by Daniel Lee (Inactive) [ 2022-06-16 ] | ||||||||||||||
|
Build tested: 6.4.1-1 (#4660) Did another two tests in a Docker container. Test #1 Test #2 |