[MDEV-29940] mariabackup gets unbearably slow Created: 2022-11-03 Updated: 2022-11-03 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Matthias Bethke | Assignee: | Unassigned |
| Resolution: | Unresolved | Votes: | 2 |
| Labels: | None |
| Environment: | MariaDB 10.5.9 on CentOS 8 |
| Description |
|
We run a primary/replica setup, currently with one primary and two replicas. The database is about 550 GB and the hardware is amply sized: DL360 Gen10 servers with 24 cores, 64 GB RAM on the primary and 128 GB on the replicas; a 10 Gbit network; 1.8 TB SAS disks (RAID 1) on an HPE Smart Array 408i with cache, plus 480 GB SSDs as SmartCache (relay logs on a separate array); CentOS 8 Stream and MariaDB 10.5.9. To repair replication after a hardware problem, I have been streaming mariabackup output in the following way, which avoids storing a temporary copy on the primary:
This method has never been exactly speedy (although still better than dump-copy-restore), but it seems to have gotten worse between 10.3 and 10.5 (yeah, CentOS) and is downright glacial now. Running strace on the mariabackup process shows a stream of pread64 calls with fairly small 64k blocks, but that is about the only thing that stands out, and it should not cause a slowdown of this size either. The strangest thing is that there are periods when iowait is zero on the sending side and load drops to around 0.25, most of which probably comes from other jobs such as a mostly idle mysqld, puppet and monitoring. The slowdown is severe enough that I think there must be a bug. Any ideas would be highly appreciated, even if it's just "it was fixed a decade ago, upgrade". |
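
To put numbers on the pread64 observation, a rough sketch like the following could total the bytes read and the time spent inside the calls. It assumes the trace was captured with strace's `-T` option (which appends the per-call duration in angle brackets); the function name `summarize_preads` and the file name `preads.txt` are hypothetical and not part of any MariaDB tooling.

```python
import re

# Matches strace lines captured with -T, for example:
#   pread64(7, "..."..., 65536, 1048576) = 65536 <0.000412>
# Only successful calls match: the return value must be a non-negative integer.
PREAD_RE = re.compile(r"pread64\(.*\)\s+=\s+(\d+)\s+<(\d+\.\d+)>")


def summarize_preads(lines):
    """Return (calls, bytes_read, seconds_in_syscall) for successful pread64 calls."""
    calls = 0
    total_bytes = 0
    total_seconds = 0.0
    for line in lines:
        match = PREAD_RE.search(line)
        if match is None:
            # Other syscalls, failed reads, and unfinished/resumed lines are skipped.
            continue
        calls += 1
        total_bytes += int(match.group(1))
        total_seconds += float(match.group(2))
    return calls, total_bytes, total_seconds
```

One possible use: capture a trace with `strace -f -T -e trace=pread64 -p <PID> -o preads.txt`, then call `summarize_preads(open("preads.txt"))`. If the time spent inside the calls is small compared with the wall-clock duration of the capture, the reads themselves are probably not the bottleneck, which would be consistent with the zero iowait described above.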