[MDEV-31139] MariaDB hangs with multiple threads open - Jira

Details

Type: Bug
Status: Closed
Priority: Critical
Resolution: Duplicate
Affects Version/s: 10.11.2
Fix Version/s: N/A
Component/s: Storage Engine - InnoDB
Labels:
- Connection
- speed
Environment:
Ubuntu 22.04.2 LTS (GNU/Linux 5.15.0-70-generic x86_64)

Description

This started happening soon after the upgrade from 10.10 to version 10.11.2, I believe. The problem starts when PHP scripts start using 250-350 threads, mostly for INSERT statements (SELECT statements are frequent too). On average MariaDB receives around 6000 queries per second, but the problem occurs even at just 2000 queries per second.

There are 1 master and 4 replicas (each using 16 parallel slave threads for replication).

At some point a server (sometimes one, sometimes two), either the master or a replica, stops processing any INSERT or SELECT statements. The process list then shows several hundred connected threads with times of many minutes or hours. Stopping all further queries to the affected server does not resolve anything: nothing changes for hours.
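The symptom above (hundreds of threads with very large times) can be quantified from process list output. The following is a minimal sketch, assuming the rows come from information_schema.PROCESSLIST and are already available as dictionaries keyed by its column names (ID, COMMAND, TIME, STATE). The function name summarize_stuck, the 60-second threshold, and the sample rows are hypothetical; the sample values are invented for illustration and are not taken from the attached process_list.txt.

```python
from collections import Counter
from typing import Iterable, Mapping


def summarize_stuck(rows: Iterable[Mapping[str, object]], min_seconds: int = 60) -> dict:
    """Count threads whose TIME is at least min_seconds, grouped by COMMAND and STATE."""
    stuck = [r for r in rows if int(r["TIME"]) >= min_seconds]
    return {
        "stuck_total": len(stuck),
        "by_command": dict(Counter(str(r["COMMAND"]) for r in stuck)),
        # STATE may be NULL in the process list; treat it as an empty string.
        "by_state": dict(Counter(str(r["STATE"] or "") for r in stuck)),
        "longest_seconds": max((int(r["TIME"]) for r in stuck), default=0),
    }


# Invented sample rows (hypothetical values, for illustration only).
SAMPLE_ROWS = [
    {"ID": 101, "COMMAND": "Query", "TIME": 3, "STATE": "Sending data"},
    {"ID": 102, "COMMAND": "Query", "TIME": 1800, "STATE": "Update"},
    {"ID": 103, "COMMAND": "Query", "TIME": 7200, "STATE": "Update"},
    {"ID": 104, "COMMAND": "Killed", "TIME": 4000, "STATE": "Update"},
]

summary = summarize_stuck(SAMPLE_ROWS, min_seconds=60)
```

Grouping by COMMAND and STATE makes it easier to see whether the long-running threads are all waiting in the same place, which is the kind of detail a report like this one benefits from.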

When this happens, a simple connect to MariaDB, when measured, usually takes 60-460 ms instead of the normal 0.5-1.1 ms.
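One way to take such a measurement is sketched below. It is an assumption, not the method used in this report: it times only the TCP connect plus the arrival of the first bytes of the server greeting (the MariaDB server sends its handshake packet first), and it does not authenticate. The function names measure_connect_ms and classify are hypothetical; the thresholds are the figures quoted above, and port 3306 is the default MariaDB port.

```python
import socket
import time


def measure_connect_ms(host: str, port: int = 3306, timeout: float = 5.0) -> float:
    """Milliseconds to open a TCP connection and receive the first greeting bytes.

    Raises an OSError (for example a timeout) if the server does not answer in time.
    """
    start = time.perf_counter()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.settimeout(timeout)
        sock.recv(4)  # the server speaks first; read up to 4 bytes of its greeting
    return (time.perf_counter() - start) * 1000.0


def classify(ms: float, normal_max_ms: float = 1.1, degraded_min_ms: float = 60.0) -> str:
    """Label a measurement using the ranges quoted in this report."""
    if ms <= normal_max_ms:
        return "normal"
    if ms >= degraded_min_ms:
        return "degraded"
    return "elevated"
```

Calling classify(measure_connect_ms("127.0.0.1")) once per second from a monitoring host would give a timeline of when a server starts to degrade, though a full client login would measure slightly more than this sketch does.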

When I try to kill all threads one by one, the process list shows them as killed, but they remain on the list.

If I then (or before that) try systemctl stop mariadb, the server does not stop for hours (same with restart). The only option is to find the mariadb process, issue kill -9, and start MariaDB again, which then starts in recovery mode. All tables in the database are InnoDB.

Servers that do not receive many SELECT statements (the last 2 in the image list) never get this problem, so I believe replication is not the issue. Replicas are READ ONLY.

This happens about once or twice a day at different times, but always when the number of queries is on the high side. There is no shortage of RAM or disk.

Attachments

- download.png (14 kB, 2023-04-27 07:51)
- mariadbd_full_bt_all_threads_2.txt (1.09 MB, 2023-04-29 14:03)
- mariadbd_full_bt_all_threads.txt (380 kB, 2023-04-28 09:49)
- master_db_recovery.jpg (161 kB, 2023-05-06 07:12)
- master_mariadb_full_bt_all_threads.txt (4.47 MB, 2023-04-29 14:03)
- process_list.txt (63 kB, 2023-04-27 07:51)
- process_list2.txt (79 kB, 2023-04-27 08:18)

Issue Links

duplicates: MDEV-29835 Partial server freeze (Closed)

relates to: MDEV-33185 Connection timeout/slowdown upon increased load (Open)

Activity

People

Assignee: Unassigned
Reporter: Erlandas
Votes: 0
Watchers: 5

Dates

Created: 2023-04-27 08:06
Updated: 2024-01-05 06:24
Resolved: 2023-09-05 00:15
