[MDEV-29809] MariaDB: node crash and recovery - Semaphore wait has lasted > 600 seconds. We intentionally crash.... Created: 2022-10-17 Updated: 2022-11-23 Resolved: 2022-11-23 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Server, Storage Engine - InnoDB |
| Affects Version/s: | 10.3.27 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Critical |
| Reporter: | Wenwen Jing | Assignee: | Unassigned |
| Resolution: | Incomplete | Votes: | 0 |
| Labels: | crash, lock, recovery | ||
| Environment: |
CentOS 7.9, 16C32G2000G |
||
| Description |
|
Several days ago we hit a problem: a node of our MM cluster crashed and then recovered by itself. A heavy load-in migration task was running at the time. The related logs are in the attachments. 11.PNG shows that the server hung and then crashed, raising signal 6. 12.PNG shows that the server raised signal 11 while aborting, which is why no backtrace file was generated. 13.PNG and 14.PNG show the information gathered when analyzing signal waiting. |
| Comments |
| Comment by Daniel Black [ 2022-10-17 ] |
|
Was a core file generated? Is it possible to install the debug info packages and obtain a backtrace from the core (as text)? A count of 91M in the reservation array (13.png) seems like a lot. What configuration are you running? Can you share the kinds of queries used by the load-in migration, and their tables? I removed |
| Comment by Marko Mäkelä [ 2022-10-17 ] |
|
By default, due to It would be better to attach output as text instead of bitmaps. In the server error log, were there any reports of corrupted pages? I believe that before the fix of Is this hang reproducible with MariaDB Server 10.6.10 or a later version? |