Status: Closed (View Workflow)
Resolution: Not a Bug
Gentoo linux - 4.4.8-hardened-r1 with PAX enabled
Supermicro machine with:
- two AMD Opteron(TM) Processor 6274, 64GB memory (approx. 40GB used regurarly)
- 4x 512GB SSD drives (different vendors) -> Linux MDRAID RAID5 -> LVM for /var/lib/mysql
emerge info and my.cnf attached
After replacing SSD drives laying under the the RAID+LVM (256GB drives to larger 512GB drives) and extending the LVM logical volume, server started crashing with:
Checksum failure while reading node partition in file ... error on different TokuDB files (example crash.log attached) - most of the time it's main files, sometime index
Table structure attached. Crash can be reproduced easily by reading the whole table (or identify the failing part of the data range and read its data).
The only solution was to drop index/table - in case of table it hurt because it contained 10M-100M lines of data.
What helps was to split large tables into partitions, smaller ones are not crashing anymore, but the the ones containing 30M+ lines per partition still crashes from time to time.
My question is if it's related to storage layer or tokudb/mariadb?