[MDEV-6718] Server crashed in Gtid_log_event::Gtid_log_event with parallel replication Created: 2014-09-10 Updated: 2014-11-13 Resolved: 2014-11-13 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Replication |
| Affects Version/s: | 10.0.14 |
| Fix Version/s: | 10.0.15 |
| Type: | Bug | Priority: | Major |
| Reporter: | Elena Stepanova | Assignee: | Kristian Nielsen |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | parallelslave, replication | ||
| Environment: |
Linux 3.10.0-123.6.3.el7.bz114641.ppc64 #1 SMP |
||
| Attachments: |
|
| Description |
|
Stack trace from:
All threads' stack traces are attached. It happened once so far during tests on Power.
|
| Comments |
| Comment by Kristian Nielsen [ 2014-09-29 ] |
|
It is crashing on this line:
uint8 post_header_len= description_event->post_header_len[GTID_EVENT-1];
The context is the Gtid_log_event constructor:
Gtid_log_event::Gtid_log_event(const char *buf, uint event_len,
Looks like maybe the description_event pointer is incorrect or something like that? |
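To illustrate why that line is sensitive to a bad description_event, here is a minimal, self-contained sketch of a checked version of the same lookup. It is not the server code: FormatDescriptionSketch, kSketchGtidEvent and gtid_post_header_len are hypothetical names invented for this example, and the event type value is arbitrary. A check like this only catches a null pointer or a too-short table; it cannot detect a pointer to an object that has already been freed.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical stand-in for the server's format description event; only the
// post_header_len table is modelled here.
struct FormatDescriptionSketch {
    std::vector<std::uint8_t> post_header_len;  // indexed by (event type - 1)
};

// Illustrative event type code, chosen for this sketch only (an assumption,
// not the server's actual GTID_EVENT value).
constexpr std::size_t kSketchGtidEvent = 3;

// Checked equivalent of `description_event->post_header_len[GTID_EVENT-1]`.
// Returns std::nullopt instead of dereferencing a null pointer or reading
// past the end of the table. A dangling pointer cannot be detected this way.
std::optional<std::uint8_t> gtid_post_header_len(
    const FormatDescriptionSketch *description_event) {
    if (description_event == nullptr) return std::nullopt;
    const std::size_t index = kSketchGtidEvent - 1;
    if (index >= description_event->post_header_len.size()) return std::nullopt;
    return description_event->post_header_len[index];
}
```

With a table of three entries the function returns the third one; with a null pointer or a table shorter than three entries it returns an empty optional rather than crashing.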
| Comment by Kristian Nielsen [ 2014-09-29 ] |
|
I think I can reproduce the problem by running the test case rpl.rpl_parallel_retry many times with --mem --parallel=3 --repeat=1000 (on standard amd64 Linux). At least the problem looks very similar: a corrupt description_event below retry_event_group(). It fails in various ways depending on exactly when the corruption triggers, but the root cause is likely the same. |
| Comment by Kristian Nielsen [ 2014-09-30 ] |
|
Here is a proposed patch:
http://lists.askmonty.org/pipermail/commits/2014-September/006667.html
The problem is incorrect handling of the format description event that is used |
| Comment by Kristian Nielsen [ 2014-11-13 ] |
|
Pushed to 10.0.15: http://lists.askmonty.org/pipermail/commits/2014-November/006974.html |