Details
-
Bug
-
Status: Closed (View Workflow)
-
Critical
-
Resolution: Fixed
-
10.6
-
None
-
Master and single semi-sync slave
Description
The user gets a lot of these errors randomly in the error log:
"Read semi-sync reply magic number error"
This is probably some missing or out of sync packet as part of semi sync replication.
We should try to repeat this by have our QA and benchmark teams run tests on a master and semi-sync slave to try to repeat the issue.
Attachments
Issue Links
- causes
-
MDEV-32940 Semi-sync Slave Falsely Logs that Master has Semi-sync Disabled
-
- Closed
-
-
MDEV-32943 Stopping Semi-sync Slave Reports that it is Falling Back to Async Mode
-
- Closed
-
-
MDEV-32944 Rpl_semi_sync_slave_enabled sysvar Should Not Be Modifiable While IO Thread is Running
-
- Closed
-
-
MDEV-32945 Rpl_semi_sync_master_status is ON Without Any Connected Slaves
-
- Open
-
-
MDEV-32947 Semi-sync with Multi-source Checksum Verification Failure
-
- Open
-
-
MDEV-32960 Semi-sync ACKed Transaction can Timeout and Switch Off Semi-sync with Multiple Replicas
-
- Closed
-
-
MDEV-32961 Semi-sync Primary Segfaults on Net Error
-
- Closed
-
-
MDEV-33331 IO Thread Relay Log Inconsistent Statistics After MDEV-32551
-
- Open
-
-
MDEV-33546 Rpl_semi_sync_slave_status is ON When Replication Is Not Configured
-
- Closed
-
- relates to
-
MDEV-11853 semisync thread can be killed after sync binlog but before ACK in the sync state
-
- Closed
-
-
MDEV-24773 slave_compressed_protocol doesn't work properly with semi-sync replication
-
- Closed
-
-
MDEV-28114 Semi-sync Master ACK Receiver Thread Can Error on COM_QUIT
-
- Closed
-
-
MDEV-28141 Slave crashes with Packets out of order when connecting to a shutting down master
-
- Open
-
-
MDEV-28306 Slave I/O: Fatal error: Failed to run 'after_read_event' hook, Internal MariaDB error code: 1593
-
- Open
-
-
MDEV-29369 rpl.rpl_semi_sync_shutdown_await_ack fails regularly with Result content mismatch
-
- Closed
-
-
MDEV-32365 detailize the semisync replication magic number error
-
- Closed
-
-
MDEV-32385 Semi-Sync Ack_Receiver Thread Should Allow COM_QUIT Command
-
- Closed
-
-
MDEV-32792 Second Semi-sync Replica Can Hang at Connect Time
-
- Closed
-
-
MDEV-33327 rpl_seconds_behind_master_spike Sensitive to IO Thread Stop Position
-
- Closed
-
-
MXS-4792 Semi-sync replication through MaxScale causes errors on STOP SLAVE
-
- Closed
-
-
MXS-4793 Unknown command errors when semi-sync replication runs through MaxScale
-
- Closed
-
Activity
Field | Original Value | New Value |
---|---|---|
Assignee | Roel Van de Paar [ roel ] |
Labels | CS0653958 |
Link |
This issue relates to |
Link |
This issue relates to |
Link | This issue relates to MDEV-28141 [ MDEV-28141 ] |
Link |
This issue relates to |
Link |
This issue relates to |
Assignee | Roel Van de Paar [ roel ] | Axel Schwenke [ axel ] |
Link |
This issue relates to |
Link |
This issue relates to |
Link | This issue relates to MDEV-28306 [ MDEV-28306 ] |
Assignee | Axel Schwenke [ axel ] |
Assignee | Pandikrishnan Gurusamy [ JIRAUSER52131 ] |
Status | Open [ 1 ] | Needs Feedback [ 10501 ] |
Link |
This issue relates to |
Labels | CS0653958 |
Priority | Major [ 3 ] | Critical [ 2 ] |
Link |
This issue relates to |
Assignee | Pandikrishnan Gurusamy [ JIRAUSER52131 ] | Brandon Nesterenko [ JIRAUSER48702 ] |
Status | Needs Feedback [ 10501 ] | Open [ 1 ] |
Comment |
[ [~Elkin] I tried the following MTR case, but was not able to repeat the issue:
Please provide a working mtr case so that I can help working on this! If the issue happens because of some missing bytes in the protocol, it may be that the code in net_clear() is eating the response on master/slave side. I have tried to follow your instructions, but not yet succeed. Here is what I have at this point: --source include/master-slave.inc --let $rpl_connection_name= slave2 --let $rpl_server_number= 2 --source include/rpl_connect.inc --connection master delimiter //; create procedure p1(i int) begin declare k int default 0; while k < i do select sleep(0.1); SET GLOBAL rpl_semi_sync_slave_enabled= OFF; STOP SLAVE IO_THREAD; select sleep(0.001); start slave io_thread; SET GLOBAL rpl_semi_sync_slave_enabled= ON; set k=k+1; end while; end// delimiter ;// --sync_slave_with_master --connection master set global rpl_semi_sync_master_enabled = ON; --connection slave1 --send call p1(1000) --connection slave2 --send call p1(1000) --connection slave1 --reap SET GLOBAL rpl_semi_sync_slave_enabled= OFF; --connection slave2 --reap SET GLOBAL rpl_semi_sync_slave_enabled= OFF; --connection master SET GLOBAL rpl_semi_sync_master_enabled= OFF; drop procedure p1; --sync_slave_with_master --connection master --source include/rpl_end.inc ] |
Attachment | skr.test [ 72422 ] |
Attachment | my.cnf [ 72423 ] |
Attachment | my.cnf [ 72423 ] |
Attachment | skr.cnf [ 72424 ] |
Attachment | skr.test [ 72422 ] |
Attachment | skr.test [ 72425 ] |
Status | Open [ 1 ] | In Progress [ 3 ] |
Link |
This issue relates to |
Fix Version/s | 10.6.17 [ 29518 ] | |
Fix Version/s | 10.6 [ 24028 ] |
Fix Version/s | 10.6 [ 24028 ] | |
Fix Version/s | 10.6.17 [ 29518 ] |
Status | In Progress [ 3 ] | In Testing [ 10301 ] |
Link |
This issue causes |
Link |
This issue causes |
Link |
This issue causes |
Link | This issue causes MDEV-32945 [ MDEV-32945 ] |
Link | This issue causes MDEV-32947 [ MDEV-32947 ] |
Link |
This issue causes |
Link |
This issue causes |
Status | In Testing [ 10301 ] | Stalled [ 10000 ] |
Assignee | Brandon Nesterenko [ JIRAUSER48702 ] | Michael Widenius [ monty ] |
Fix Version/s | 10.6.17 [ 29518 ] | |
Fix Version/s | 10.11.7 [ 29519 ] | |
Fix Version/s | 11.0.5 [ 29520 ] | |
Fix Version/s | 11.1.4 [ 29024 ] | |
Fix Version/s | 11.2.3 [ 29521 ] | |
Fix Version/s | 10.6 [ 24028 ] | |
Resolution | Fixed [ 1 ] | |
Status | Stalled [ 10000 ] | Closed [ 6 ] |
Link |
This issue relates to |
Link | This issue causes MDEV-33331 [ MDEV-33331 ] |
Link |
This issue causes |
Zendesk Related Tickets | 158556 183178 |