Details
- 
    Bug 
- 
    Status: Closed (View Workflow)
- 
    Major 
- 
    Resolution: Fixed
- 
    10.4.11
- 
    OS: CentOS Linux release 7.6.1810 (Core).
Description
When RQG is used with --galera=mms, master node 1 and slave node 2 disconnect from the cluster after HA_ERR_KEY_NOT_FOUND and HA_ERR_FOUND_DUPP_KEY accordingly.
MariaDB Version 10.4.11-MariaDB-debug: Repository: MariaDB/server; branch 10.4; Revision ae72205e31e7665238c1757ede352d9ca53d5327.
Galera lib 26.4.3(r4548) debug: Repository: MariaDB/galera; branch mariadb-4.x; Revision a5431753a3f6bdd348adcbca00e3450ba0ef9045.
RQG: Repository: MariaDB/randgen; master branch; Revision bea5b6b07d08ba6349d8a6ff4356b01678822727.
Run:
| perl ./runall-new.pl --grammar=conf/galera/galera_stress.yy --gendata=conf/galera/galera_stress.zz --duration=600 --queries=200M --threads=1 --galera=mms --basedir=/home/stepan/mariadb/10.4/git --vardir=/home/stepan/rqg/github/var "--mysqld=--wsrep-provider=/usr/lib/libgalera_smm_4.so" "--mysqld=--wsrep_sst_method=rsync" "--mysqld=--core" "--mysqld=--general-log" "--mysqld=--general-log-file=queries.log" "--mysqld=--log-output=file" "--mysqld=--wsrep-debug=server" "--mysqld=--wsrep-sync-wait=15" "--mysqld=--wsrep_retry_autocommit=0" "--mysqld=--wsrep_log_conflicts=1" "--mysqld=--wsrep_on=ON"
 | 
Master node 1 :
:
2019-11-20 16:22:09 2 [ERROR] mysqld: Can't find record in 'table100_innodb_int'
2019-11-20 16:22:09 2 [ERROR] Slave SQL: Could not execute Update_rows_v1 event on table test.table100_innodb_int; Can't find record in 'table100_innodb_int', Error_code: 1032; handler error HA_ERR_KEY_NOT_FOUND; the event's master log FIRST, end_log_pos 321, Internal MariaDB error code: 1032
2019-11-20 16:22:09 2 [ERROR] WSREP: Failed to apply write set: gtid: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:130 server_id: 09f7e5f7-0ba1-11ea-a944-d7f74653ae06 client_id: 17 trx_id: 1748 flags: 3 (start_transaction | commit)
2019-11-20 16:22:09 2 [Note] [Debug] WSREP: server cnt7glr11.localdomain state change: connected -> disconnected
2019-11-20 16:22:09 2 [Note] WSREP: Server status change connected -> disconnected
2019-11-20 16:22:09 2 [Warning] Aborted connection 2 to db: 'unconnected' user: 'unauthenticated' host: '' (This connection closed normally without authentication)
| 2019-11-20 16:22:09 2 [Note] WSREP: assigned new next trx id: 1384 | 
| 2019-11-20 16:22:09 2 [ERROR] mysqld: Can't find record in 'table100_innodb_int' | 
| 2019-11-20 16:22:09 2 [ERROR] Slave SQL: Could not execute Update_rows_v1 event on table test.table100_innodb_int; Can't find record in 'table100_innodb_int', Error_code: 1032; handler error HA_ERR_KEY_NOT_FOUND; the event's master log FIRST, end_log_pos 321, Internal MariaDB error code: 1032 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ha_rollback_trans(2, FALSE) rolled back: REPLACE INTO `table100_innodb_int` ( `col_char_12`, `col_char_12_key` ) VALUES ( 'u', 406257664 ), ( 4083875840, 5 ) /* QNO 1280 CON_ID 17 */L՝: Can't find record in 'table100_innodb_int'; is_real 0 | 
| 2019-11-20 16:22:09 2 [Warning] WSREP: Event 3 Update_rows_v1 apply failed: 120, seqno 130 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ha_rollback_trans(2, TRUE) rolled back: REPLACE INTO `table100_innodb_int` ( `col_char_12`, `col_char_12_key` ) VALUES ( 'u', 406257664 ), ( 4083875840, 5 ) /* QNO 1280 CON_ID 17 */L՝: Can't find record in 'table100_innodb_int'; is_real 1 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep_after_apply 2 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep_thd_binlog_reset | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Log dummy write set 130 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep_thd_binlog_reset | 
| 2019-11-20 16:22:09 2 [ERROR] WSREP: Failed to apply write set: gtid: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:130 server_id: 09f7e5f7-0ba1-11ea-a944-d7f74653ae06 client_id: 17 trx_id: 1748 flags: 3 (start_transaction | commit) | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Closing send monitor... | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Closed send monitor. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: gcomm: terminating thread | 
| 2019-11-20 16:22:09 2 [Note] WSREP: gcomm: joining thread | 
| 2019-11-20 16:22:09 2 [Note] WSREP: gcomm: closing backend | 
| 2019-11-20 16:22:09 17 [Note] WSREP: assigned new next trx id: 1385 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: view(view_id(NON_PRIM,09f7e5f7,3) memb { | 
| 	0a54a855,0 | 
| } joined { | 
| } left { | 
| } partitioned { | 
| 	09f7e5f7,0 | 
| 	0d528ca2,0 | 
| }) | 
| 2019-11-20 16:22:09 2 [Note] WSREP: PC protocol downgrade 1 -> 0 | 
| 2019-11-20 16:22:09 17 [Note] WSREP: wsrep_sync_wait: thd->variables.wsrep_sync_wait= 15, mask= 4, thd->variables.wsrep_on= 1 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: view((empty)) | 
| 2019-11-20 16:22:09 2 [Note] WSREP: gcomm: closed | 
| 2019-11-20 16:22:09 0 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1 | 
| 2019-11-20 16:22:09 0 [Note] WSREP: Flow-control interval: [16, 16] | 
| 2019-11-20 16:22:09 0 [Note] WSREP: Received NON-PRIMARY. | 
| 2019-11-20 16:22:09 0 [Note] WSREP: Shifting SYNCED -> OPEN (TO: 130) | 
| 2019-11-20 16:22:09 0 [Note] WSREP: New SELF-LEAVE. | 
| 2019-11-20 16:22:09 0 [Note] WSREP: Flow-control interval: [0, 0] | 
| 2019-11-20 16:22:09 0 [Note] WSREP: Received SELF-LEAVE. Closing connection. | 
| 2019-11-20 16:22:09 0 [Note] WSREP: Shifting OPEN -> CLOSED (TO: -1) | 
| 2019-11-20 16:22:09 0 [Note] WSREP: RECV thread exiting 0: Success | 
| 2019-11-20 16:22:09 17 [Warning] WSREP: gcs_caused() returned -103 (Software caused connection abort) | 
| 2019-11-20 16:22:09 2 [Note] WSREP: recv_thread() joined. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Closing replication queue. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Closing slave action queue. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ####### processing CC -1, local, ordered | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ####### My UUID: 0a54a855-0ba1-11ea-93a9-d724df0d647a | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ####### ST not required | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ================================================ | 
| View: | 
|   id: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:-1 | 
|   status: non-primary | 
|   protocol_version: 4 | 
|   capabilities: MULTI-MASTER, CERTIFICATION, PARALLEL_APPLYING, REPLAY, ISOLATION, PAUSE, CAUSAL_READ, INCREMENTAL_WS, UNORDERED, PREORDERED, STREAMING, NBO | 
|   final: no | 
|   own_index: 0 | 
|   members(1): | 
| 	0: 0a54a855-0ba1-11ea-93a9-d724df0d647a, cnt7glr11.localdomain | 
| ================================================= | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Non-primary view | 
| 2019-11-20 16:22:09 2 [Note] [Debug] WSREP: server cnt7glr11.localdomain state change: synced -> connected | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Server status change synced -> connected | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ####### processing CC -1, local, ordered | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ####### My UUID: 0a54a855-0ba1-11ea-93a9-d724df0d647a | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ####### ST not required | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ================================================ | 
| View: | 
|   id: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:-1 | 
|   status: non-primary | 
|   protocol_version: 4 | 
|   capabilities: MULTI-MASTER, CERTIFICATION, PARALLEL_APPLYING, REPLAY, ISOLATION, PAUSE, CAUSAL_READ, INCREMENTAL_WS, UNORDERED, PREORDERED, STREAMING, NBO | 
|   final: yes | 
|   own_index: -1 | 
|   members(0): | 
| ================================================= | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Non-primary view | 
| 2019-11-20 16:22:09 2 [Note] [Debug] WSREP: server cnt7glr11.localdomain state change: connected -> disconnected | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Server status change connected -> disconnected | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:09 17 [Note] WSREP: wsrep_commit_empty(17) | 
| 2019-11-20 16:22:09 17 [Note] WSREP: wsrep_thd_binlog_reset | 
| 2019-11-20 16:22:09 17 [Note] WSREP: wsrep_thd_binlog_reset | 
| 2019-11-20 16:22:09 17 [Note] WSREP: dispatch_command leave | 
|     thd: 17 thd_ptr: 0x7f4eb0000af0 client_mode: local client_state: result trx_state: aborted | 
|     next_trx_id: 1385 trx_id: -1 seqno: -1 | 
|     is_streaming: 0 fragments: 0 | 
|     sql_errno: 1205 message: Lock wait timeout exceeded; try restarting transaction | 
|     command: 40 query: REPLACE INTO `table100_innodb_int` ( `col_char_12`, `col_char_12_key` )  | 
| 2019-11-20 16:22:09 0 [Note] WSREP: Service thread queue flushed. | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ####### Assign initial position for certification: 00000000-0000-0000-0000-000000000000:-1, protocol version: 5 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: Applier thread exiting ret: 0 thd: 2 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: closing applier 2 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep system thread: 2, 0x7f4f34000af0 closing | 
| 2019-11-20 16:22:09 2 [Warning] Aborted connection 2 to db: 'unconnected' user: 'unauthenticated' host: '' (This connection closed normally without authentication) | 
| 2019-11-20 16:22:09 2 [Note] WSREP: wsrep running threads now: 1 | 
| 2019-11-20 16:22:09 2 [Note] WSREP: ha_rollback_trans(2, TRUE) rolled back: (null): Can't find record in 'table100_innodb_int'; is_real 1
 | 
Slave node 2 :
:
2019-11-20 16:22:16 2 [ERROR] Slave SQL: Could not execute Write_rows_v1 event on table test.table10000_innodb_int; Duplicate entry '0' for key 'PRIMARY', Error_code: 1062; handler error HA_ERR_FOUND_DUPP_KEY; the event's master log FIRST, end_log_pos 8406, Internal MariaDB error code: 1062
2019-11-20 16:22:16 2 [ERROR] WSREP: Failed to apply write set: gtid: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:147 server_id: 09f7e5f7-0ba1-11ea-a944-d7f74653ae06 client_id: 17 trx_id: 1940 flags: 3 (start_transaction | commit)
2019-11-20 16:22:17 2 [Note] [Debug] WSREP: server cnt7glr11.localdomain state change: connected -> disconnected
2019-11-20 16:22:17 2 [Warning] Aborted connection 2 to db: 'unconnected' user: 'unauthenticated' host: '' (This connection closed normally without authentication)
| 2019-11-20 16:22:16 2 [Note] WSREP: assigned new next trx id: 986 | 
| 2019-11-20 16:22:16 2 [ERROR] Slave SQL: Could not execute Write_rows_v1 event on table test.table10000_innodb_int; Duplicate entry '0' for key 'PRIMARY', Error_code: 1062; handler error HA_ERR_FOUND_DUPP_KEY; the event's master log FIRST, end_log_pos 8406, Internal MariaDB error code: 1062 | 
| 2019-11-20 16:22:16 2 [Note] WSREP: ha_rollback_trans(2, FALSE) rolled back: INSERT IGNORE INTO `table10000_innodb_int` ( `col_int_key`, `col_char_12` ) VALUES ( 'b', 'w' ), ( 5, 'b' ) /* QNO 1682 CON_ID 17 */LÕ]: Duplicate entry '0' for key 'PRIMARY'; is_real 0 | 
| 2019-11-20 16:22:16 2 [Warning] WSREP: Event 64 Write_rows_v1 apply failed: 121, seqno 147 | 
| 2019-11-20 16:22:16 2 [Note] WSREP: ha_rollback_trans(2, TRUE) rolled back: INSERT IGNORE INTO `table10000_innodb_int` ( `col_int_key`, `col_char_12` ) VALUES ( 'b', 'w' ), ( 5, 'b' ) /* QNO 1682 CON_ID 17 */LÕ]: Duplicate entry '0' for key 'PRIMARY'; is_real 1 | 
| 2019-11-20 16:22:16 2 [Note] WSREP: wsrep_after_apply 2 | 
| 2019-11-20 16:22:16 2 [Note] WSREP: wsrep_thd_binlog_reset | 
| 2019-11-20 16:22:16 2 [Note] WSREP: Log dummy write set 147 | 
| 2019-11-20 16:22:16 2 [Note] WSREP: wsrep_thd_binlog_reset | 
| 2019-11-20 16:22:16 2 [ERROR] WSREP: Failed to apply write set: gtid: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:147 server_id: 09f7e5f7-0ba1-11ea-a944-d7f74653ae06 client_id: 17 trx_id: 1940 flags: 3 (start_transaction | commit) | 
| 2019-11-20 16:22:16 2 [Note] WSREP: Closing send monitor... | 
| 2019-11-20 16:22:16 2 [Note] WSREP: Closed send monitor. | 
| 2019-11-20 16:22:16 2 [Note] WSREP: gcomm: terminating thread | 
| 2019-11-20 16:22:16 2 [Note] WSREP: gcomm: joining thread | 
| 2019-11-20 16:22:16 2 [Note] WSREP: gcomm: closing backend | 
| 2019-11-20 16:22:17 2 [Note] WSREP: view(view_id(NON_PRIM,09f7e5f7,4) memb { | 
| 	0d528ca2,0 | 
| } joined { | 
| } left { | 
| } partitioned { | 
| 	09f7e5f7,0 | 
| }) | 
| 2019-11-20 16:22:17 2 [Note] WSREP: PC protocol downgrade 1 -> 0 | 
| 2019-11-20 16:22:17 2 [Note] WSREP: view((empty)) | 
| 2019-11-20 16:22:17 2 [Note] WSREP: gcomm: closed | 
| 2019-11-20 16:22:17 0 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1 | 
| 2019-11-20 16:22:17 0 [Note] WSREP: Flow-control interval: [16, 16] | 
| 2019-11-20 16:22:17 0 [Note] WSREP: Received NON-PRIMARY. | 
| 2019-11-20 16:22:17 0 [Note] WSREP: Shifting SYNCED -> OPEN (TO: 147) | 
| 2019-11-20 16:22:17 0 [Note] WSREP: New SELF-LEAVE. | 
| 2019-11-20 16:22:17 0 [Note] WSREP: Flow-control interval: [0, 0] | 
| 2019-11-20 16:22:17 0 [Note] WSREP: Received SELF-LEAVE. Closing connection. | 
| 2019-11-20 16:22:17 0 [Note] WSREP: Shifting OPEN -> CLOSED (TO: -1) | 
| 2019-11-20 16:22:17 0 [Note] WSREP: RECV thread exiting 0: Success | 
| 2019-11-20 16:22:17 2 [Note] WSREP: recv_thread() joined. | 
| 2019-11-20 16:22:17 2 [Note] WSREP: Closing replication queue. | 
| 2019-11-20 16:22:17 2 [Note] WSREP: Closing slave action queue. | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ####### processing CC -1, local, ordered | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ####### My UUID: 0d528ca2-0ba1-11ea-a7e1-5a09a105dcfd | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ####### ST not required | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ================================================ | 
| View: | 
|   id: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:-1 | 
|   status: non-primary | 
|   protocol_version: 4 | 
|   capabilities: MULTI-MASTER, CERTIFICATION, PARALLEL_APPLYING, REPLAY, ISOLATION, PAUSE, CAUSAL_READ, INCREMENTAL_WS, UNORDERED, PREORDERED, STREAMING, NBO | 
|   final: no | 
|   own_index: 0 | 
|   members(1): | 
| 	0: 0d528ca2-0ba1-11ea-a7e1-5a09a105dcfd, cnt7glr11.localdomain | 
| ================================================= | 
| 2019-11-20 16:22:17 2 [Note] WSREP: Non-primary view | 
| 2019-11-20 16:22:17 2 [Note] [Debug] WSREP: server cnt7glr11.localdomain state change: synced -> connected | 
| 2019-11-20 16:22:17 2 [Note] WSREP: Server status change synced -> connected | 
| 2019-11-20 16:22:17 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:17 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ####### processing CC -1, local, ordered | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ####### My UUID: 0d528ca2-0ba1-11ea-a7e1-5a09a105dcfd | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ####### ST not required | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ================================================ | 
| View: | 
|   id: 09f8e15a-0ba1-11ea-bed3-56e4b108bea5:-1 | 
|   status: non-primary | 
|   protocol_version: 4 | 
|   capabilities: MULTI-MASTER, CERTIFICATION, PARALLEL_APPLYING, REPLAY, ISOLATION, PAUSE, CAUSAL_READ, INCREMENTAL_WS, UNORDERED, PREORDERED, STREAMING, NBO | 
|   final: yes | 
|   own_index: -1 | 
|   members(0): | 
| ================================================= | 
| 2019-11-20 16:22:17 2 [Note] WSREP: Non-primary view | 
| 2019-11-20 16:22:17 2 [Note] [Debug] WSREP: server cnt7glr11.localdomain state change: connected -> disconnected | 
| 2019-11-20 16:22:17 2 [Note] WSREP: Server status change connected -> disconnected | 
| 2019-11-20 16:22:17 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:17 2 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification. | 
| 2019-11-20 16:22:17 0 [Note] WSREP: Service thread queue flushed. | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ####### Assign initial position for certification: 00000000-0000-0000-0000-000000000000:-1, protocol version: 5 | 
| 2019-11-20 16:22:17 2 [Note] WSREP: Applier thread exiting ret: 0 thd: 2 | 
| 2019-11-20 16:22:17 2 [Note] WSREP: closing applier 2 | 
| 2019-11-20 16:22:17 2 [Note] WSREP: wsrep system thread: 2, 0x7f36d4000af0 closing | 
| 2019-11-20 16:22:17 2 [Warning] Aborted connection 2 to db: 'unconnected' user: 'unauthenticated' host: '' (This connection closed normally without authentication) | 
| 2019-11-20 16:22:17 2 [Note] WSREP: wsrep running threads now: 1 | 
| 2019-11-20 16:22:17 2 [Note] WSREP: ha_rollback_trans(2, TRUE) rolled back: (null): Duplicate entry '0' for key 'PRIMARY'; is_real 1 | 
| 2019-11-20 16:32:02 13 [Note] WSREP: assigned new next trx id: 987 | 
| 2019-11-20 16:32:02 13 [Note] WSREP: assigned new next trx id: 987 | 
| 2019-11-20 16:32:02 0 [Note] /home/stepan/mariadb/10.4/git/sql/mysqld (initiated by: [stepan] @ localhost [127.0.0.1]): Normal shutdown | 
| 2019-11-20 16:32:02 0 [Note] WSREP: Shutdown replication
 | 
| # 2019-11-20T16:31:58 [14601] No differences were found between servers 0 and 1. | 
| # 2019-11-20T16:31:58 [14601] No differences were found between servers 1 and 2. | 
| [...] | 
| # 2019-11-20T16:32:04 [14601] [14601] ./runall-new.pl will exit with exit status STATUS_OK (0)
 | 
