Details
-
Bug
-
Status: Closed (View Workflow)
-
Critical
-
Resolution: Fixed
-
10.3.9, 10.4(EOL), 10.5, 10.6
Description
After the following commit:
commit 1748a31ae8d69e4939336f644f884e9de3039e7f
|
Author: Marko Mäkelä <marko.makela@mariadb.com>
|
Date: Tue Jul 3 15:10:06 2018 +0300
|
MDEV-16675 Unnecessary explicit lock acquisition during UPDATE or DELETE
|
|
lock_rec_convert_impl_to_expl() does not create explicit lock if the caller holds implicit lock.
Suppose we have the following call stack:
▾ lock_rec_convert_impl_to_expl
|
▾ lock_clust_rec_read_check_and_lock
|
▾ sel_set_rec_lock
|
▸ row_search_mvcc
|
If lock type is LOCK_GAP or LOCK_ORDINARY, and the transaction holds implicit lock for the record, then explicit gap-lock will not be set for the record, as lock_rec_convert_impl_to_expl() returns true and lock_rec_convert_impl_to_expl() bypasses lock_rec_lock() call.
The following test shows the issue:
--source include/have_innodb.inc
|
--source include/count_sessions.inc
|
CREATE TABLE t(a INT UNSIGNED PRIMARY KEY) ENGINE=InnoDB;
|
INSERT INTO t VALUES (10), (30);
|
--connect (con1,localhost,root,,)
|
SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
|
BEGIN;
|
INSERT INTO t VALUES (20);
|
SELECT * FROM t WHERE a BETWEEN 10 AND 30;
|
--connection default
|
SET session innodb_lock_wait_timeout=1;
|
--error ER_LOCK_WAIT_TIMEOUT
|
INSERT INTO t VALUES (15);
|
--disconnect con1
|
DROP TABLE t;
|
--source include/wait_until_count_sessions.inc
|
Attachments
Issue Links
- blocks
-
MDEV-20605 Awaken transaction can miss inserted by other transaction records due to wrong persistent cursor restoration
-
- Closed
-
- is caused by
-
MDEV-16675 Unnecessary explicit lock acquisition during UPDATE or DELETE
-
- Closed
-
The initial idea for the fix was to bypass lock_rec_convert_impl_to_expl() call if requested lock is not LOCK_REC_NOT_GAP in lock_clust_rec_read_check_and_lock() and lock_sec_rec_read_check_and_lock(). But after I implemented it there were some rpl tests failed.
Before
MDEV-16675when lock_rec_convert_impl_to_expl() was invoked from lock_clust_rec_read_check_and_lock()/lock_sec_rec_read_check_and_lock(), and gap lock was acquired, explicit non-gap lock was created from implicit one, and then, after lock_rec_convert_impl_to_expl() call, explicit gap lock was created with lock_rec_lock() call.So, before
MDEV-16675, when gap lock was acquired for the record with implicit lock of the same transaction, two explicit locks were created: 1) non-gap lock was converted from implicit lock, and 2) gap lock created by lock_rec_lock().It looks like implicit to explicit lock conversion is unnecessary for that case. But when I removed it, I got some rpl tests failed.
Particularly binlog_encryption.rpl_parallel_innodb_lock_conflict failed here:
# Create a group commit with INSERT and DELETE, in that order.
# The bug was that while the INSERT's insert intention lock does not block
# the DELETE, the DELETE's gap lock _does_ block the INSERT. This could cause
# a deadlock on the slave.
--connection con1
SET debug_sync='commit_after_release_LOCK_prepare_ordered SIGNAL master_queued1 WAIT_FOR master_cont1';
send INSERT INTO t4 VALUES (7, NULL);
--connection server_1
SET debug_sync='now WAIT_FOR master_queued1';
--connection con2
SET debug_sync='commit_after_release_LOCK_prepare_ordered SIGNAL master_queued2';
send DELETE FROM t4 WHERE b <= 3;
--connection server_1
SET debug_sync='now WAIT_FOR master_queued2';
SET debug_sync='now SIGNAL master_cont1';
--connection con1
REAP;
--connection con2
REAP;
SET debug_sync='RESET';
--save_master_pos
--connection server_2
--source include/start_slave.inc
--sync_with_master
--source include/stop_slave.inc
SELECT * FROM t4 ORDER BY a;
(7, NULL) row was not found on slave.
So, it looks like that explicit non-gap lock is necessary and used somehow, I don't understand how. When I left the same logic as it was before
MDEV-16675if gap lock is acquired, all tests passed.