[MDEV-19660] wsrep_rec_get_foreign_key() is dereferencing a stale pointer to a page that was previously latched

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 5.5.33a-galera, 10.0.19-galera, 10.1.6, 10.2.0, 10.3.0, 10.4.0
Fix Version/s: 10.2.26, 10.1.41, 10.3.17, 10.4.7
Component/s: Galera, Storage Engine - InnoDB, Storage Engine - XtraDB

Description

In row_ins_foreign_check_on_constraint(), the clustered index record is passed to wsrep_append_foreign_key() after the page latch has been released. If another thread has changed the record in the meantime, this could lead to a crash when wsrep_rec_get_foreign_key() tries to access the record.

The following is the problematic code:

        btr_pcur_store_position(pcur, mtr);

        if (index == clust_index) {
                btr_pcur_copy_stored_position(cascade->pcur, pcur);
        } else {
                btr_pcur_store_position(cascade->pcur, mtr);
        }

        /* Committing the mini-transaction releases the page latch. */
        mtr_commit(mtr);

        ut_a(cascade->pcur->rel_pos == BTR_PCUR_ON);

        cascade->state = UPD_NODE_UPDATE_CLUSTERED;

#ifdef WITH_WSREP
        /* clust_rec still points into the page that is no longer latched. */
        err = wsrep_append_foreign_key(
                                        thr_get_trx(thr),
                                        foreign,
                                        clust_rec,
                                        clust_index,
                                        FALSE,
                                        (node) ? TRUE : FALSE);
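
The hazard can be illustrated with a small self-contained toy model. This is only a sketch, not InnoDB code: ToyPage, ToyCursor, concurrent_update() and demo() are hypothetical names invented for illustration, and the "latch" is just a flag. It assumes, as the comments below suggest, that storing the cursor position keeps a private copy of the record (old_rec) that stays valid after the latch is released, whereas a raw pointer into the page observes whatever another thread wrote afterwards.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <string>
#include <vector>

// Toy stand-in for a buffer pool page; NOT InnoDB's real page format.
struct ToyPage {
    std::vector<char> bytes;
    bool latched = false;  // stand-in for the page latch held by an mtr
};

// Toy stand-in for a persistent cursor. store_position() keeps a private
// copy of the record (old_rec), which is the assumption this sketch makes
// about what storing the cursor position provides.
struct ToyCursor {
    std::string old_rec;  // private copy, usable without any latch

    void store_position(const ToyPage& page, std::size_t offset,
                        std::size_t len) {
        assert(page.latched);  // copying is only safe under the latch
        old_rec.assign(page.bytes.data() + offset, len);
    }
};

// Simulates another thread rewriting the record once the latch is free.
void concurrent_update(ToyPage& page, std::size_t offset,
                       const std::string& value) {
    assert(!page.latched);
    std::memcpy(page.bytes.data() + offset, value.data(), value.size());
}

// What each access path observes after the latch has been released.
struct Observed {
    std::string via_page_pointer;  // analogous to passing clust_rec
    std::string via_saved_copy;    // analogous to passing pcur->old_rec
};

Observed demo() {
    ToyPage page;
    page.bytes = {'k', 'e', 'y', '1'};
    page.latched = true;  // the mini-transaction holds the latch

    const char* clust_rec = page.bytes.data();  // points into the page
    ToyCursor pcur;
    pcur.store_position(page, 0, 4);

    page.latched = false;  // analogous to mtr_commit(): latch released
    concurrent_update(page, 0, "key2");

    return {std::string(clust_rec, 4), pcur.old_rec};
}
```

In this model the raw pointer reads the overwritten bytes while the saved copy still holds the original key. In the real server the consequence can be worse than reading changed data, because the page itself may no longer hold the record at that address, which is the crash described above.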

Activity

Marko Mäkelä added a comment - 2019-05-31 11:39

As far as I can tell, this was introduced in 5.5.25-galera, 10.0.19-galera, 10.1.6.

Jan Lindström (Inactive) added a comment - 2019-06-04 03:50

In row_ins_check_foreign_constraint(), the same function is called inside an active mtr.

Jan Lindström (Inactive) added a comment - 2019-06-04 10:25

https://github.com/MariaDB/server/commit/42a1ad314700b705077333f42393250c978c92d7

Marko Mäkelä added a comment - 2019-06-10 05:43

I think that we need a test case that exercises the error handling code. Note: I am not asking for a test that reproduces the race condition.

Also, please address my review comments regarding the code changes.

Marko Mäkelä added a comment - 2019-06-10 13:00

I wonder if we could simply replace clust_rec with cascade->pcur->old_rec in the call.

Jan Lindström (Inactive) added a comment - 2019-06-18 09:29

https://github.com/MariaDB/server/commit/859052dfcc3da3d61e3023c7401ef009cea50bcd

Marko Mäkelä added a comment - 2019-06-25 06:19

Can we merely replace the clust_rec with cascade->pcur->old_rec and omit all other changes? I am concerned about adding so much code for error handling or reporting, especially when that code is not being backed by an ‘organic’ test case that does not resort to fault injection.

Jan Lindström (Inactive) added a comment - 2019-07-01 06:06

Sure, I will do that for 10.1-10.4.

People

Assignee: Jan Lindström (Inactive)

Reporter: Thirunarayanan Balathandayuthapani

Votes: 1

Watchers: 5

Dates

Created: 2019-05-31 11:26

Updated: 2024-07-07 22:47

Resolved: 2019-07-09 12:17

MariaDB Server