[MDEV-23446] UPDATE does not insert history row if the row is not changed - Jira

Details

Type: Bug
Status: Closed (View Workflow)
Priority: Blocker
Resolution: Fixed
Affects Version/s: 10.3.18, 10.3(EOL)
Fix Version/s: 10.3.28, 10.4.18, 10.5.9
Component/s: Data Manipulation - Update, Versioned Tables
Labels:
None

Description

UPDATE queries that do not change any values in the rows they match on versioned tables do not insert historical rows.

Example:

CREATE TABLE test (a INT) WITH SYSTEM VERSIONING;

INSERT INTO test (a) VALUES (1);

UPDATE test SET a = 1 WHERE a = 1;

SELECT *, row_start, row_end FROM test FOR SYSTEM_TIME ALL;

Disclaimer: I don't know whether this is a bug or was purposefully designed like this. In the latter case, I want to make a case to reconsider it:

This behavior contrasts with triggers: UPDATE queries that do not change any values in the rows they match still carry out registered triggers for each row.
Now some UPDATEs cannot be logged historically. Versioned table fail to capture that they happened. Yet logging such queries and the timestamps at which they were issued can still be valuable information.

Attachments

Issue Links

causes

MDEV-24522 Assertion `inited==NONE' fails upon UPDATE on versioned table with unique blob

Closed

MDEV-25644 UPDATE not working properly on transaction precise system versioned table

Closed

MDEV-31944 UPDATE creates new row in system-versioned tables even if there is no value to change

Closed

MDEV-32124 System-Versioned Tables, extra rows with UPDATE

Closed

is blocked by

MDEV-23644 Assertion on evaluating foreign referential action for self-reference in system versioned table

Closed

is caused by

MDEV-17089 Updating a System Versioned Table always causes a row to be updated, regardless if the data is the same or not

Closed

relates to

MDEV-24451 Record retains new value on DML when history row insert fails

Closed

MDEV-16226 TRX_ID-based System Versioning refactoring

Stalled

MDEV-22540 ER_DUP_ENTRY upon REPLACE or Assertion `transactional_table || !changed || thd->transaction.stmt.modified_non_trans_table' failed

Closed

MDEV-23100 ODKU of non-versioning column inserts history row

Closed

MDEV-24064 Utility columns for system-versioned tables

Open

MDEV-26778 row_start is not updated in current row for InnoDB

Closed

(1 is caused by, 6 relates to)

Activity

Ascending order - Click to sort in descending order

Sergei Golubchik added a comment - 2020-09-02 08:57

Looks like a bug to me

Sergei Golubchik added a comment - 2020-09-02 08:57 Looks like a bug to me

Michael Erickson (Inactive) added a comment - 2020-09-10 17:01

We had a similar bug in Clustrix caused by a similar optimization, though it manifest as a MVCC problem rather than a system versioning problem. (Bug 31880 for those with access to the Clustrix bug database.) If it helps, here was the scenario from that bug:

(All transactions use repeatable-read isolation.) First, create a table with some data, and open a transaction:

> CREATE TABLE t (a INT, b INT);

Query OK, 0 rows affected (0.02 sec)

> INSERT INTO t VALUES (1, 10), (3, 30);

Query OK, 2 rows affected (0.03 sec)

> BEGIN;

Query OK, 0 rows affected (0.00 sec)

> SELECT * FROM t;

+------+------+

| a    | b    |

+------+------+

|    3 |   30 |

|    1 |   10 |

+------+------+

2 rows in set (0.00 sec)

Then, while keeping that transaction open, in another session update the table:

> BEGIN;

Query OK, 0 rows affected (0.00 sec)

> INSERT INTO t VALUES (5, 50);

Query OK, 1 row affected (0.01 sec)

> UPDATE t SET b = 100 WHERE a = 1;

Query OK, 1 row affected (0.01 sec)

> SELECT * FROM t;

+------+------+

| a    | b    |

+------+------+

|    3 |   30 |

|    5 |   50 |

|    1 |  100 |

+------+------+

3 rows in set (0.00 sec)

> COMMIT;

Query OK, 0 rows affected (0.00 sec)

(The insert is just to prove we're not crazy.) Back in the first transaction, we can't see those modifications:

> SELECT * FROM t;

+------+------+

| a    | b    |

+------+------+

|    3 |   30 |

|    1 |   10 |

+------+------+

2 rows in set (0.00 sec)

But we should be able to see the results of our own modifications. However, if we update to the future value of the row, we cannot see it!

> UPDATE t SET b = 100 WHERE a = 1;

Query OK, 1 row affected (7.09 sec)

> SELECT * FROM t;

+------+------+

| a    | b    |

+------+------+

|    3 |   30 |

|    1 |   10 |

+------+------+

2 rows in set (0.00 sec)

This seems to only happen if we use the future value of the row. We can see it if we update to a different value:

> UPDATE t SET b = 1000 WHERE a = 1;

Query OK, 1 row affected (0.00 sec)

> SELECT * FROM t;

+------+------+

|  a   | b    |

+------+------+

|    3 |   30 |

|    1 | 1000 |

+------+------+

2 rows in set (0.00 sec)

Perhaps this can serve as inspiration for additional testcases.

Michael Erickson (Inactive) added a comment - 2020-09-10 17:01 We had a similar bug in Clustrix caused by a similar optimization, though it manifest as a MVCC problem rather than a system versioning problem. (Bug 31880 for those with access to the Clustrix bug database.) If it helps, here was the scenario from that bug: (All transactions use repeatable-read isolation.) First, create a table with some data, and open a transaction: > CREATE TABLE t (a INT, b INT); Query OK, 0 rows affected (0.02 sec) > INSERT INTO t VALUES (1, 10), (3, 30); Query OK, 2 rows affected (0.03 sec) > BEGIN; Query OK, 0 rows affected (0.00 sec) > SELECT * FROM t; +------+------+ | a | b | +------+------+ | 3 | 30 | | 1 | 10 | +------+------+ 2 rows in set (0.00 sec) Then, while keeping that transaction open, in another session update the table: > BEGIN; Query OK, 0 rows affected (0.00 sec) > INSERT INTO t VALUES (5, 50); Query OK, 1 row affected (0.01 sec) > UPDATE t SET b = 100 WHERE a = 1; Query OK, 1 row affected (0.01 sec) > SELECT * FROM t; +------+------+ | a | b | +------+------+ | 3 | 30 | | 5 | 50 | | 1 | 100 | +------+------+ 3 rows in set (0.00 sec) > COMMIT; Query OK, 0 rows affected (0.00 sec) (The insert is just to prove we're not crazy.) Back in the first transaction, we can't see those modifications: > SELECT * FROM t; +------+------+ | a | b | +------+------+ | 3 | 30 | | 1 | 10 | +------+------+ 2 rows in set (0.00 sec) But we should be able to see the results of our own modifications. However, if we update to the future value of the row, we cannot see it! > UPDATE t SET b = 100 WHERE a = 1; Query OK, 1 row affected (7.09 sec) > SELECT * FROM t; +------+------+ | a | b | +------+------+ | 3 | 30 | | 1 | 10 | +------+------+ 2 rows in set (0.00 sec) This seems to only happen if we use the future value of the row. We can see it if we update to a different value: > UPDATE t SET b = 1000 WHERE a = 1; Query OK, 1 row affected (0.00 sec) > SELECT * FROM t; +------+------+ | a | b | +------+------+ | 3 | 30 | | 1 | 1000 | +------+------+ 2 rows in set (0.00 sec) Perhaps this can serve as inspiration for additional testcases.

Nikita Malyavin added a comment - 2020-10-25 17:18

Changes are required: IGNORE behavior should be handled for `vers_insert_history_row`; gotos to the label in the middle of the function body should be removed

Nikita Malyavin added a comment - 2020-10-25 17:18 Changes are required: IGNORE behavior should be handled for `vers_insert_history_row`; gotos to the label in the middle of the function body should be removed

Nikita Malyavin added a comment - 2020-10-26 10:41

After the discussion we decided not to argue too much on IGNORE behavior, and leave it out of this ticket's scope. So this will be left as is

Nikita Malyavin added a comment - 2020-10-26 10:41 After the discussion we decided not to argue too much on IGNORE behavior, and leave it out of this ticket's scope. So this will be left as is

Aleksey Midenkov added a comment - 2020-12-12 02:02 - edited

versioning.foreign fails still. Might be different bug of execution foreign ON constraint. Fixed by ~~MDEV-23644~~

Aleksey Midenkov added a comment - 2020-12-12 02:02 - edited versioning.foreign fails still. Might be different bug of execution foreign ON constraint. Fixed by MDEV-23644

Oleksandr Byelkin added a comment - 2021-01-06 10:21 - edited

Common people, first goto backward, then using "error" variable from other scope by place of goto.

Oleksandr Byelkin added a comment - 2021-01-06 10:21 - edited Common people, first goto backward, then using "error" variable from other scope by place of goto.

Oleksandr Byelkin added a comment - 2021-01-06 10:22

I will make urgent fix with the variable, plase fix it properly.

Oleksandr Byelkin added a comment - 2021-01-06 10:22 I will make urgent fix with the variable, plase fix it properly.

Marko Mäkelä added a comment - 2021-01-07 07:17

midenok, I think that we must absolutely cover this code path in multi_update::send_data() with a test case:

      if (has_vers_fields && table->versioned(VERS_TIMESTAMP))

        store_record(table, record[2]);

        if (vers_insert_history_row(table))

          restore_record(table, record[2]);

          goto error;

This is causing an uninitialized variable to be used in another scope:

      if (!can_compare_record || compare_record(table))

	int error;

…

error:

/*

              If (ignore && error == is ignorable) we don't have to

              do anything; otherwise...

*/

            myf flags= 0;

            if (table->file->is_fatal_error(error, HA_CHECK_ALL))

              flags|= ME_FATAL; /* Other handler errors are fatal */

            prepare_record_for_error_message(error, table);

            table->file->print_error(error,MYF(flags));

            DBUG_RETURN(1);

If I understood it correctly, the fix that sanja is considering would only silence the GCC -Wmaybe-uninitialized in optimized builds, but we would use error=0 (no error) in that error handling code path, which feels wrong to me. Hence, we should make sure that this code path is covered by a test, and an appropriate error will be reported.

Marko Mäkelä added a comment - 2021-01-07 07:17 midenok , I think that we must absolutely cover this code path in multi_update::send_data() with a test case: if (has_vers_fields && table->versioned(VERS_TIMESTAMP)) { store_record(table, record[2]); if (vers_insert_history_row(table)) { restore_record(table, record[2]); goto error; } This is causing an uninitialized variable to be used in another scope: if (!can_compare_record || compare_record(table)) { int error; … { error: /* If (ignore && error == is ignorable) we don't have to do anything; otherwise... */ myf flags= 0; if (table->file->is_fatal_error(error, HA_CHECK_ALL)) flags|= ME_FATAL; /* Other handler errors are fatal */ prepare_record_for_error_message(error, table); table->file->print_error(error,MYF(flags)); DBUG_RETURN(1); If I understood it correctly, the fix that sanja is considering would only silence the GCC -Wmaybe-uninitialized in optimized builds, but we would use error=0 (no error) in that error handling code path, which feels wrong to me. Hence, we should make sure that this code path is covered by a test, and an appropriate error will be reported.

Oleksandr Byelkin added a comment - 2021-01-07 08:31

188b328335d5c2a61d21f528fad19a685f9511ef

Oleksandr Byelkin added a comment - 2021-01-07 08:31 188b328335d5c2a61d21f528fad19a685f9511ef

Aleksey Midenkov added a comment - 2021-01-12 15:54

marko Agree and already started that in ~~MDEV-24451~~. Some errors are hard to trigger without DBUG_EXECUTE_IF() (like failure of insert history row). Do you have an idea how to fail row insert in release build?

Aleksey Midenkov added a comment - 2021-01-12 15:54 marko Agree and already started that in MDEV-24451 . Some errors are hard to trigger without DBUG_EXECUTE_IF() (like failure of insert history row). Do you have an idea how to fail row insert in release build?

Marko Mäkelä added a comment - 2021-01-13 15:58

midenok, an insert should be able to fail on lock wait timeout. Maybe you could issue a locking read (SELECT…LOCK IN SHARE MODE) from another connection?

Marko Mäkelä added a comment - 2021-01-13 15:58 midenok , an insert should be able to fail on lock wait timeout. Maybe you could issue a locking read ( SELECT…LOCK IN SHARE MODE ) from another connection?

People

Assignee:: Aleksey Midenkov

Reporter:: Remy Fox

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 2020-08-11 09:36

Updated:: 2023-09-08 22:29

Resolved:: 2021-01-12 15:56

Git Integration

Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.

MariaDB Server