Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-15199

Triangular FKs - Cascade delete causes broken referential integrity

Details

    Description

      See SQL test case below for detail. See this thread for problem discovery:
      https://lists.launchpad.net/maria-developers/msg11102.html

      For a "triangular" table FK structure (see attached png visual) deleting parent record, should delete both children, but the ON DELETE CASCADE does not delete record in 2nd child table when 2nd child has ON DELETE SET NULL constraint to 1st child

      Referential integrity is broken after parent is deleted. LEFT JOINs do not show broken FK.

      MySQL 5.7.21 and MariaDB 10.1.30 do not exhibit this broken behaviour.

      -- Server version: 10.2.12-MariaDB FreeBSD Ports
       
      -- setup DB structure
       
      SET FOREIGN_KEY_CHECKS=0;
       
      CREATE TABLE member (
        id int(11) NOT NULL AUTO_INCREMENT,
        PRIMARY KEY (id)
      ) ENGINE=InnoDB;
       
      INSERT INTO member VALUES (1);
       
      CREATE TABLE address (
        id int(11) NOT NULL AUTO_INCREMENT,
        member_id int(11) NOT NULL,
        PRIMARY KEY (id),
        KEY address_FI_1 (member_id),
        CONSTRAINT address_FK_1 FOREIGN KEY (member_id) REFERENCES member (id) ON DELETE CASCADE ON UPDATE CASCADE
      ) ENGINE=InnoDB;
       
      INSERT INTO address VALUES (2,1);
       
      CREATE TABLE payment_method (
        id int(11) NOT NULL AUTO_INCREMENT,
        member_id int(11) NOT NULL,
        cardholder_address_id int(11) DEFAULT NULL,
        PRIMARY KEY (id),
        KEY payment_method_FI_1 (member_id),
        KEY payment_method_FI_2 (cardholder_address_id),
        CONSTRAINT payment_method_FK_1 FOREIGN KEY (member_id) REFERENCES member (id) ON DELETE CASCADE ON UPDATE CASCADE,
        CONSTRAINT payment_method_FK_2 FOREIGN KEY (cardholder_address_id) REFERENCES address (id) ON DELETE SET NULL ON UPDATE CASCADE
      ) ENGINE=InnoDB;
       
      INSERT INTO payment_method VALUES (3,1,2);
       
      SET FOREIGN_KEY_CHECKS=1;
       
      -- show inserted data
       
      SELECT * FROM member;
      +----+
      | id |
      +----+
      |  1 |
      +----+
       
      SELECT * FROM address;
      +----+-----------+
      | id | member_id |
      +----+-----------+
      |  2 |         1 |
      +----+-----------+
       
      SELECT * FROM payment_method;
      +----+-----------+-----------------------+
      | id | member_id | cardholder_address_id |
      +----+-----------+-----------------------+
      |  3 |         1 |                     2 |
      +----+-----------+-----------------------+
       
      -- finished setup now delete the central `member` record 
      DELETE FROM member WHERE id = 1;
       
      -- check the resulting state
       
      SELECT * FROM member;
      -- empty set, correct
       
      SELECT * FROM address;
      -- empty set, correct
       
      SELECT * FROM payment_method;
      +----+-----------+-----------------------+
      | id | member_id | cardholder_address_id |
      +----+-----------+-----------------------+
      |  3 |         1 |                  NULL |
      +----+-----------+-----------------------+
      -- should be an empty set
      -- referential integrity is broken: payment_method.member_id references a non-existent member.id
       
      -- inconsistency continues during subsequent left join queries
       
      SELECT
        payment_method.id, member_id
      FROM
        payment_method
      LEFT JOIN
        member ON member.id=payment_method.member_id
      WHERE
        payment_method.member_id IS NOT NULL AND member.id IS NULL;
       
      --  empty set, when it should not be
      -- note that if you dump and insert the broken DB, then above left join DOES show the broken FK, which is how I discovered it
      

      Attachments

        Issue Links

          Activity

            oschonrock, I believe that the MySQL 5.7.2 change may have introduced this corruption, but nobody noticed until now.

            For what it is worth, if you look at the commit, it is changing the file mysql-test/suite/innodb/r/innodb-index-online-fk.result. This result change looks acceptable to me, as there are multiple FOREIGN KEY constraints between the tables, and the order of evaluating SET NULL and CASCADE is affected based on the code change.

            marko Marko Mäkelä added a comment - oschonrock , I believe that the MySQL 5.7.2 change may have introduced this corruption, but nobody noticed until now. For what it is worth, if you look at the commit, it is changing the file mysql-test/suite/innodb/r/innodb-index-online-fk.result . This result change looks acceptable to me, as there are multiple FOREIGN KEY constraints between the tables, and the order of evaluating SET NULL and CASCADE is affected based on the code change.
            oschonrock Oliver Schonrock added a comment - - edited

            marko, you're right. Notwithstanding my limited understanding of that code, that does indeed look like it might have been the trigger for the cascade delete break.

            Wow, so that means potentially many users since GA of MySQL 5.7 and MariaDB 10.2, who are using such FK structures have these "failed cascade delete" orphan records lying around. Perhaps what we are doing is more unusual than I thought, or perhaps it is down to "not being noticed".

            As stated in the original report, finding the "orphaned" record is not obvious. By FK doesn't work:

            MariaDB [test]> select * from payment_method;
            +----+-----------+-----------------------+
            | id | member_id | cardholder_address_id |
            +----+-----------+-----------------------+
            |  3 |         1 |                  NULL |
            +----+-----------+-----------------------+
            1 row in set (0.00 sec)
             
            MariaDB [test]> select * from payment_method where member_id = 1; 
            Empty set (0.00 sec)
            

            and a LEFT JOIN won't find it either as shown in original report. As I explained, the only way I found it, is by mysqldump followed by inserting that dump and then running the LEFT JOIN.

            If this gets fixed by merging the upstream fix/revert to stack based recursion for FK cascade, would it make sense to alert MariaDB users (and MySQL?) that they may have these orphaned records silently lying around in their DB?

            BTW: I also double confirmed that MySQL 5.7.20 exhibits the broken behaviour (while 5.7.21 does not).

            Thanks for your help.

            oschonrock Oliver Schonrock added a comment - - edited marko , you're right. Notwithstanding my limited understanding of that code, that does indeed look like it might have been the trigger for the cascade delete break. Wow, so that means potentially many users since GA of MySQL 5.7 and MariaDB 10.2, who are using such FK structures have these "failed cascade delete" orphan records lying around. Perhaps what we are doing is more unusual than I thought, or perhaps it is down to "not being noticed". As stated in the original report, finding the "orphaned" record is not obvious. By FK doesn't work: MariaDB [test]> select * from payment_method; + ----+-----------+-----------------------+ | id | member_id | cardholder_address_id | + ----+-----------+-----------------------+ | 3 | 1 | NULL | + ----+-----------+-----------------------+ 1 row in set (0.00 sec)   MariaDB [test]> select * from payment_method where member_id = 1; Empty set (0.00 sec) and a LEFT JOIN won't find it either as shown in original report. As I explained, the only way I found it, is by mysqldump followed by inserting that dump and then running the LEFT JOIN . If this gets fixed by merging the upstream fix/revert to stack based recursion for FK cascade, would it make sense to alert MariaDB users (and MySQL?) that they may have these orphaned records silently lying around in their DB? BTW: I also double confirmed that MySQL 5.7.20 exhibits the broken behaviour (while 5.7.21 does not). Thanks for your help.

            marko For what it's worth, once the broken CASCADE DELETE has occurred, a server restart does not restore sanity.

            -- server restart performed here
             
            Server version: 10.2.12-MariaDB FreeBSD Ports
             
            MariaDB [test]> select * from payment_method where member_id = 1;
            Empty set (0.00 sec)
             
            MariaDB [test]> select * from payment_method;
            +----+-----------+-----------------------+
            | id | member_id | cardholder_address_id |
            +----+-----------+-----------------------+
            |  3 |         1 |                  NULL |
            +----+-----------+-----------------------+
            1 row in set (0.00 sec)
             
            MariaDB [test]> select * from member;
            Empty set (0.00 sec)
            

            oschonrock Oliver Schonrock added a comment - marko For what it's worth, once the broken CASCADE DELETE has occurred, a server restart does not restore sanity. -- server restart performed here   Server version: 10.2.12-MariaDB FreeBSD Ports   MariaDB [test]> select * from payment_method where member_id = 1; Empty set (0.00 sec)   MariaDB [test]> select * from payment_method; + ----+-----------+-----------------------+ | id | member_id | cardholder_address_id | + ----+-----------+-----------------------+ | 3 | 1 | NULL | + ----+-----------+-----------------------+ 1 row in set (0.00 sec)   MariaDB [test]> select * from member; Empty set (0.00 sec)

            marko I had a quick try to see what other methods might detect the problem, so users can see if they have been affected. And then what options exist to fix broken DBs without dump/reload.

            CHECK TABLE reports the problem:

            MariaDB [test]> check table payment_method;
            +---------------------+-------+----------+----------------------------------------------------------------------+
            | Table               | Op    | Msg_type | Msg_text                                                             |
            +---------------------+-------+----------+----------------------------------------------------------------------+
            | test.payment_method | check | Warning  | InnoDB: Index 'payment_method_FI_1' contains 0 entries, should be 1. |
            | test.payment_method | check | error    | Corrupt                                                              |
            +---------------------+-------+----------+----------------------------------------------------------------------+
            2 rows in set (0.00 sec)
            

            and obviously, so does mysqlcheck

            $ mysqlcheck -uroot -p test
            test.address                                       OK
            test.member                                        OK
            test.payment_method
            Warning  : InnoDB: Index payment_method_FI_1 is marked as corrupted
            error    : Corrupt
            

            For InnoDB, the only suggested option for repair is:

            If you need to rebuild an InnoDB table because a CHECK TABLE operation indicates that a table upgrade is required, use mysqldump to create a dump file and mysql to reload the file.

            What I did on our production DB (not wanting to take it down, dump & insert) is a I manually deleted the orphaned record by its PK (which is hard to know in a real dataset). Having done Some trial and error shows that this manual DELETE is easier if you do a "null alteration" first (which makes CHECK TABLE happy), then run the LEFT JOIN, which now works, and then you can easily clean up the orphans.

             
            MariaDB [test]> check table payment_method;
            +---------------------+-------+----------+----------------------------------------------------------------------+
            | Table               | Op    | Msg_type | Msg_text                                                             |
            +---------------------+-------+----------+----------------------------------------------------------------------+
            | test.payment_method | check | Warning  | InnoDB: Index 'payment_method_FI_1' contains 0 entries, should be 1. |
            | test.payment_method | check | error    | Corrupt                                                              |
            +---------------------+-------+----------+----------------------------------------------------------------------+
            2 rows in set (0.01 sec)
             
            -- "null alteration"
             
            MariaDB [test]> ALTER TABLE payment_method ENGINE = InnoDB;
            Query OK, 0 rows affected (0.09 sec)
            Records: 0  Duplicates: 0  Warnings: 0
             
            MariaDB [test]> check table payment_method;
            +---------------------+-------+----------+----------+
            | Table               | Op    | Msg_type | Msg_text |
            +---------------------+-------+----------+----------+
            | test.payment_method | check | status   | OK       |
            +---------------------+-------+----------+----------+
            1 row in set (0.00 sec)
             
            -- CHECK TABLE is happy, now we can use LEFT JOIN to find orphans
             
            SELECT
              payment_method.id, member_id
            FROM
              payment_method
            LEFT JOIN
              member ON member.id=payment_method.member_id
            WHERE
              payment_method.member_id IS NOT NULL AND member.id IS NULL;
            +----+-----------+
            | id | member_id |
            +----+-----------+
            |  3 |         1 |
            +----+-----------+
            1 row in set (0.00 sec)
             
            -- and delete them
             
            MariaDB [test]> delete from payment_method where id = 3;
            Query OK, 1 row affected (0.02 sec)
            

            oschonrock Oliver Schonrock added a comment - marko I had a quick try to see what other methods might detect the problem, so users can see if they have been affected. And then what options exist to fix broken DBs without dump/reload. CHECK TABLE reports the problem: MariaDB [test]> check table payment_method; + ---------------------+-------+----------+----------------------------------------------------------------------+ | Table | Op | Msg_type | Msg_text | + ---------------------+-------+----------+----------------------------------------------------------------------+ | test.payment_method | check | Warning | InnoDB: Index 'payment_method_FI_1' contains 0 entries, should be 1. | | test.payment_method | check | error | Corrupt | + ---------------------+-------+----------+----------------------------------------------------------------------+ 2 rows in set (0.00 sec) and obviously, so does mysqlcheck $ mysqlcheck -uroot -p test test .address OK test .member OK test .payment_method Warning : InnoDB: Index payment_method_FI_1 is marked as corrupted error : Corrupt For InnoDB, the only suggested option for repair is: If you need to rebuild an InnoDB table because a CHECK TABLE operation indicates that a table upgrade is required, use mysqldump to create a dump file and mysql to reload the file. What I did on our production DB (not wanting to take it down, dump & insert) is a I manually deleted the orphaned record by its PK (which is hard to know in a real dataset). Having done Some trial and error shows that this manual DELETE is easier if you do a "null alteration" first (which makes CHECK TABLE happy), then run the LEFT JOIN, which now works, and then you can easily clean up the orphans.   MariaDB [test]> check table payment_method; + ---------------------+-------+----------+----------------------------------------------------------------------+ | Table | Op | Msg_type | Msg_text | + ---------------------+-------+----------+----------------------------------------------------------------------+ | test.payment_method | check | Warning | InnoDB: Index 'payment_method_FI_1' contains 0 entries, should be 1. | | test.payment_method | check | error | Corrupt | + ---------------------+-------+----------+----------------------------------------------------------------------+ 2 rows in set (0.01 sec)   -- "null alteration"   MariaDB [test]> ALTER TABLE payment_method ENGINE = InnoDB; Query OK, 0 rows affected (0.09 sec) Records: 0 Duplicates: 0 Warnings: 0   MariaDB [test]> check table payment_method; + ---------------------+-------+----------+----------+ | Table | Op | Msg_type | Msg_text | + ---------------------+-------+----------+----------+ | test.payment_method | check | status | OK | + ---------------------+-------+----------+----------+ 1 row in set (0.00 sec)   -- CHECK TABLE is happy, now we can use LEFT JOIN to find orphans   SELECT payment_method.id, member_id FROM payment_method LEFT JOIN member ON member.id=payment_method.member_id WHERE payment_method.member_id IS NOT NULL AND member.id IS NULL ; + ----+-----------+ | id | member_id | + ----+-----------+ | 3 | 1 | + ----+-----------+ 1 row in set (0.00 sec)   -- and delete them   MariaDB [test]> delete from payment_method where id = 3; Query OK, 1 row affected (0.02 sec)

            marko Thank for fixing this so quickly. We have now rolled this out on production via 10.2.13. Works great!

            By the time we rolled it out we had dozens of broken FKs on our production DB which we cleaned up by technique I described above.

            I still find it astonishing that no-one since 10.2.4 (GA since > 2 years!) had come across this. I suspect there must be quite a few users out there with broken FKs without realising it (since the broken LEFT JOIN hides them).

            oschonrock Oliver Schonrock added a comment - marko Thank for fixing this so quickly. We have now rolled this out on production via 10.2.13. Works great! By the time we rolled it out we had dozens of broken FKs on our production DB which we cleaned up by technique I described above. I still find it astonishing that no-one since 10.2.4 (GA since > 2 years!) had come across this. I suspect there must be quite a few users out there with broken FKs without realising it (since the broken LEFT JOIN hides them).

            People

              marko Marko Mäkelä
              oschonrock Oliver Schonrock
              Votes:
              1 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.