Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-27459

SST works as expected on joiner node but donor node never leaves donor state.

Details

    Description

      Hi Team,

      Galera donor node left in "donor/desynced" after SST with mariabackup. Here I have attached the logs. Please check it.

      After SST the donor node is left in:

      MariaDB [(none)]> show status like 'wsrep_local_state%';
      +---------------------------+--------------------------------------+
      | Variable_name | Value |
      +---------------------------+--------------------------------------+
      | wsrep_local_state_uuid | abacdfcc-5c70-11ea-b34c-c2bcad908195 |
      | wsrep_local_state | 2 |
      | wsrep_local_state_comment | Donor/Desynced |
      +---------------------------+--------------------------------------+
      3 rows in set (0.001 sec)
      

      Attachments

        1. db-prod02.donor.txt
          20 kB
        2. db-prod04.joiner.txt
          87 kB
        3. node1.err
          19 kB
        4. node2.err
          38 kB

        Issue Links

          Activity

            ponsuresh.pandians Pon Suresh Pandian (Inactive) created issue -
            ralfbecker Ralf Becker added a comment -

            Happens for me (5 node geo-distributed Galera cluster running 10.5.13) too in roughly 50% of my SSTs.
            Killing/restarting the stuck donor brings it back to nor with an IST.

            Ralf

            ralfbecker Ralf Becker added a comment - Happens for me (5 node geo-distributed Galera cluster running 10.5.13) too in roughly 50% of my SSTs. Killing/restarting the stuck donor brings it back to nor with an IST. Ralf

            Same happens here on 10.5.12.
            Also using mariabackup.

            Could this get some priority? As the only fix is restarting which is annoying.

            dupondje Jean-Louis Dupond added a comment - Same happens here on 10.5.12. Also using mariabackup. Could this get some priority? As the only fix is restarting which is annoying.
            julien.fritsch Julien Fritsch made changes -
            Field Original Value New Value
            Assignee Ramesh Sivaraman [ JIRAUSER48189 ]
            ramesh Ramesh Sivaraman made changes -
            Attachment node1.err [ 61668 ]
            ramesh Ramesh Sivaraman made changes -
            Attachment node2.err [ 61669 ]
            ramesh Ramesh Sivaraman made changes -
            Assignee Ramesh Sivaraman [ JIRAUSER48189 ] Jan Lindström [ jplindst ]
            ramesh Ramesh Sivaraman made changes -
            Assignee Jan Lindström [ jplindst ] Seppo Jaakola [ seppo ]
            ramesh Ramesh Sivaraman made changes -
            Priority Major [ 3 ] Critical [ 2 ]
            ramesh Ramesh Sivaraman made changes -
            ramesh Ramesh Sivaraman made changes -
            ramesh Ramesh Sivaraman made changes -
            Fix Version/s 10.4 [ 22408 ]
            ramesh Ramesh Sivaraman made changes -
            Status Open [ 1 ] Confirmed [ 10101 ]
            julien.fritsch Julien Fritsch made changes -
            Status Confirmed [ 10101 ] Open [ 1 ]
            julien.fritsch Julien Fritsch made changes -
            Status Open [ 1 ] Needs Feedback [ 10501 ]

            julien.fritsch Based on Seppo's tests this does not look like regression so not a blocker for next release.

            jplindst Jan Lindström (Inactive) added a comment - julien.fritsch Based on Seppo's tests this does not look like regression so not a blocker for next release.
            julien.fritsch Julien Fritsch made changes -
            Status Needs Feedback [ 10501 ] Open [ 1 ]
            julien.fritsch Julien Fritsch made changes -
            Status Open [ 1 ] Confirmed [ 10101 ]
            seppo Seppo Jaakola made changes -
            Status Confirmed [ 10101 ] In Progress [ 3 ]
            seppo Seppo Jaakola made changes -
            Status In Progress [ 3 ] Stalled [ 10000 ]
            seppo Seppo Jaakola added a comment -

            This was analyzed to be a problem in galera side. A fix has been prepared to both galera 3 and 4 versions, and I cannot reproduce the issue with this test scenario anymore.

            seppo Seppo Jaakola added a comment - This was analyzed to be a problem in galera side. A fix has been prepared to both galera 3 and 4 versions, and I cannot reproduce the issue with this test scenario anymore.
            seppo Seppo Jaakola added a comment -

            fixes are merged in galera-bugs 3.x, 4.x and 4.ee HEAD, please confirm if customer's use case is fixed by this

            seppo Seppo Jaakola added a comment - fixes are merged in galera-bugs 3.x, 4.x and 4.ee HEAD, please confirm if customer's use case is fixed by this
            seppo Seppo Jaakola made changes -
            Assignee Seppo Jaakola [ seppo ] Jan Lindström [ jplindst ]
            Status Stalled [ 10000 ] In Review [ 10002 ]
            jplindst Jan Lindström (Inactive) made changes -
            Status In Review [ 10002 ] In Testing [ 10301 ]

            Should be fixed with Galera library 26.4.11 with commit 67341d07

            jplindst Jan Lindström (Inactive) added a comment - Should be fixed with Galera library 26.4.11 with commit 67341d07
            jplindst Jan Lindström (Inactive) made changes -
            Assignee Jan Lindström [ jplindst ] Ramesh Sivaraman [ JIRAUSER48189 ]

            jplindst bug fix looks good. Donor wsrep state changes correctly in the given test case and in normal SST.

            ramesh Ramesh Sivaraman added a comment - jplindst bug fix looks good. Donor wsrep state changes correctly in the given test case and in normal SST.
            ramesh Ramesh Sivaraman made changes -
            Assignee Ramesh Sivaraman [ JIRAUSER48189 ] Jan Lindström [ jplindst ]
            ramesh Ramesh Sivaraman made changes -
            Status In Testing [ 10301 ] Stalled [ 10000 ]

            ponsuresh.pandians Can customer test with 26.4.11 Galera library ?

            jplindst Jan Lindström (Inactive) added a comment - ponsuresh.pandians Can customer test with 26.4.11 Galera library ?
            jplindst Jan Lindström (Inactive) made changes -
            Labels need_feedback
            julien.fritsch Julien Fritsch made changes -
            Status Stalled [ 10000 ] Open [ 1 ]
            julien.fritsch Julien Fritsch made changes -
            Status Open [ 1 ] Needs Feedback [ 10501 ]
            julien.fritsch Julien Fritsch made changes -
            Labels need_feedback

            Please provide download link for galera-26.4.11 so I can test this

            Newest version I can find on archive.mariadb.org is 26.4.6 as it is packaged even in the newer tar files.

            ➜  Downloads mdmd5 galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so mariadb-10.4.21-linux-systemd-x86_64/lib/galera/libgalera_smm.so galera-26.4.6-systemd-x86_64/usr/lib/galera/libgalera_smm.so
            MD5 (galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so) = a04f49ca276e50bb0a48e7decfeebcff
            MD5 (mariadb-10.4.21-linux-systemd-x86_64/lib/galera/libgalera_smm.so) = a04f49ca276e50bb0a48e7decfeebcff
            MD5 (galera-26.4.6-systemd-x86_64/usr/lib/galera/libgalera_smm.so) = a04f49ca276e50bb0a48e7decfeebcff
            

            Some of your build rutines may need a checkup.

            lmk@netic.dk Lars Mikkelsen added a comment - Please provide download link for galera-26.4.11 so I can test this Newest version I can find on archive.mariadb.org is 26.4.6 as it is packaged even in the newer tar files. ➜ Downloads mdmd5 galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so mariadb-10.4.21-linux-systemd-x86_64/lib/galera/libgalera_smm.so galera-26.4.6-systemd-x86_64/usr/lib/galera/libgalera_smm.so MD5 (galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so) = a04f49ca276e50bb0a48e7decfeebcff MD5 (mariadb-10.4.21-linux-systemd-x86_64/lib/galera/libgalera_smm.so) = a04f49ca276e50bb0a48e7decfeebcff MD5 (galera-26.4.6-systemd-x86_64/usr/lib/galera/libgalera_smm.so) = a04f49ca276e50bb0a48e7decfeebcff Some of your build rutines may need a checkup.

            And in status on a 10.4.21 server its also the reported version:

            MariaDB [(none)]> show status like 'wsrep_provider_ver%';
            +------------------------+------------------+
            | Variable_name          | Value            |
            +------------------------+------------------+
            | wsrep_provider_version | 26.4.6(r1d8d67c) |
            +------------------------+------------------+
            1 row in set (0.000 sec)
            

            lmk@netic.dk Lars Mikkelsen added a comment - And in status on a 10.4.21 server its also the reported version: MariaDB [(none)]> show status like 'wsrep_provider_ver%'; +------------------------+------------------+ | Variable_name | Value | +------------------------+------------------+ | wsrep_provider_version | 26.4.6(r1d8d67c) | +------------------------+------------------+ 1 row in set (0.000 sec)
            valerii Valerii Kravchuk made changes -
            Affects Version/s 10.5.12 [ 26025 ]
            julien.fritsch Julien Fritsch made changes -
            Status Needs Feedback [ 10501 ] Open [ 1 ]

            ponsuresh.pandians You can compile newest Galera library from sources if you need it now. If not we will release MariaDB Community Server soon with new Galera library. This link https://dlm.mariadb.com/browse/mariadb_server/76/1194/ seems to contain 26.4.9. I do not know why your link shows so old library.

            jplindst Jan Lindström (Inactive) added a comment - ponsuresh.pandians You can compile newest Galera library from sources if you need it now. If not we will release MariaDB Community Server soon with new Galera library. This link https://dlm.mariadb.com/browse/mariadb_server/76/1194/ seems to contain 26.4.9. I do not know why your link shows so old library.

            It actually dont. Package names are correct and says 26.4.9 but once you unpack you will see the version is only 26.4.6 and from october 2020.

            This is all ready reported to and confirmed by your support.

            lmk@netic.dk Lars Mikkelsen added a comment - It actually dont. Package names are correct and says 26.4.9 but once you unpack you will see the version is only 26.4.6 and from october 2020. This is all ready reported to and confirmed by your support.

            lmk@netic.dk Thank you for your report. I opened https://jira.mariadb.org/browse/TODO-3320 to fix this.

            jplindst Jan Lindström (Inactive) added a comment - lmk@netic.dk Thank you for your report. I opened https://jira.mariadb.org/browse/TODO-3320 to fix this.
            jplindst Jan Lindström (Inactive) added a comment - Please try following: https://archive.mariadb.org/mariadb-10.4.22/galera-26.4.9/

            Thats also an old version from october 2020

            ➜  galera wget https://archive.mariadb.org/mariadb-10.4.22/galera-26.4.9/bintar/galera-26.4.9-systemd-x86_64.tar.gz
            --2022-01-26 08:16:35--  https://archive.mariadb.org/mariadb-10.4.22/galera-26.4.9/bintar/galera-26.4.9-systemd-x86_64.tar.gz
            Resolving archive.mariadb.org (archive.mariadb.org)... 2a01:4f8:c17:cad6::1, 138.201.152.105
            Connecting to archive.mariadb.org (archive.mariadb.org)|2a01:4f8:c17:cad6::1|:443... connected.
            HTTP request sent, awaiting response... 200 OK
            Length: 20394498 (19M) [application/octet-stream]
            Saving to: ‘galera-26.4.9-systemd-x86_64.tar.gz’
             
            galera-26.4.9-systemd-x86_64.tar.gz                        100%[=======================================================================================================================================>]  19,45M  4,71MB/s    in 5,5s
             
            2022-01-26 08:16:41 (3,52 MB/s) - ‘galera-26.4.9-systemd-x86_64.tar.gz’ saved [20394498/20394498]
             
            ➜  galera tar zxvf galera-26.4.9-systemd-x86_64.tar.gz
            x galera-26.4.9-systemd-x86_64/
            x galera-26.4.9-systemd-x86_64/usr/
            x galera-26.4.9-systemd-x86_64/usr/lib/
            x galera-26.4.9-systemd-x86_64/usr/lib/galera/
            x galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so
            x galera-26.4.9-systemd-x86_64/usr/lib/libgalera_smm.so
            ...   
            x galera-26.4.9-systemd-x86_64/etc/
            x galera-26.4.9-systemd-x86_64/etc/default/
            x galera-26.4.9-systemd-x86_64/etc/default/garb
            x galera-26.4.9-systemd-x86_64/etc/init.d/
            x galera-26.4.9-systemd-x86_64/etc/init.d/garb
            ➜  galera ls -l galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so
            -rw-r--r--  1 lmk  staff  40221093 Oct 22  2020 galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so
            

            lmk@netic.dk Lars Mikkelsen added a comment - Thats also an old version from october 2020 âžœ galera wget https://archive.mariadb.org/mariadb-10.4.22/galera-26.4.9/bintar/galera-26.4.9-systemd-x86_64.tar.gz --2022-01-26 08:16:35-- https://archive.mariadb.org/mariadb-10.4.22/galera-26.4.9/bintar/galera-26.4.9-systemd-x86_64.tar.gz Resolving archive.mariadb.org (archive.mariadb.org)... 2a01:4f8:c17:cad6::1, 138.201.152.105 Connecting to archive.mariadb.org (archive.mariadb.org)|2a01:4f8:c17:cad6::1|:443... connected. HTTP request sent, awaiting response... 200 OK Length: 20394498 (19M) [application/octet-stream] Saving to: ‘galera-26.4.9-systemd-x86_64.tar.gz’   galera-26.4.9-systemd-x86_64.tar.gz 100%[=======================================================================================================================================>] 19,45M 4,71MB/s in 5,5s   2022-01-26 08:16:41 (3,52 MB/s) - ‘galera-26.4.9-systemd-x86_64.tar.gz’ saved [20394498/20394498]   âžœ galera tar zxvf galera-26.4.9-systemd-x86_64.tar.gz x galera-26.4.9-systemd-x86_64/ x galera-26.4.9-systemd-x86_64/usr/ x galera-26.4.9-systemd-x86_64/usr/lib/ x galera-26.4.9-systemd-x86_64/usr/lib/galera/ x galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so x galera-26.4.9-systemd-x86_64/usr/lib/libgalera_smm.so ... x galera-26.4.9-systemd-x86_64/etc/ x galera-26.4.9-systemd-x86_64/etc/default/ x galera-26.4.9-systemd-x86_64/etc/default/garb x galera-26.4.9-systemd-x86_64/etc/init.d/ x galera-26.4.9-systemd-x86_64/etc/init.d/garb âžœ galera ls -l galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so -rw-r--r-- 1 lmk staff 40221093 Oct 22 2020 galera-26.4.9-systemd-x86_64/usr/lib/galera/libgalera_smm.so

            Debian seems to be fine but bintar not.

            jplindst Jan Lindström (Inactive) added a comment - Debian seems to be fine but bintar not.
            jplindst Jan Lindström (Inactive) made changes -
            Fix Version/s 10.4.23 [ 26807 ]
            Fix Version/s 10.5.14 [ 26809 ]
            Fix Version/s 10.6.6 [ 26811 ]
            Fix Version/s 10.7.2 [ 26813 ]
            Fix Version/s 10.4 [ 22408 ]
            Resolution Fixed [ 1 ]
            Status Open [ 1 ] Closed [ 6 ]

            Fixed on Galera library 26.4.11 commit 9561a159c

            jplindst Jan Lindström (Inactive) added a comment - Fixed on Galera library 26.4.11 commit 9561a159c
            pramod.mahto@mariadb.com Pramod Mahto made changes -
            mariadb-jira-automation Jira Automation (IT) made changes -
            Zendesk Related Tickets 108294 158663 191328

            People

              jplindst Jan Lindström (Inactive)
              ponsuresh.pandians Pon Suresh Pandian (Inactive)
              Votes:
              5 Vote for this issue
              Watchers:
              17 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.