  1. MariaDB Server
  2. MDEV-20505

Server crash on startup because of bad wsrep configuration

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 10.4.6, 10.4.7
    • Fix Version/s: 10.4.9
    • Component/s: Galera
    • Labels: None
    • Environment: Fedora

    Description

      Hello,
      some behaviour changed between 10.3 and 10.4 (and also between 10.4.5 and 10.4.6, if I recall correctly), which leads to a server crash (SIGSEGV) on startup.

      In 10.3, the server starts correctly and prints some lines to the systemd journal saying that wsrep is not properly configured, but the server runs fine.
      That is how the server has always behaved, and how I would like it to keep behaving.

      MariaDB 10.3.17, journal entries:

      Sep 05 09:28:42 host-0-0-0-0 mysql-prepare-db-dir[20455]: 2019-09-05  9:28:42 0 [ERROR] WSREP: rsync SST method requires wsrep_cluster_address to be configured on startup.
      Sep 05 09:28:45 host-0-0-0-0 mysqld[20555]: 2019-09-05  9:28:45 0 [ERROR] WSREP: rsync SST method requires wsrep_cluster_address to be configured on startup.
      


      I used the exact same configuration for both 10.3 and 10.4:
      (Default Fedora Configuration)

      # /usr/libexec/mysqld --print-defaults
      /usr/libexec/mysqld would have been started with the following arguments:
      --binlog_format=ROW --default-storage-engine=innodb --innodb_autoinc_lock_mode=2 --bind-address=0.0.0.0 --wsrep_on=1 --wsrep_provider=/usr/lib64/galera/libgalera_smm.so --wsrep_cluster_name=my_wsrep_cluster --wsrep_slave_threads=1 --wsrep_certify_nonPK=1 --wsrep_max_ws_rows=0 --wsrep_max_ws_size=2147483647 --wsrep_debug=0 --wsrep_convert_LOCK_to_trx=0 --wsrep_retry_autocommit=1 --wsrep_auto_increment_control=1 --wsrep_drupal_282555_workaround=0 --wsrep_causal_reads=0 --wsrep_notify_cmd= --wsrep_sst_method=rsync --wsrep_sst_auth=root: --datadir=/var/lib/mysql --socket=/var/lib/mysql/mysql.sock --log-error=/var/log/mariadb/mariadb.log --pid-file=/run/mariadb/mariadb.pid 
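
The combination that triggers the journal message (wsrep_on=1, wsrep_sst_method=rsync, no wsrep_cluster_address) can be spotted directly in the `--print-defaults` output. The following is a minimal sketch, not the server's own validation logic: the function names `parse_defaults` and `wsrep_config_problems` are hypothetical, and the check only mirrors the single condition named in the error message above.

```python
import shlex


def parse_defaults(arg_line):
    """Turn a `mysqld --print-defaults` argument line into {option: value}."""
    options = {}
    for token in shlex.split(arg_line):
        if not token.startswith("--"):
            continue
        name, _, value = token[2:].partition("=")
        # Option names may be written with '-' or '_'; normalise to '_'.
        options[name.replace("-", "_")] = value
    return options


def wsrep_config_problems(options):
    """Return a list of findings; an empty list means nothing was flagged.

    Assumption: only the case from this report is checked, i.e. wsrep
    enabled with the rsync SST method but no cluster address.
    """
    problems = []
    wsrep_on = options.get("wsrep_on", "0").lower() in ("1", "on", "true")
    if not wsrep_on:
        return problems
    uses_rsync = options.get("wsrep_sst_method", "") == "rsync"
    if uses_rsync and not options.get("wsrep_cluster_address"):
        problems.append(
            "wsrep_on is enabled with wsrep_sst_method=rsync, "
            "but wsrep_cluster_address is not set"
        )
    return problems
```

Feeding it the argument line shown above should yield one finding, while the same line with wsrep_on=0 or with a wsrep_cluster_address value should yield none.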
      


      10.4.7, however, crashes before it can even write a log.
      Systemd journal entries:

      2019-09-05  8:51:57 0 [ERROR] WSREP: rsync SST method requires wsrep_cluster_address to be configured on startup.
      2019-09-05  8:51:57 0 [ERROR] Aborting
      

      Info from gdb:

      Program received signal SIGSEGV, Segmentation fault.
      0x0000555555bfd5b2 in wsrep::server_state::state (this=0x0) at /usr/src/debug/mariadb-10.4.7-1.debug.000.fc30.x86_64/wsrep-lib/include/wsrep/server_state.hpp:524
      handle_fatal_signal (sig=65536) at /usr/src/debug/mariadb-10.4.7-1.debug.000.fc30.x86_64/sql/signal_handler.cc:103
      

      Attached is a compressed coredump (23 MB extracted), generated with: "coredumpctl -1 dump --output /tmp/10.4.7.coredump".

      The crash is 100% reproducible, so I can provide more info, if you specify what you want to know.
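
When reproducing this, it helps to tell a clean abort (non-zero exit status) apart from a crash on a signal. The following is a minimal sketch, assuming a POSIX system where Python's `subprocess` reports a child killed by signal N as return code -N; the helper name `describe_exit` is hypothetical.

```python
import signal


def describe_exit(returncode):
    """Describe how a child process ended, given its subprocess return code."""
    if returncode == 0:
        return "exited cleanly"
    if returncode < 0:
        # On POSIX, a negative return code -N means "terminated by signal N".
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            name = f"signal {-returncode}"
        return f"killed by {name}"
    return f"exited with status {returncode}"
```

It could be applied to the `returncode` of `subprocess.run(["/usr/libexec/mysqld"])` when the server is started by hand rather than through systemd.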

      EDIT:
      It is probably worth adding that when the server is properly configured (either with "wsrep_on=0" or with correctly configured replication), it works fine.

          People

            jplindst Jan Lindström (Inactive)
            mschorm Michal Schorm