[MDEV-21982] despite the use of new Streaming Replication , cluster crash on large data operations - Jira

Details

Type: Bug
Status: Open (View Workflow)
Priority: Major
Resolution: Unresolved
Affects Version/s: 10.4.12
Fix Version/s: 10.4(EOL)
Component/s: Galera
Labels:
- crash
- need_verification
Environment:
Galera cluster, 3 nodes on debian 9, provider version 26.4.3(r4535)

Description

Hello,
For several months we have a problem with large (max 250 000) data operation ( delete + load data ) randomly . the entire cluster crash ( no more queries works, the cluster is like "locked" ) --> bootstrap a node etc...
This occurs during the journey, others transactions still arrive from normal traffic on our websites, no problem with large date operations in the night .

We do these large operations for several yearq without problems, don't konw if the problem can be associated with wsrep_load_data_splitting is off by default since several maria versions ?
So, for 1 month we use the new "streaming replication " feature with galera 4
SET SESSION wsrep_trx_fragment_unit='rows'
SET SESSION wsrep_trx_fragment_size=10000 ;

We active it in our program that do the large date operation

The replication streaming seems to be used correctly because the table wsrep_streaming_log growing each days (the space , not in rows because rows are deleted automaticaly )

Despite of the use of streaming replication nothing has changed, today a table of 245 000 records is sucessfully passed and 3 min after the entire cluster has crashed.

How to solve this problem that occurs randomly ( sometimes 3 times on a week , or sometimes 1 time by month .

Thank you

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

screenshot-1.png
111 kB
2020-07-27 13:36
my.cnf
9 kB
2020-04-14 08:30

Activity

Ascending order - Click to sort in descending order

View 5 older comments

Jan Lindström (Inactive) added a comment - 2020-07-27 13:23

Hi, if you can share your infile and table definition you use on load data and cluster config files, it could help us to find the reason.

Jan Lindström (Inactive) added a comment - 2020-07-27 13:23 Hi, if you can share your infile and table definition you use on load data and cluster config files, it could help us to find the reason.

Levieux stéphane added a comment - 2020-07-27 13:39 - edited

Of course

Here the table definition :

here my config file :

http://dev.mysql.com/doc/mysql/en/server-system-variables.html

This will be passed to all mysql clients
It has been reported that passwords should be enclosed with ticks/quotes
escpecially if they contain "#" chars...
Remember to edit /etc/mysql/debian.cnf when changing the socket location.
[client]
port = 3306
socket = /var/run/mysqld/mysqld.sock

Here is entries for some specific programs
The following values assume you have at least 32M ram

This was formally known as [safe_mysqld]. Both versions are currently parsed.
[mysqld_safe]
socket = /var/run/mysqld/mysqld.sock
nice = 0

[mysqld]
#

* Basic Settings
#
sql_mode = 'ERROR_FOR_DIVISION_BY_ZERO,NO_AUTO_CREATE_USER,NO_ENGINE_SUBSTITUTION'

user = mysql
pid-file = /var/run/mysqld/mysqld.pid
socket = /var/run/mysqld/mysqld.sock
port = 3306
basedir = /usr
datadir = /var/lib/mysql
tmpdir = /tmp
lc_messages_dir = /usr/share/mysql
lc_messages = en_US
skip-external-locking
#

Instead of skip-networking the default is now to listen only on
localhost which is more compatible and is not less secure.
bind-address = 127.0.0.1
#
* Fine Tuning
#
thread_concurrency = 8
thread_cache_size = 128

performance_schema=on
skip-name-resolve
max_connections = 450
connect_timeout = 5
wait_timeout = 180
max_allowed_packet = 256M
thread_cache_size = 128
sort_buffer_size = 4M
bulk_insert_buffer_size = 16M
tmp_table_size = 32M
max_heap_table_size = 32M
key_buffer_size = 64K
#

* MyISAM
#
This replaces the startup script and checks MyISAM tables if needed
the first time they are touched. On error, make copy and try a repair.
myisam_recover_options = BACKUP
open-files-limit = 2000
table_open_cache = 4096
concurrent_insert = 2
#
* Query Cache Configuration
#
Cache only tiny result sets, so we can fit more in the query cache.
query_cache_limit = 128K
query_cache_size = 0
query_cache_type = 0
for more write intensive setups, set to DEMAND or OFF
#
* Logging and Replication
#
Both location gets rotated by the cronjob.
Be aware that this log type is a performance killer.
As of 5.1 you can enable the log at runtime!
general_log_file = /var/log/mysql/mysql.log
general_log = 1
#
Error logging goes to syslog due to /etc/mysql/conf.d/mysqld_safe_syslog.cnf.
#
we do want to know about network errors and such
log_warnings = 2
#
Enable the slow query log to see queries with especially long duration
#slow_query_log[= {0|1}
]
slow_query_log_file = /var/log/mysql/mariadb-slow.log
long_query_time = 2
log_slow_rate_limit = 1000
log_slow_verbosity = query_plan
log_slow_admin_statements=ON
log_queries_not_using_indexes=OFF
log_throttle_queries_not_using_indexes=1
default_storage_engine = InnoDB
you can't just change log file size, requires special procedure
innodb_log_file_size = 50M
innodb_buffer_pool_size = 10G
innodb_file_per_table = 1
#innodb_io_capacity = 400
innodb_flush_method = O_DIRECT

InnoDB
default-storage-engine = 'InnoDB'
innodb-stats-on-metadata = 0
innodb-stats-sample-pages = 32

table-definition-cache = 2000
table-open-cache = 2048

transaction-isolation = READ-COMMITTED
# To be set to 0 if not multi transactional storage engine
innodb-buffer-pool-instances = 8
innodb_data_file_path = ibdata1:2000M;ibdata2:10M:autoextend
#
# – Set the following to maximum 60/70% of physical RAM.
# innodb_max_dirty_pages_pct should be compute with redo log size in mind: It’s recommended to set the dirty pages smaller than redo log space. The more you have dirty page, the less io will be produced on disk.
innodb-max-dirty-pages-pct = 50
innodb-file-per-table = 1
#Save and restore buffer pool to be transparent for user
# innodb_flush_method = O_DIRECT
innodb_log_file_size = 1G
innodb_flush_log_at_trx_commit = 2
innodb_lock_wait_timeout = 50
innodb-log-buffer-size = 128M
innodb-log-file-size = 1024M
innodb_purge_threads = 1
– Raid1: 200. Raid10: 200 x write arrays. SSD: 5000. FusionIO: 20000.
innodb_io_capacity = 200
– SSD & FusionIO can flush pages in random disk order, keep default for spinning disks
innodb_flush_neighbors = 0
– Increase these numbers to increase global throughput on SAN (16,32,64)
innodb-read-io-threads = 8
innodb-write-io-threads = 8
– Set this to 2-4 times # of Cpus, maximum recommended is 64.
– iostat report io service time: io_capacity / service_time is a good starting point for innodb_write_io_threads
– innodb_read_io_threads starting point monitoring status # read ahead per second
innodb-thread-concurrency = 8
#
innodb-open-files = 2048
#

#

* Security Features
#
Read the manual, too, if you want chroot!
chroot = /var/lib/mysql/
#
For generating SSL certificates I recommend the OpenSSL GUI "tinyca".
#
ssl-ca = /etc/mysql/cacert.pem
ssl-cert = /etc/mysql/server-cert.pem
ssl-key = /etc/mysql/server-key.pem

#

* Galera-related settings
#

log_error=/var/log/mysql/mariadb-error.log

[galera]
wsrep_gtid_mode = on
wsrep_gtid_domain_id = 0
log_slave_updates = on
log-bin = /var/log/mysql/binlog
log-bin-index = /var/log/mysql/binlog.index
gtid_domain_id = 1
binlog_format=ROW
expire-logs-days = 3
max-binlog-size = 1024M
sync-binlog = 0
binlog-stmt-cache-size = 128K
binlog-cache-size = 256K
#

default-storage-engine=innodb
innodb_autoinc_lock_mode=2
wsrep_on=ON
wsrep_cluster_name='mariadb_cluster'
wsrep_node_name=node4
wsrep_node_address=10.253.253.93
wsrep_provider = /usr/lib/galera/libgalera_smm.so
wsrep_provider_options = "gcache.size = 5G; gcache.name = /var/lib/mysql/galera.cache;"

# wsrep_sst_method = rsync
wsrep_sst_method = mariabackup
wsrep_sst_auth = root:extra27frz
wsrep_slave_threads=32
wsrep_certify_nonPK=1
wsrep_max_ws_rows=0
wsrep_max_ws_size=1073741824
wsrep_debug=0
wsrep_log_conflicts = 1
wsrep_retry_autocommit = 1
wsrep_cluster_address=gcomm://10.253.253.93,10.253.253.94,10.253.253.95
wsrep_cluster_address = gcomm://
wsrep_forced_binlog_format=ROW
enforce_storage_engine = "InnoDB"
#
server-id = 01

[mysqldump]
quick
quote-names
max_allowed_packet = 16M
[mysql]
#no-auto-rehash # faster start of mysql but no tab completion

[isamchk]
key_buffer = 16M

#

* IMPORTANT: Additional settings that can override those from this file!
The files must end with '.cnf', otherwise they'll be ignored.
#
!include /etc/mysql/mariadb.cnf
!includedir /etc/mysql/conf.d/
!include /etc/mysql/secrets-backup.cnf

[xtrabackup]
databases-exclude=lost+found
ssl=0

Levieux stéphane added a comment - 2020-07-27 13:39 - edited Of course Here the table definition : here my config file : http://dev.mysql.com/doc/mysql/en/server-system-variables.html This will be passed to all mysql clients It has been reported that passwords should be enclosed with ticks/quotes escpecially if they contain "#" chars... Remember to edit /etc/mysql/debian.cnf when changing the socket location. [client] port = 3306 socket = /var/run/mysqld/mysqld.sock Here is entries for some specific programs The following values assume you have at least 32M ram This was formally known as [safe_mysqld] . Both versions are currently parsed. [mysqld_safe] socket = /var/run/mysqld/mysqld.sock nice = 0 [mysqld] # * Basic Settings # sql_mode = 'ERROR_FOR_DIVISION_BY_ZERO,NO_AUTO_CREATE_USER,NO_ENGINE_SUBSTITUTION' user = mysql pid-file = /var/run/mysqld/mysqld.pid socket = /var/run/mysqld/mysqld.sock port = 3306 basedir = /usr datadir = /var/lib/mysql tmpdir = /tmp lc_messages_dir = /usr/share/mysql lc_messages = en_US skip-external-locking # Instead of skip-networking the default is now to listen only on localhost which is more compatible and is not less secure. bind-address = 127.0.0.1 # * Fine Tuning # thread_concurrency = 8 thread_cache_size = 128 performance_schema=on skip-name-resolve max_connections = 450 connect_timeout = 5 wait_timeout = 180 max_allowed_packet = 256M thread_cache_size = 128 sort_buffer_size = 4M bulk_insert_buffer_size = 16M tmp_table_size = 32M max_heap_table_size = 32M key_buffer_size = 64K # * MyISAM # This replaces the startup script and checks MyISAM tables if needed the first time they are touched. On error, make copy and try a repair. myisam_recover_options = BACKUP open-files-limit = 2000 table_open_cache = 4096 concurrent_insert = 2 # * Query Cache Configuration # Cache only tiny result sets, so we can fit more in the query cache. query_cache_limit = 128K query_cache_size = 0 query_cache_type = 0 for more write intensive setups, set to DEMAND or OFF # * Logging and Replication # Both location gets rotated by the cronjob. Be aware that this log type is a performance killer. As of 5.1 you can enable the log at runtime! general_log_file = /var/log/mysql/mysql.log general_log = 1 # Error logging goes to syslog due to /etc/mysql/conf.d/mysqld_safe_syslog.cnf. # we do want to know about network errors and such log_warnings = 2 # Enable the slow query log to see queries with especially long duration #slow_query_log[= {0|1} ] slow_query_log_file = /var/log/mysql/mariadb-slow.log long_query_time = 2 log_slow_rate_limit = 1000 log_slow_verbosity = query_plan log_slow_admin_statements=ON log_queries_not_using_indexes=OFF log_throttle_queries_not_using_indexes=1 default_storage_engine = InnoDB you can't just change log file size, requires special procedure innodb_log_file_size = 50M innodb_buffer_pool_size = 10G innodb_file_per_table = 1 #innodb_io_capacity = 400 innodb_flush_method = O_DIRECT InnoDB default-storage-engine = 'InnoDB' innodb-stats-on-metadata = 0 innodb-stats-sample-pages = 32 table-definition-cache = 2000 table-open-cache = 2048 transaction-isolation = READ-COMMITTED # To be set to 0 if not multi transactional storage engine innodb-buffer-pool-instances = 8 innodb_data_file_path = ibdata1:2000M;ibdata2:10M:autoextend # # – Set the following to maximum 60/70% of physical RAM. # innodb_max_dirty_pages_pct should be compute with redo log size in mind: It’s recommended to set the dirty pages smaller than redo log space. The more you have dirty page, the less io will be produced on disk. innodb-max-dirty-pages-pct = 50 innodb-file-per-table = 1 #Save and restore buffer pool to be transparent for user # innodb_flush_method = O_DIRECT innodb_log_file_size = 1G innodb_flush_log_at_trx_commit = 2 innodb_lock_wait_timeout = 50 innodb-log-buffer-size = 128M innodb-log-file-size = 1024M innodb_purge_threads = 1 – Raid1: 200. Raid10: 200 x write arrays. SSD: 5000. FusionIO: 20000. innodb_io_capacity = 200 – SSD & FusionIO can flush pages in random disk order, keep default for spinning disks innodb_flush_neighbors = 0 – Increase these numbers to increase global throughput on SAN (16,32,64) innodb-read-io-threads = 8 innodb-write-io-threads = 8 – Set this to 2-4 times # of Cpus, maximum recommended is 64. – iostat report io service time: io_capacity / service_time is a good starting point for innodb_write_io_threads – innodb_read_io_threads starting point monitoring status # read ahead per second innodb-thread-concurrency = 8 # innodb-open-files = 2048 # # * Security Features # Read the manual, too, if you want chroot! chroot = /var/lib/mysql/ # For generating SSL certificates I recommend the OpenSSL GUI "tinyca". # ssl-ca = /etc/mysql/cacert.pem ssl-cert = /etc/mysql/server-cert.pem ssl-key = /etc/mysql/server-key.pem # * Galera-related settings # log_error=/var/log/mysql/mariadb-error.log [galera] wsrep_gtid_mode = on wsrep_gtid_domain_id = 0 log_slave_updates = on log-bin = /var/log/mysql/binlog log-bin-index = /var/log/mysql/binlog.index gtid_domain_id = 1 binlog_format=ROW expire-logs-days = 3 max-binlog-size = 1024M sync-binlog = 0 binlog-stmt-cache-size = 128K binlog-cache-size = 256K # default-storage-engine=innodb innodb_autoinc_lock_mode=2 wsrep_on=ON wsrep_cluster_name='mariadb_cluster' wsrep_node_name=node4 wsrep_node_address=10.253.253.93 wsrep_provider = /usr/lib/galera/libgalera_smm.so wsrep_provider_options = "gcache.size = 5G; gcache.name = /var/lib/mysql/galera.cache;" # wsrep_sst_method = rsync wsrep_sst_method = mariabackup wsrep_sst_auth = root:extra27frz wsrep_slave_threads=32 wsrep_certify_nonPK=1 wsrep_max_ws_rows=0 wsrep_max_ws_size=1073741824 wsrep_debug=0 wsrep_log_conflicts = 1 wsrep_retry_autocommit = 1 wsrep_cluster_address=gcomm://10.253.253.93,10.253.253.94,10.253.253.95 wsrep_cluster_address = gcomm:// wsrep_forced_binlog_format=ROW enforce_storage_engine = "InnoDB" # server-id = 01 [mysqldump] quick quote-names max_allowed_packet = 16M [mysql] #no-auto-rehash # faster start of mysql but no tab completion [isamchk] key_buffer = 16M # * IMPORTANT: Additional settings that can override those from this file! The files must end with '.cnf', otherwise they'll be ignored. # !include /etc/mysql/mariadb.cnf !includedir /etc/mysql/conf.d/ !include /etc/mysql/secrets-backup.cnf [xtrabackup] databases-exclude=lost+found ssl=0

Levieux stéphane added a comment - 2020-07-28 13:38

This morning i done an interesting test :
i run the transfer on the duplicate table (has 70 000 records from yesterday ) , no problem
30 minutes later, i run the same transfert on the production table (3M records) , the cluster has frozen ... ad need to bootstrap

so it seems ( need to be confirmed ) that it's not here the data volume imported the problem but the existings data and make an import on this table

Ps : we have another import on a 1M table without problem .

Levieux stéphane added a comment - 2020-07-28 13:38 This morning i done an interesting test : i run the transfer on the duplicate table (has 70 000 records from yesterday ) , no problem 30 minutes later, i run the same transfert on the production table (3M records) , the cluster has frozen ... ad need to bootstrap so it seems ( need to be confirmed ) that it's not here the data volume imported the problem but the existings data and make an import on this table Ps : we have another import on a 1M table without problem .

Levieux stéphane added a comment - 2020-08-31 06:51

News : Just after my post, i partitionned the table and since no more crash of our cluster mariaDB . So the problem was more the volume of existing data than the volume of data transfered ( the combinaison of both surely )

Levieux stéphane added a comment - 2020-08-31 06:51 News : Just after my post, i partitionned the table and since no more crash of our cluster mariaDB . So the problem was more the volume of existing data than the volume of data transfered ( the combinaison of both surely )

Ramesh Sivaraman added a comment - 2024-03-05 06:41 - edited

Could not reproduce the issue locally with the latest 10.4. Tested with LOAD DATA INFILE.. and sysbench data load

Ramesh Sivaraman added a comment - 2024-03-05 06:41 - edited Could not reproduce the issue locally with the latest 10.4. Tested with LOAD DATA INFILE.. and sysbench data load

MariaDB Server

despite the use of new Streaming Replication , cluster crash on large data operations

Details

Description

Attachments

Attachments

Activity

People

Dates

Git Integration