[MDEV-15261] WSREP: Failed to report last committed Created: 2018-02-09 Updated: 2019-12-12 Resolved: 2019-12-12 |
|
| Status: | Closed |
| Project: | MariaDB Server |
| Component/s: | Galera |
| Affects Version/s: | 10.1.18, 10.2.14 |
| Fix Version/s: | N/A |
| Type: | Bug | Priority: | Major |
| Reporter: | Rodney Antonio Ramos | Assignee: | Jan Lindström (Inactive) |
| Resolution: | Not a Bug | Votes: | 1 |
| Labels: | None | ||
| Environment: |
3 nodes running mysql Ver 15.1 Distrib 10.1.18-MariaDB, for Linux (x86_64) using readline 5.1.
1 node is the master and the others are slave.
[mysqld] wsrep_sst_auth=root:xxxxxxx innodb_autoextend_increment = 256 optimizer_switch = 'index_condition_pushdown=off' large-pages |
||
| Issue Links: |
|
||||||||
| Description |
|
Every node in the cluster is logging warnings similar to this: [Warning] WSREP: Failed to report last committed 211651504, -4 (Interrupted system call) At this point, one node starts the flow-control mechanism and the my application stops. I can see, running the myq_status too, that the "Queue down" on one node achieve the fc_limit and starts the flow control. I must stop the MariaDB on this node to application becomes up again. I have one master node e two slave nodes. I could see this behavior on the slave nodes only. I couln´t find why one node is starting the flow control. On the /var/log/messages I can see only the "WSREP: Failed to report last committed" messages. Nothing more. Any suggestion? |
| Comments |
| Comment by Zdravelina Sokolovska (Inactive) [ 2018-02-16 ] |
|
Hello rodneyra , are you receiving the same Warning messages with v10.1.30 ? |
| Comment by Rodney Antonio Ramos [ 2018-02-16 ] |
|
Hello winstone! I´m using the v10.1.18. The my.cnf is in the "Environment" header. I´m trying to upgrade to v10.2.13 in my test environment first, but I´m having some difficulties. Do you think that it should be a good idea to upgrade to 10.1.31 first? My production environment is very big, almost 750 GB of data, and I must be very careful to make any change. Thanks! |
| Comment by Rodney Antonio Ramos [ 2018-02-20 ] |
|
Hello winstone! I´m planning to upgrade my MariaDB to 10.1.31 release on March 3rd. Do you think it a good idea or should I upgrade to 10.2.13 release already? Thnaks! |
| Comment by Zdravelina Sokolovska (Inactive) [ 2018-02-20 ] |
|
hello rodneyra , it'd better to wait with upgrade the next version due to a current problem on both 10.1.31 and 10.2.13 |
| Comment by Rodney Antonio Ramos [ 2018-02-21 ] |
|
Sorry, winstone. I didn´t understand. Should I upgrade to 10.1.30 release? Aren´t the releases 10.1.31 and 10.2.13 stable? Thanks. |
| Comment by Rodney Antonio Ramos [ 2018-06-18 ] |
|
I´ve upgrade do 10.1.31 and problem is the same. The wsrep_local_recv_queue starts increase in one node and I don´t know why. There is no error or warning log messages anymore. At the moment, my wsrep_local_recv_queue is with more than 4 millions and galera cluster do not apply the write-sets. Can someone help me? There is nothing on the log, even with debug enabled. |
| Comment by Zdravelina Sokolovska (Inactive) [ 2018-09-17 ] |
|
wsrep debug enabled does not give any useful information either. |
| Comment by brianr [ 2018-10-04 ] |
|
I'm on MariaDB 10.2.18 and experience the exact same thing after upgrade from 10.2.9. I see apx 100 daily Warning of either (-110) or (-77) , predominantly the former, 110. It's cribbling the cluster performance, each node with 4 CPU's are fighting to serve the mysqld --wsrep_start_position=.... processes (and the abundance of threads it creates. (one process with 146 threads) ) And galeracluster.com support pages offer just about 0 options to investigates, so I hope this apporach kicks me in a better direction Best regards |
| Comment by Geoff Montee (Inactive) [ 2019-06-12 ] |
|
The "Failed to report last committed" messages don't necessarily mean that something is wrong. See MDEV-17550. If your cluster is under a lot of load and you are experiencing performance problems, then you may be seeing the effects of flow control. See the following to find out how to configure flow control parameters: http://galeracluster.com/documentation-webpages/documentation/managing-fc.html |