[MCOL-3383] mysqld crashing on replication drop table - local query PMs Created: 2019-06-14 Updated: 2020-08-25 Resolved: 2019-06-14 |
|
| Status: | Closed |
| Project: | MariaDB ColumnStore |
| Component/s: | MariaDB Server |
| Affects Version/s: | 1.2.3 |
| Fix Version/s: | Icebox |
| Type: | Bug | Priority: | Major |
| Reporter: | David Hill (Inactive) | Assignee: | Unassigned |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: |
2um 2pm with local query |
||
| Issue Links: |
|
||||||||
| Description |
|
Customer reporting the system was down and the mysqld on pm1 and pm2 were crashing. This might be the same issue as https://jira.mariadb.org/browse/MCOL-2212, which is tied to 1.2.4. But customer is asking is the actually crash fixed? They request that the mysqld server be fixed not to crash in the case of the failed drop table. Server version: 10.3.13-MariaDB-log Thread pointer: 0x7efcf40012a8 Trying to get some variables. |
| Comments |
| Comment by Andrew Hutchings (Inactive) [ 2019-06-14 ] |
|
Duplicate of |
| Comment by David Hill (Inactive) [ 2019-06-14 ] |
|
From customer: 1) it seems that the upgrade procedure erase the existing comments, we have tables with autoincrement in comments. So this won't really work and we still have to manually do this. That 'workaround' needs to work in all cases. 2) my comment is still valid, replication issues should not crash the mysqld and a mysqld crash should not bring down the columnstore cluster. which the patch for |
| Comment by Andrew Hutchings (Inactive) [ 2019-06-17 ] |
|
1) Autoincrement getting removed from the table comments is fine, that is only used at the CREATE TABLE phase to tell ColumnStore the initial information. Many alternative workarounds have been tried and they are more risky or complicated than this one. Also if they use the "ALTER TABLE ... CHANGE" workaround as in my first comment to this ticket the table comment would remain since this is altering a column comment (any column will work). 2) The workaround does address this because it stops the crashing. But if that isn't acceptable then MDEV-19120 needs to be fixed. That is not a bug in ColumnStore but a bug in MariaDB Server and affects all external engines. All we can do is provide the workaround until it gets fixed. |
| Comment by David Hill (Inactive) [ 2019-06-18 ] |
|
from customer: I understand that the engine part cannot do anything about the server side crashing, do you want me to open a different ticket for the mariadb server side then? Also, the point I am making is different from who is responsible for that crash or can we workaround it. 1) The mysqld server side, should not crash on replication issue, maybe a log an error and stop the slave instead of a segfault. 2) The columnstore stack went degraded while the mysqld was not on a um and not all um access point were down. The system should have been queryable fine. Then the system stays in fail state upon restart. This has to do with management of the stack and what is crucial to run or not, and robustness and recovery from failure, whether or not the crash has originated from columnstore. This is viewed as a mariadb unified product (mariadb platform). Let me known if i need to open a different ticket to escalate this on the server side. |