[MDEV-8686] A user defined collation utf8_confusables doesn't work Created: 2015-08-27  Updated: 2016-06-23  Resolved: 2016-06-23

Status: Closed
Project: MariaDB Server
Component/s: Character Sets
Affects Version/s: 10.0, 10.1, 10.2
Fix Version/s: 10.1.15

Type: Bug Priority: Major
Reporter: Alexander Barkov Assignee: Alexander Barkov
Resolution: Fixed Votes: 0
Labels: None

Attachments: XML File Index.xml    

 Description   

The collation defined in the attached Index.xml does not work.
It crashes in a debug build and writes an "Expansion too long" error to the server error log in a release build.

It seems the problem happens because MariaDB collation customization currently has limitations in shifting characters that have implicit weights (i.e. those not listed in DUCET).
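
For readers unfamiliar with implicit weights, here is a minimal Python sketch of how the Unicode Collation Algorithm (UTS #10) derives a pair of primary weights for a code point that has no DUCET entry. It is not MariaDB's code: the function name implicit_weights is hypothetical, and the CJK ranges are an assumption based on older UCA versions, so the ranges used by a given server version may differ.

```python
def implicit_weights(cp):
    """Return the two implicit primary weights (AAAA, BBBB) for a code point.

    Sketch of the classic UTS #10 scheme; the ranges below are assumptions.
    """
    if 0x4E00 <= cp <= 0x9FA5:
        base = 0xFB40  # CJK Unified Ideographs (assumed range)
    elif 0x3400 <= cp <= 0x4DB5:
        base = 0xFB80  # CJK Unified Ideographs Extension A (assumed range)
    else:
        base = 0xFBC0  # any other code point without an explicit weight
    aaaa = base + (cp >> 15)        # first weight: base plus the high bits
    bbbb = (cp & 0x7FFF) | 0x8000   # second weight: low 15 bits, top bit set
    return aaaa, bbbb


if __name__ == "__main__":
    for cp in (0x3400, 0x4E00, 0x10000):
        first, second = implicit_weights(cp)
        print("U+%04X -> %04X %04X" % (cp, first, second))
```

Because such a character's weights are computed from its code point rather than looked up in a table, a customization rule that resets to it or shifts it has no stored weight to start from, which appears to be the case the customization code did not cover.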



 Comments   
Comment by Alexander Barkov [ 2016-06-20 ]

How to repeat:

  • Put the attached Index.xml into share/charsets
  • Run this query:

    SELECT _utf8'a' COLLATE utf8_confusables;
    

Comment by Alexander Barkov [ 2016-06-23 ]

The problem is that the collation customization code does not handle:

  • reset to a character with an implicit weight
  • shift of a character with an implicit weight