[MDEV-27210] New naming convention for UCA collations Created: 2021-12-09 Updated: 2024-01-24 |
|
| Status: | Open |
| Project: | MariaDB Server |
| Component/s: | Character Sets |
| Fix Version/s: | 11.5 |
| Type: | New Feature | Priority: | Major |
| Reporter: | Alexander Barkov | Assignee: | Alexander Barkov |
| Resolution: | Unresolved | Votes: | 1 |
| Labels: | None | ||
| Issue Links: |
|
||||||||
| Description |
|
As of version 10.7, MariaDB understands the following flags in collation names:
We eventually want to support all customizations (collation preferences) as described in: This new naming convention will encode more flags inside collation names. This new naming conversion will be applied to newly added UCA based collations. Old collation names will stay untouched. Collation name structureThe whole collation name structure will consist of the following parts delimited by underscores:
PAD flags
Variable Weighting (punctuation) flags
Accent sensitivity flags
Case sensitivity flags
Identity sensitivity flags
Canonical collation namesThe collation name parser will understand flags in the described above order, e.g.
The canonical names (i.e. as displayed in SHOW CREATE statements or I_S queries) will also print flags in the order described above. The accent and case sensitivity flags will always be printed in canonical names, even with default values. Other flags will be printed only if they have a non-default value. Examples:
DisclaimerWe won't implement all flags mentioned here in a single patch. They will be added in steps under terms of different tasks. Variable weighting and Identity sensitivity flags will most likely be implemented later than other flags. |