MariaDB Server / MDEV-20912

Add support for utf8mb4_0900_* collations in MariaDB Server

Details

    Description

      Please add support for the utf8mb4_0900_* collations to MariaDB Server. This would allow better accent handling in text data.
      It would also make migrations from MySQL 8.0 to MariaDB easier.
      It is also a requirement for replication to work from MySQL 8.0 to MariaDB and from MariaDB to MySQL 8.0.

      The plan is to implement most of the MySQL 8.0 utf8mb4_0900 collations as aliases for the MariaDB utf8mb4_1400 collations.
      This should be fine in most cases, as the utf8mb4_1400 collations are based on a newer version of the Unicode Collation Algorithm (14.0.0 rather than 9.0.0).
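
      As a rough illustration of what the requested collations would enable, here is a minimal sketch. It assumes a MariaDB build in which the 0900 names are available; the table and column names are hypothetical, and the comments describe the expected behaviour of accent- and case-insensitive matching rather than verified output.

```sql
-- Hypothetical table using the MySQL 8.0 default collation name.
CREATE TABLE people (
  name VARCHAR(100)
) CHARACTER SET utf8mb4 COLLATE utf8mb4_0900_ai_ci;

INSERT INTO people (name) VALUES ('Zoe'), ('Zoë'), ('zoe');

-- ai_ci = accent-insensitive, case-insensitive:
-- all three rows are expected to match.
SELECT name FROM people WHERE name = 'zoe';

-- as_cs = accent-sensitive, case-sensitive:
-- only the exact string 'zoe' is expected to match.
SELECT name FROM people WHERE name = 'zoe' COLLATE utf8mb4_0900_as_cs;
```

      Because the same collation name is accepted on both sides, a table definition like this one could in principle be dumped from MySQL 8.0 and loaded without editing the COLLATE clause.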

          Activity

            vimpil Kirill Shevchenko (Inactive) added a comment:

            Hello guys, from the Colnect team :)

            We are waiting for utf8mb4_0900_ai_ci support so that we can use it for search on https://colnect.com/en

            bvrrc Brad Vincelette added a comment:

            utf8mb4_0900_a*_c* (ai, as, ci, cs, ...) would be nice to have available in MariaDB ASAP.

            serg Sergei Golubchik added a comment:

            bar, is this made obsolete by MDEV-27009?

            bar Alexander Barkov added a comment:

            Correct. This task can be considered obsolete. Closing.
            monty Michael Widenius added a comment (edited):

            I have now implemented the following 0900 collations (as aliases for the Unicode 1400 collations):

            MariaDB [test]> select * from information_schema.COLLATIONS where collation_name like "%0900%";
            +----------------------------+--------------------+------+------------+-------------+---------+
            | COLLATION_NAME             | CHARACTER_SET_NAME | ID   | IS_DEFAULT | IS_COMPILED | SORTLEN |
            +----------------------------+--------------------+------+------------+-------------+---------+
            | utf8mb4_0900_ai_ci         | utf8mb4            |  255 |            | Yes         |       8 |
            | utf8mb4_de_pb_0900_ai_ci   | utf8mb4            |  256 |            | Yes         |       8 |
            | utf8mb4_is_0900_ai_ci      | utf8mb4            |  257 |            | Yes         |       8 |
            | utf8mb4_lv_0900_ai_ci      | utf8mb4            |  258 |            | Yes         |       8 |
            | utf8mb4_ro_0900_ai_ci      | utf8mb4            |  259 |            | Yes         |       8 |
            | utf8mb4_sl_0900_ai_ci      | utf8mb4            |  260 |            | Yes         |       8 |
            | utf8mb4_pl_0900_ai_ci      | utf8mb4            |  261 |            | Yes         |       8 |
            | utf8mb4_et_0900_ai_ci      | utf8mb4            |  262 |            | Yes         |       8 |
            | utf8mb4_es_0900_ai_ci      | utf8mb4            |  263 |            | Yes         |       8 |
            | utf8mb4_sv_0900_ai_ci      | utf8mb4            |  264 |            | Yes         |       8 |
            | utf8mb4_tr_0900_ai_ci      | utf8mb4            |  265 |            | Yes         |       8 |
            | utf8mb4_cs_0900_ai_ci      | utf8mb4            |  266 |            | Yes         |       8 |
            | utf8mb4_da_0900_ai_ci      | utf8mb4            |  267 |            | Yes         |       8 |
            | utf8mb4_lt_0900_ai_ci      | utf8mb4            |  268 |            | Yes         |       8 |
            | utf8mb4_sk_0900_ai_ci      | utf8mb4            |  269 |            | Yes         |       8 |
            | utf8mb4_es_trad_0900_ai_ci | utf8mb4            |  270 |            | Yes         |       8 |
            | utf8mb4_la_0900_ai_ci      | utf8mb4            |  271 |            | Yes         |       8 |
            | utf8mb4_eo_0900_ai_ci      | utf8mb4            |  273 |            | Yes         |       8 |
            | utf8mb4_hu_0900_ai_ci      | utf8mb4            |  274 |            | Yes         |       8 |
            | utf8mb4_hr_0900_ai_ci      | utf8mb4            |  275 |            | Yes         |       8 |
            | utf8mb4_vi_0900_ai_ci      | utf8mb4            |  277 |            | Yes         |       8 |
            | utf8mb4_0900_as_cs         | utf8mb4            |  278 |            | Yes         |       8 |
            | utf8mb4_de_pb_0900_as_cs   | utf8mb4            |  279 |            | Yes         |       8 |
            | utf8mb4_is_0900_as_cs      | utf8mb4            |  280 |            | Yes         |       8 |
            | utf8mb4_lv_0900_as_cs      | utf8mb4            |  281 |            | Yes         |       8 |
            | utf8mb4_ro_0900_as_cs      | utf8mb4            |  282 |            | Yes         |       8 |
            | utf8mb4_sl_0900_as_cs      | utf8mb4            |  283 |            | Yes         |       8 |
            | utf8mb4_pl_0900_as_cs      | utf8mb4            |  284 |            | Yes         |       8 |
            | utf8mb4_et_0900_as_cs      | utf8mb4            |  285 |            | Yes         |       8 |
            | utf8mb4_es_0900_as_cs      | utf8mb4            |  286 |            | Yes         |       8 |
            | utf8mb4_sv_0900_as_cs      | utf8mb4            |  287 |            | Yes         |       8 |
            | utf8mb4_tr_0900_as_cs      | utf8mb4            |  288 |            | Yes         |       8 |
            | utf8mb4_cs_0900_as_cs      | utf8mb4            |  289 |            | Yes         |       8 |
            | utf8mb4_da_0900_as_cs      | utf8mb4            |  290 |            | Yes         |       8 |
            | utf8mb4_lt_0900_as_cs      | utf8mb4            |  291 |            | Yes         |       8 |
            | utf8mb4_sk_0900_as_cs      | utf8mb4            |  292 |            | Yes         |       8 |
            | utf8mb4_es_trad_0900_as_cs | utf8mb4            |  293 |            | Yes         |       8 |
            | utf8mb4_la_0900_as_cs      | utf8mb4            |  294 |            | Yes         |       8 |
            | utf8mb4_eo_0900_as_cs      | utf8mb4            |  296 |            | Yes         |       8 |
            | utf8mb4_hu_0900_as_cs      | utf8mb4            |  297 |            | Yes         |       8 |
            | utf8mb4_hr_0900_as_cs      | utf8mb4            |  298 |            | Yes         |       8 |
            | utf8mb4_vi_0900_as_cs      | utf8mb4            |  300 |            | Yes         |       8 |
            | utf8mb4_0900_as_ci         | utf8mb4            |  305 |            | Yes         |       8 |
            | utf8mb4_0900_bin           | utf8mb4            |  309 |            | Yes         |       1 |
            +----------------------------+--------------------+------+------------+-------------+---------+
            
            


            monty Michael Widenius added a comment:

            Pushed to the 11.4 tree.

            People

              monty Michael Widenius
              ivikga --
              Votes: 12
              Watchers: 9
