Uploaded image for project: 'MariaDB Server'
  1. MariaDB Server
  2. MDEV-20912

Add support for utf8mb4_0900_* collations in MariaDB Server

Details

    Description

      Please add support for utf8mb4_0900_* collations in MariaDB Server. This would allow for better accent handling in text data.
      This also makes migrations from MySQL 8.0 to MariaDB easier.
      This is also a requirement for getting replication to work from MySQL 8.0 to MariaDB and MariaDB to MySQL 8.0

      The plan is to implement most of the MySQL 8.0 utf8mb4_0900 collations as alias for the MariaDB utf8mb4_1400 collations.
      This should be fine in most cases as utf8mb4_1400 is a newer version of the utf8mb4_0900 collations.

      Attachments

        Issue Links

          Activity

            ivikga -- created issue -
            julien.fritsch Julien Fritsch made changes -
            Field Original Value New Value
            Fixing Priority 250

            Hello guys from Colnect team : ) !

            Waiting for utf8mb4_0900_ai_ci support to use it for search on https://colnect.com/en

            vimpil Kirill Shevchenko (Inactive) added a comment - Hello guys from Colnect team : ) ! Waiting for utf8mb4_0900_ai_ci support to use it for search on https://colnect.com/en
            danblack Daniel Black made changes -
            greenman Ian Gilfillan made changes -
            Labels collation utf8mb4 Compatibility collation utf8mb4

            utf8mb4_0900_a*_c* ai as ci cs ...would be nice to have available in MariaDB asap.

            bvrrc Brad Vincelette added a comment - utf8mb4_0900_a*_c* ai as ci cs ...would be nice to have available in MariaDB asap.

            bar, is it made obsolete by MDEV-27009 ?

            serg Sergei Golubchik added a comment - bar , is it made obsolete by MDEV-27009 ?
            serg Sergei Golubchik made changes -
            Workflow MariaDB v3 [ 100647 ] MariaDB v4 [ 131178 ]
            greenman Ian Gilfillan made changes -
            Labels Compatibility collation utf8mb4 Compatibility collation compat80 utf8mb4
            monty Michael Widenius made changes -

            Correct. This task can be considered as obsolete. Closing.

            bar Alexander Barkov added a comment - Correct. This task can be considered as obsolete. Closing.
            bar Alexander Barkov made changes -
            Fix Version/s N/A [ 14700 ]
            Assignee Alexander Barkov [ bar ]
            Resolution Won't Do [ 10201 ]
            Status Open [ 1 ] Closed [ 6 ]
            danblack Daniel Black made changes -
            monty Michael Widenius made changes -
            Resolution Won't Do [ 10201 ]
            Status Closed [ 6 ] Stalled [ 10000 ]
            monty Michael Widenius made changes -
            Assignee Alexander Barkov [ bar ] Michael Widenius [ monty ]
            monty Michael Widenius made changes -
            Fix Version/s 11.4.5 [ 29956 ]
            Fix Version/s N/A [ 14700 ]
            monty Michael Widenius made changes -
            Description Please add support for utf8mb4_0900_* collations in MariaDB Server. This would allow for better accent handling in text data. Please add support for utf8mb4_0900_* collations in MariaDB Server. This would allow for better accent handling in text data.
            This also makes migrations from MySQL 8.0 to MariaDB easier.
            This is also a requirement for getting replication to work from MySQL 8.0 to MariaDB and MariaDB to MySQL 8.0
            monty Michael Widenius made changes -
            Status Stalled [ 10000 ] In Progress [ 3 ]
            monty Michael Widenius made changes -
            monty Michael Widenius made changes -
            Description Please add support for utf8mb4_0900_* collations in MariaDB Server. This would allow for better accent handling in text data.
            This also makes migrations from MySQL 8.0 to MariaDB easier.
            This is also a requirement for getting replication to work from MySQL 8.0 to MariaDB and MariaDB to MySQL 8.0
            Please add support for utf8mb4_0900_* collations in MariaDB Server. This would allow for better accent handling in text data.
            This also makes migrations from MySQL 8.0 to MariaDB easier.
            This is also a requirement for getting replication to work from MySQL 8.0 to MariaDB and MariaDB to MySQL 8.0

            The plan is to implement most of the MySQL 8.0 character sets
            monty Michael Widenius made changes -
            Description Please add support for utf8mb4_0900_* collations in MariaDB Server. This would allow for better accent handling in text data.
            This also makes migrations from MySQL 8.0 to MariaDB easier.
            This is also a requirement for getting replication to work from MySQL 8.0 to MariaDB and MariaDB to MySQL 8.0

            The plan is to implement most of the MySQL 8.0 character sets
            Please add support for utf8mb4_0900_* collations in MariaDB Server. This would allow for better accent handling in text data.
            This also makes migrations from MySQL 8.0 to MariaDB easier.
            This is also a requirement for getting replication to work from MySQL 8.0 to MariaDB and MariaDB to MySQL 8.0

            The plan is to implement most of the MySQL 8.0 utf8mb4_0900 collations as alias for the MariaDB utf8mb4_1400 collations.
            This should be fine in most cases as utf8mb4_1400 is a newer version of the utf8mb4_0900 collations.
            monty Michael Widenius added a comment - - edited

            I have now implemented the following 0900 collations (through aliases on the unicode 1400 collation):

            MariaDB [test]> select * from information_schema.COLLATIONS where collation_name like "%0900%";
            +----------------------------+--------------------+------+------------+-------------+---------+
            | COLLATION_NAME             | CHARACTER_SET_NAME | ID   | IS_DEFAULT | IS_COMPILED | SORTLEN |
            +----------------------------+--------------------+------+------------+-------------+---------+
            | utf8mb4_0900_ai_ci         | utf8mb4            |  255 |            | Yes         |       8 |
            | utf8mb4_de_pb_0900_ai_ci   | utf8mb4            |  256 |            | Yes         |       8 |
            | utf8mb4_is_0900_ai_ci      | utf8mb4            |  257 |            | Yes         |       8 |
            | utf8mb4_lv_0900_ai_ci      | utf8mb4            |  258 |            | Yes         |       8 |
            | utf8mb4_ro_0900_ai_ci      | utf8mb4            |  259 |            | Yes         |       8 |
            | utf8mb4_sl_0900_ai_ci      | utf8mb4            |  260 |            | Yes         |       8 |
            | utf8mb4_pl_0900_ai_ci      | utf8mb4            |  261 |            | Yes         |       8 |
            | utf8mb4_et_0900_ai_ci      | utf8mb4            |  262 |            | Yes         |       8 |
            | utf8mb4_es_0900_ai_ci      | utf8mb4            |  263 |            | Yes         |       8 |
            | utf8mb4_sv_0900_ai_ci      | utf8mb4            |  264 |            | Yes         |       8 |
            | utf8mb4_tr_0900_ai_ci      | utf8mb4            |  265 |            | Yes         |       8 |
            | utf8mb4_cs_0900_ai_ci      | utf8mb4            |  266 |            | Yes         |       8 |
            | utf8mb4_da_0900_ai_ci      | utf8mb4            |  267 |            | Yes         |       8 |
            | utf8mb4_lt_0900_ai_ci      | utf8mb4            |  268 |            | Yes         |       8 |
            | utf8mb4_sk_0900_ai_ci      | utf8mb4            |  269 |            | Yes         |       8 |
            | utf8mb4_es_trad_0900_ai_ci | utf8mb4            |  270 |            | Yes         |       8 |
            | utf8mb4_la_0900_ai_ci      | utf8mb4            |  271 |            | Yes         |       8 |
            | utf8mb4_eo_0900_ai_ci      | utf8mb4            |  273 |            | Yes         |       8 |
            | utf8mb4_hu_0900_ai_ci      | utf8mb4            |  274 |            | Yes         |       8 |
            | utf8mb4_hr_0900_ai_ci      | utf8mb4            |  275 |            | Yes         |       8 |
            | utf8mb4_vi_0900_ai_ci      | utf8mb4            |  277 |            | Yes         |       8 |
            | utf8mb4_0900_as_cs         | utf8mb4            |  278 |            | Yes         |       8 |
            | utf8mb4_de_pb_0900_as_cs   | utf8mb4            |  279 |            | Yes         |       8 |
            | utf8mb4_is_0900_as_cs      | utf8mb4            |  280 |            | Yes         |       8 |
            | utf8mb4_lv_0900_as_cs      | utf8mb4            |  281 |            | Yes         |       8 |
            | utf8mb4_ro_0900_as_cs      | utf8mb4            |  282 |            | Yes         |       8 |
            | utf8mb4_sl_0900_as_cs      | utf8mb4            |  283 |            | Yes         |       8 |
            | utf8mb4_pl_0900_as_cs      | utf8mb4            |  284 |            | Yes         |       8 |
            | utf8mb4_et_0900_as_cs      | utf8mb4            |  285 |            | Yes         |       8 |
            | utf8mb4_es_0900_as_cs      | utf8mb4            |  286 |            | Yes         |       8 |
            | utf8mb4_sv_0900_as_cs      | utf8mb4            |  287 |            | Yes         |       8 |
            | utf8mb4_tr_0900_as_cs      | utf8mb4            |  288 |            | Yes         |       8 |
            | utf8mb4_cs_0900_as_cs      | utf8mb4            |  289 |            | Yes         |       8 |
            | utf8mb4_da_0900_as_cs      | utf8mb4            |  290 |            | Yes         |       8 |
            | utf8mb4_lt_0900_as_cs      | utf8mb4            |  291 |            | Yes         |       8 |
            | utf8mb4_sk_0900_as_cs      | utf8mb4            |  292 |            | Yes         |       8 |
            | utf8mb4_es_trad_0900_as_cs | utf8mb4            |  293 |            | Yes         |       8 |
            | utf8mb4_la_0900_as_cs      | utf8mb4            |  294 |            | Yes         |       8 |
            | utf8mb4_eo_0900_as_cs      | utf8mb4            |  296 |            | Yes         |       8 |
            | utf8mb4_hu_0900_as_cs      | utf8mb4            |  297 |            | Yes         |       8 |
            | utf8mb4_hr_0900_as_cs      | utf8mb4            |  298 |            | Yes         |       8 |
            | utf8mb4_vi_0900_as_cs      | utf8mb4            |  300 |            | Yes         |       8 |
            | utf8mb4_0900_as_ci         | utf8mb4            |  305 |            | Yes         |       8 |
            | utf8mb4_0900_bin           | utf8mb4            |  309 |            | Yes         |       1 |
            +----------------------------+--------------------+------+------------+-------------+---------+
            
            

            monty Michael Widenius added a comment - - edited I have now implemented the following 0900 collations (through aliases on the unicode 1400 collation): MariaDB [test]> select * from information_schema.COLLATIONS where collation_name like "%0900%"; +----------------------------+--------------------+------+------------+-------------+---------+ | COLLATION_NAME | CHARACTER_SET_NAME | ID | IS_DEFAULT | IS_COMPILED | SORTLEN | +----------------------------+--------------------+------+------------+-------------+---------+ | utf8mb4_0900_ai_ci | utf8mb4 | 255 | | Yes | 8 | | utf8mb4_de_pb_0900_ai_ci | utf8mb4 | 256 | | Yes | 8 | | utf8mb4_is_0900_ai_ci | utf8mb4 | 257 | | Yes | 8 | | utf8mb4_lv_0900_ai_ci | utf8mb4 | 258 | | Yes | 8 | | utf8mb4_ro_0900_ai_ci | utf8mb4 | 259 | | Yes | 8 | | utf8mb4_sl_0900_ai_ci | utf8mb4 | 260 | | Yes | 8 | | utf8mb4_pl_0900_ai_ci | utf8mb4 | 261 | | Yes | 8 | | utf8mb4_et_0900_ai_ci | utf8mb4 | 262 | | Yes | 8 | | utf8mb4_es_0900_ai_ci | utf8mb4 | 263 | | Yes | 8 | | utf8mb4_sv_0900_ai_ci | utf8mb4 | 264 | | Yes | 8 | | utf8mb4_tr_0900_ai_ci | utf8mb4 | 265 | | Yes | 8 | | utf8mb4_cs_0900_ai_ci | utf8mb4 | 266 | | Yes | 8 | | utf8mb4_da_0900_ai_ci | utf8mb4 | 267 | | Yes | 8 | | utf8mb4_lt_0900_ai_ci | utf8mb4 | 268 | | Yes | 8 | | utf8mb4_sk_0900_ai_ci | utf8mb4 | 269 | | Yes | 8 | | utf8mb4_es_trad_0900_ai_ci | utf8mb4 | 270 | | Yes | 8 | | utf8mb4_la_0900_ai_ci | utf8mb4 | 271 | | Yes | 8 | | utf8mb4_eo_0900_ai_ci | utf8mb4 | 273 | | Yes | 8 | | utf8mb4_hu_0900_ai_ci | utf8mb4 | 274 | | Yes | 8 | | utf8mb4_hr_0900_ai_ci | utf8mb4 | 275 | | Yes | 8 | | utf8mb4_vi_0900_ai_ci | utf8mb4 | 277 | | Yes | 8 | | utf8mb4_0900_as_cs | utf8mb4 | 278 | | Yes | 8 | | utf8mb4_de_pb_0900_as_cs | utf8mb4 | 279 | | Yes | 8 | | utf8mb4_is_0900_as_cs | utf8mb4 | 280 | | Yes | 8 | | utf8mb4_lv_0900_as_cs | utf8mb4 | 281 | | Yes | 8 | | utf8mb4_ro_0900_as_cs | utf8mb4 | 282 | | Yes | 8 | | utf8mb4_sl_0900_as_cs | utf8mb4 | 283 | | Yes | 8 | | utf8mb4_pl_0900_as_cs | utf8mb4 | 284 | | Yes | 8 | | utf8mb4_et_0900_as_cs | utf8mb4 | 285 | | Yes | 8 | | utf8mb4_es_0900_as_cs | utf8mb4 | 286 | | Yes | 8 | | utf8mb4_sv_0900_as_cs | utf8mb4 | 287 | | Yes | 8 | | utf8mb4_tr_0900_as_cs | utf8mb4 | 288 | | Yes | 8 | | utf8mb4_cs_0900_as_cs | utf8mb4 | 289 | | Yes | 8 | | utf8mb4_da_0900_as_cs | utf8mb4 | 290 | | Yes | 8 | | utf8mb4_lt_0900_as_cs | utf8mb4 | 291 | | Yes | 8 | | utf8mb4_sk_0900_as_cs | utf8mb4 | 292 | | Yes | 8 | | utf8mb4_es_trad_0900_as_cs | utf8mb4 | 293 | | Yes | 8 | | utf8mb4_la_0900_as_cs | utf8mb4 | 294 | | Yes | 8 | | utf8mb4_eo_0900_as_cs | utf8mb4 | 296 | | Yes | 8 | | utf8mb4_hu_0900_as_cs | utf8mb4 | 297 | | Yes | 8 | | utf8mb4_hr_0900_as_cs | utf8mb4 | 298 | | Yes | 8 | | utf8mb4_vi_0900_as_cs | utf8mb4 | 300 | | Yes | 8 | | utf8mb4_0900_as_ci | utf8mb4 | 305 | | Yes | 8 | | utf8mb4_0900_bin | utf8mb4 | 309 | | Yes | 1 | +----------------------------+--------------------+------+------------+-------------+---------+
            ralf.gebhardt Ralf Gebhardt made changes -
            Fix Version/s 11.4 [ 29301 ]
            Fix Version/s 11.4.5 [ 29956 ]

            Pushed to 11.4 tree

            monty Michael Widenius added a comment - Pushed to 11.4 tree
            monty Michael Widenius made changes -
            issue.field.resolutiondate 2024-12-30 14:33:50.0 2024-12-30 14:33:50.306
            monty Michael Widenius made changes -
            Fix Version/s 11.4.5 [ 29956 ]
            Fix Version/s 11.4 [ 29301 ]
            Resolution Fixed [ 1 ]
            Status In Progress [ 3 ] Closed [ 6 ]
            serg Sergei Golubchik made changes -
            serg Sergei Golubchik made changes -
            alice Alice Sherepa made changes -
            bar Alexander Barkov made changes -

            People

              monty Michael Widenius
              ivikga --
              Votes:
              12 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Git Integration

                  Error rendering 'com.xiplink.jira.git.jira_git_plugin:git-issue-webpanel'. Please contact your Jira administrators.