Details
New Feature
Status: Stalled
Critical
Resolution: Unresolved
None
Description
Identifier names cannot contain characters outside the BMP (Basic Multilingual Plane), i.e. they are restricted to utf8mb3.
Here is the relevant part of the Slack discussion on why this is so, and on a possible fix:
... discussion on character_set_system and why it is utf8mb3...
....

bar Oct 13th, 2021 at 4:23 PM
@wlad yes, it's hard-coded. I think the biggest problem is to implement table-name-to-file-name encoding for non-BMP characters. Should be doable but needs some time.

5 replies

wlad 3 months ago
so, a surrogate pair won't do? like, @d801@dc37

bar 3 months ago
for characters that do not have lower/upper variants, it will do.

bar 3 months ago
It will actually do for characters that have lower/upper variants as well.

bar 3 months ago
Thanks for the good idea.
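The surrogate-pair idea from the discussion above can be illustrated with a minimal sketch. It only shows how a non-BMP code point splits into two UTF-16 surrogates, each written in the `@hhhh` form wlad mentions; the helper names `to_surrogate_pair` and `encode_non_bmp` are hypothetical, and this is not the server's actual table-name-to-file-name encoder, which also handles ASCII, BMP characters and case variants.

```python
from typing import Tuple


def to_surrogate_pair(code_point: int) -> Tuple[int, int]:
    """Split a supplementary code point (U+10000..U+10FFFF) into UTF-16 surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("not a non-BMP code point")
    offset = code_point - 0x10000          # 20-bit value
    high = 0xD800 + (offset >> 10)         # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)        # bottom 10 bits
    return high, low


def encode_non_bmp(ch: str) -> str:
    """Hypothetical helper: render one non-BMP character as '@hhhh@hhhh'."""
    high, low = to_surrogate_pair(ord(ch))
    return f"@{high:04x}@{low:04x}"


# U+10437 is the character behind the example in the discussion.
print(encode_non_bmp("\U00010437"))  # prints "@d801@dc37"
```

Because each half stays within four hex digits, a scheme like this could in principle reuse the existing `@hhhh` file-name syntax; whether the real fix takes this shape is not stated in the ticket.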
Issue Links
- is blocked by
  - MDEV-30556 UPPER() returns an empty string for U+0251 in Unicode-5.2.0+ collations for utf8 (Closed)
  - MDEV-30577 Case folding for uca1400 collations is not up to date (Closed)
  - MDEV-30661 UPPER() returns an empty string for U+0251 in uca1400 collations for utf8 (Closed)
  - MDEV-31340 Remove MY_COLLATION_HANDLER::strcasecmp() (In Review)
  - MDEV-31531 Remove my_casedn_str() and my_caseup_str() (In Review)
  - MDEV-31606 Refactor check_db_name() to get a const argument (Closed)
  - MDEV-31972 Change parameter of make_sp_name*() from LEX_CSTRING to Lex_ident_sys_st (Closed)
  - MDEV-31978 Turn ok_for_lower_case_names() to a method in Lex_ident_fs (Closed)
  - MDEV-32002 Remove my_casedn_str() in append_identifier() context (Closed)
  - MDEV-32019 Replace my_casedn_str(local_buffer) to CharBuffer::copy_casedn() (Closed)
  - MDEV-32081 Remove my_casedn_str() from get_canonical_filename() (Closed)
- relates to
  - MDEV-19123 Change default charset from latin1 to utf8mb4 (Open)
  - MDEV-25829 Change default collation to utf8mb4_1400_ai_ci (In Review)
  - MDEV-32904 smiley emoji (F09F9883) valid in utf8 but not utf8mb4 (Closed)