NO PAD collations (MDEV-9711)

[MDEV-10742] LDML: make conf_to_src reuse common data between collations Created: 2016-09-05  Updated: 2016-09-07  Resolved: 2016-09-05

Status: Closed
Project: MariaDB Server
Component/s: Character Sets
Affects Version/s: None
Fix Version/s: 10.2.2

Type: Technical task Priority: Major
Reporter: Alexander Barkov Assignee: Alexander Barkov
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Blocks
blocks MDEV-9711 NO PAD collations Closed
Sprint: 10.2.2-3, 10.2.2-2

 Description   

conf_to_src duplicates the ctype, to_lower, to_upper and tab_to_uni arrays when dumping data from the XML files in /sql/share/charsets/ to ctype-extra.c

For example, the ctype map from cp1251.xml is dumped five times:

static const uchar ctype_cp1251_bulgarian_ci[] = ..
static const uchar ctype_cp1251_ukrainian_ci[] = ..
static const uchar ctype_cp1251_bin[] = ..
static const uchar ctype_cp1251_general_ci[] = ..
static const uchar ctype_cp1251_general_cs[] = ..

After adding MDEV-9711 the number of duplicate data will grow even further two times.

We'll change conf_to_src.c to detect duplicate arrays. Non-primary collations will reuse arrays from the primary collation if the data is the same.


Generated at Thu Feb 08 07:44:34 UTC 2024 using Jira 8.20.16#820016-sha1:9d11dbea5f4be3d4cc21f03a88dd11d8c8687422.