ICU is the
premier library for
software internationalization, used by a
wide array of companies and organizations to support the world's
languages, implementing both the latest version of the Unicode Standard and
of the Unicode locale data (CLDR).
CLDR provides key building blocks for
software to support the world's languages (dates, times, numbers,
sort-order, etc.). All major browsers and all modern mobile phones use CLDR
for language support. (See
Who uses CLDR?)
CLDR extends the support for “short”
Chinese sort orders to cover some additional, required characters for Level
2. This is carried over into ICU collation.
ICU has a modified character conversion
table, mapping some GB18030 characters to Unicode characters that were
encoded after GB18030-2005.
There are optional variants of time formats
with AM/PM (only for English) using ASCII spaces in CLDR that can also be
used in ICU via custom data generation. This is intended to help certain
implementers transition to the improved patterns, which have used a narrow
no-break space between the time and AM/PM since
- For how to generate ICU data with this option, look for alt="ascii" on tools/cldr/cldr-to-icu/README.md
The changes to the word segmentation
behavior of @ sign that were in CLDR 42 (ICU 72) have been reverted. These
caused problems for certain parsers that did not expect @ to join to
For details, please see:
To support Unicode’s mission to ensure everyone can communicate in their languages across all devices, please consider adopting a character, making a gift of stock, or making a donation. As Unicode, Inc. is a US-based open source, open standards, non-profit, 501(c)3 organization, your contribution may be eligible for a tax deduction. Please consult with a tax advisor for details.