The Unicode CLDR 48 Alpha is now available for integration testing.
CLDR provides key building blocks for software to support the world's languages (dates, times, numbers, sort-order, etc.) For example, all major browsers and all modern mobile phones use CLDR for language support. (See Who uses CLDR?)
Via the Survey Tool, contributors supply data for their languages — data that is widely used to support much of the world’s software. This data is also a factor in determining which languages are supported on mobile phones and computer operating systems.
The alpha has already been integrated into the development versions of ICU 78 and ICU4X. We would especially appreciate feedback from non-ICU consumers of CLDR data and on Migration issues. Feedback can be filed via CLDR Tickets.
Some of the most significant changes in this release are the following (for more detail, see the CLDR 48 release note page):
- Updated for Unicode 17, including new names and search terms for new emoji, new sort order, Han→Latin romanization additions for many characters.
- Updated to the latest external standards and data sources, such as the language subtag registry, UN M49 macro regions, ISO 4217 currencies, etc.
- Many enhancements of the CLDR specification (LDML) are due for addition by Oct 1.
- Many additions to language data including:
- Likely Subtags, for deriving the likely script and region from the language (used in many processes)
- Language populations in countries: significant updates to improve accuracy and maintainability
- New formatting options
- Rational number formats added, allowing for formats like “5½”
- For timezones, usesMetazone adds two new attributes stdOffset and dstOffset so that implementations can use either “vanguard” or “rearguard” TZDB data sources
- Combination formats added for relative dates + times, such as “tomorrow at 12:30”
- Additional units added for scientific contexts (coulombs, farads, teslas, etc.) and for English systems (fortnights, imperial pints, etc.)
- Many corrections and updates for Metazone data, for calendars (including removal of eras and fixes to start dates).
- This is the first release where the new CLDR Organization process is in place for DDL languages. As a result, several locales were able to reach higher levels (see below).
Locale Coverage Levels
Changes in coverage
For the details, see the CLDR 48 release note page, which has information on accessing the data, reviewing charts of the changes, and — importantly — will cover Migration issues.
----------------------------------------------
Adopt a Character and Support Unicode’s Mission
Looking to give that special someone a special something?
Or maybe something to treat yourself?
🕉️💗🏎️🐨🔥🚀爱₿♜🍀
Adopt a character or emoji to give it the attention it deserves, while also supporting Unicode’s mission to ensure everyone can communicate in their own languages across all devices.
Each adoption includes a digital badge and certificate that you can proudly display!
Have fun and support a good cause
You can also donate funds or gift stock