Tuesday, September 15, 2020

Unicode CLDR Locale Data v38 alpha available for testing

The alpha version of Unicode CLDR version 38 is now available for data testing. The final release of v38 is planned for October 22, 2020. If you find any problems with the data, please file a ticket.

Unicode CLDR provides an update to the key building blocks for software supporting the world's languages. CLDR data is used by all major software systems (including all mobile phones) for their software internationalization and localization, adapting software to the conventions of different languages.

CLDR v38 includes:
  • Enhancements to existing locale data: adding support for units of measurement in inflected languages (phase 1), adding annotations (names and search keywords) for Unicode symbols that are non-emoji (~400), and annotations for Emoji v13.1.
  • New locales added: Dogri and Sanskrit.
  • Survey Tool upgrades: substantial performance improvements, plus structured forum entries to improve coordination among translators.
See additional details in the draft CLDR v38 Release note

The overall changes to the data items were:

Added Deleted Changed
155,131 33,805 45,895

Over 140,000 characters are available for adoption to help the Unicode Consortium’s work on digitally disadvantaged languages