Wednesday, November 1, 2017

CLDR Version 32 Released

Unicode CLDR 32 provides an update to the key building blocks for software supporting the world’s languages. This data is used by all major software systems for internationalization and localization, adapting software to the conventions of different languages for common software tasks.

Some of the improvements in the release are:
  • More complete data
    • Major contributions of main locale data for Chakma (ccp), Sindhi (sd), Odia (or), Kabyle (kab), Pashto (ps), Turkmen (tk), Norwegian Nynorsk (nn), Assamese (as), and others.
    • Rule-based number formats for Indian English, Akan, Hindi (oblique), Cherokee; revisions to some others.
    • Import of draft subdivision names and language groups from Wikidata.
  • New data types
    • Numeric exemplars. For example, in zh: [\- , . % ‰ + 0 1 2 3 4 5 6 7 8 9 〇 一 七 三 九 二 五 八 六 四]
    • “Disjunctive” list style (e.g., “a, b, or c”)
    • AvailableFormats items for day periods (skeleton “Bhm” → pattern “h:mm B” → “1:30 in the afternoon”)
  • Major additions for Emoji
    • Emoji name and keyword updates for Unicode 10 and Emoji 5.0 (minor updates for English, full data collection for other languages). Keywords now in sorted order.
    • Adjustments to emoji collation
For further details and links to documentation, see the CLDR Release Notes.
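
To make the “disjunctive” list style concrete, here is a minimal Python sketch of how list patterns of this kind can be composed. It assumes the four-part pattern shape CLDR uses for lists ("2", "start", "middle", "end"); the English pattern strings and the `format_list` helper are illustrative assumptions, not copied from the CLDR 32 data files or from any library API.

```python
# Hypothetical pattern table modeled on CLDR-style list pattern parts.
# The English strings are assumed for illustration only.
OR_PATTERNS_EN = {
    "2": "{0} or {1}",       # exactly two items
    "start": "{0}, {1}",     # first item + rest
    "middle": "{0}, {1}",    # interior item + rest
    "end": "{0}, or {1}",    # last two items
}


def format_list(items, patterns):
    """Join items using the four CLDR-style list pattern parts."""
    items = list(items)
    if not items:
        return ""
    if len(items) == 1:
        return items[0]
    if len(items) == 2:
        return patterns["2"].format(items[0], items[1])
    # Build from the right: "end" joins the last two items,
    # "middle" prepends each interior item, "start" prepends the first.
    result = patterns["end"].format(items[-2], items[-1])
    for item in reversed(items[1:-2]):
        result = patterns["middle"].format(item, result)
    return patterns["start"].format(items[0], result)


print(format_list(["a", "b", "c"], OR_PATTERNS_EN))  # a, b, or c
```

Keeping the patterns in a table rather than hard-coding " or " is the point of the data-driven approach: a different locale would supply its own four strings and the same joining logic would apply.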

ICU 60 Released

Unicode® ICU 60 has just been released! ICU is a software library widely used by products and other libraries to support the world's languages, implementing the latest versions of both the Unicode encoding standard and the Unicode locale data (CLDR).

ICU 60 upgrades to Unicode 10 and CLDR 32, and ICU4J has been tested with Java 9. ICU 60 includes a new API for number formatting. There are many more features and bug fixes.

For details please see http://site.icu-project.org/download/60