Thursday, September 9, 2021

Unicode CLDR v40 Alpha available for testing

construction image The Unicode CLDR v40 Alpha is now available for testing. The alpha has already been integrated into the development version of ICU. We would especially appreciate feedback from non-ICU consumers of CLDR data. Feedback can be filed at CLDR Tickets.

Alpha means that the main data and charts are available for review, but the specification, JSON data, and other components are not yet ready for review. Some data may change if showstopper bugs are found. The planned schedule is:
  • Sep 21 — Beta (data)
  • Oct 06 — Beta2 (spec)
  • Oct 27 — Release
In CLDR v40, the main focus is on:
  • Grammatical features (gender and case) for units of measurement in additional locales

    Phase 1 (v39) of grammatical features included just 12 locales (da, de, es, fr, hi, it, nl, no, pl, pt, ru, sv).

    Phase 2 (v40) has expanded the number of locales by 29 (am, ar, bn, ca, cs, el, fi, gu, he, hr, hu, hy, is, kn, lt, lv, ml, mr, nb, pa, ro, si, sk, sl, sr, ta, te, uk, ur), but for a narrower set of units.

  • Emoji v14 names and search keywords
  • Modernized Survey Tool front end.
There are many other changes: to find out more, see the draft CLDR v40 release page, which has information on accessing the date, reviewing charts of the changes, and necessary migration changes.

Unicode CLDR provides key building blocks for software supporting the world’s languages. CLDR data is used by all major software systems (including all mobile phones) for their software internationalization and localization, adapting software to the conventions of different languages.

Over 140,000 characters are available for adoption to help the Unicode Consortium’s work on digitally disadvantaged languages