Wednesday, February 27, 2019

Unicode CLDR 35 alpha available for testing

The alpha version of Unicode CLDR 35 is available for testing. The alpha period lasts until the beta release on March 13, which will include updates to the LDML spec. The final release is expected on March 27.

Unicode CLDR 35 provides an update to the key building blocks for software supporting the world's languages. CLDR data is used by all major software systems for their software internationalization and localization, adapting software to the conventions of different languages for such common software tasks.

CLDR 35 included a limited Survey Tool data collection phase, adding approximately 54 thousand new translated fields:

Basic coverage New languages at Basic coverage: Cebuano (ceb), Hausa (ha), Igbo (ig), Yoruba (yo)
Modern coverage Languages Somali (so) and Javanese (jv) has additional coverage from Moderate to Modern
Emoji 12.0 Names and annotations (search keywords) for 90+ new emoji;
Also includes fixes for previous names & keywords
Collation Collation updated to Unicode 12.0, including new emoji;
Japanese single-character (ligature) era names added to collation and search collation
Measurement units  23 additional units
Date formats Two additional flexible formats, and 20 new interval formats
Japanese calendar Updated to Gannen (元年) number format
Region Names Many names updated to local equivalents of  “North Macedonia” (MK) and “Eswatini” (SZ)

A dot release, version 35.1 is expected in April, with further changes for Japanese calendar.

For details, see Detailed Specification Changes, Detailed Structure Changes, Detailed Data Changes, Growth.

Tuesday, February 5, 2019

Unicode Emoji 12.0 — final for 2019

emoji 12 image Emoji 12.0 data has been released, with 59 new emoji such as:

mechanical arm image
mechanical arm
deaf person image
deaf person
people holding hands image
people holding hands
otter image
otter
waffle image
waffle
ice cube image
ice cube
ringed planet image
ringed planet
drop of blood image
drop of blood

With 171 variants for gender and skin tone, this makes a total of 230 emoji including variants, such as:

The new emoji are listed in Emoji Recently Added v12.0, with sample images. These images are just samples: vendors for mobile phones, PCs, and web platforms will typically use images that fit their overall emoji designs. In particular, the Emoji Ordering v12.0 chart shows how the new emoji sort compared to the others, with new emoji marked with rounded-rectangles. The other Emoji Charts for Version 12.0 have been updated to show the emoji.

The new emoji typically start showing up on mobile phones in September/October — some platforms may release them earlier. The new emoji will soon be available for adoption to help the Unicode Consortium’s work on digitally disadvantaged languages.

For implementers:
  1. The new Emoji 12.0 set includes the data needed for vendors to begin working on their emoji fonts and code ahead of the release of Unicode 12.0, scheduled for March 5.
  2. The emoji specification (UTS #51) has additional guidelines on gender and skin tone, and other clarifications. The definitions in UTS #51 and data files and have been enhanced to be more consistent and useful. For details, see Modifications
  3. The people holding hands emoji now have four combinations of gender and all the various combinations of skin tones, for a total of 71 new variants. Implementations may optionally support skin-tone combinations for other multi-person emoji.
  4. The CLDR names and search keywords for the new emoji characters in over 80 languages, and the sort order for emoji, will be finalized by the end of March with the release of CLDR v35.


Over 130,000 characters are available for adoption, to help the Unicode Consortium’s work on digitally disadvantaged languages.

[badge]