Monday, September 18, 2017

Unicode CLDR 32α available for testing

cldr v31 alpha The alpha version of Unicode CLDR 32 is available for testing. The alpha period lasts until the beta release on September 27, which will include updates to the LDML spec. The final release is expected on October 19.

CLDR 32 provides an update to the key building blocks for software supporting the world's languages. This data is used by all major software systems for their software internationalization and localization, adapting software to the conventions of different languages for such common software tasks.

CLDR 32 included a Survey Tool data collection phase, with a resulting significant increase in data size, especially for emoji names/annotations and geographic subdivision names. Other enhancements include rule-based number formats for additional languages, a new “disjunctive” list style (a, b, or c), and fixes for Chinese collation and transliteration. The draft release page at http://cldr.unicode.org/index/downloads/cldr-32 lists the major features, and has pointers to the newest data and charts. It will be fleshed out over the coming weeks with more details, migration issues, known problems, and so on. Particularly useful for review are:
Please report any problems that you find using a CLDR ticket. We'd also appreciate it if programmatic users of CLDR data download the xml files and do a trial integration to see if any problems arise.

Wednesday, August 30, 2017

New Emoji Subcommittee Vice-Chairs

silhouette imageThe Emoji Subcommittee (ESC) is on the front lines of Unicode emoji. It is responsible for accepting requests for new emoji and emoji sequences, helping requesters to fill out missing areas in their proposals, and providing prioritized recommendations to the Unicode Technical Committee.

Peter Edberg is stepping down as the co-chair of ESC, a role he has filled since its inception. He is one of the key people involved in Unicode emoji since the very beginning, so we are very lucky that he will continue as one of the technical leaders of ESC, and remain the co-author of “Unicode Emoji” (UTS #51). To ensure the smooth operation of the ESC, we have three eminently-qualified new vice-chairs: Jeremy Burge, who has been responsible for crafting and refining proposals from the most popular requests received at Emojipedia; Jennifer 8 Lee, who has played a pivotal role in developing, inspiring, and mentoring emoji requests through Emojination; and Tayfun Karadeniz, who has researched, organized, and shepherded emoji in the very popular areas of smileys and human-form emoji. All three have already made lasting contributions to the work of the ESC, and we welcome them in their new roles.

— Mark Davis, ESC chair

Over 100,000 characters are available for adoption, to help the Unicode Consortium’s work on digitally disadvantaged languages.

[fortune cookie badge]

Friday, August 25, 2017

Unicode 10.0 Paperback Available

[Unicode 10.0 Cover Art] The Unicode 10.0 core specification is now available in paperback book form with a new, original cover design. This edition consists of a pair of modestly priced print-on-demand volumes containing the complete text of the core specification of Version 10.0 of the Unicode Standard.

Each of the two volumes is a compact 6×9 inch US trade paperback size. The two volumes may be purchased separately or together, although they are intended as a set. The cost for the pair is US $16.85, plus postage and applicable taxes. Please visit the description page to order.

Note that these volumes do not include the Version 10.0 code charts, nor do they include the Version 10.0 Standard Annexes and Unicode Character Database, which are freely available on the Unicode website.

Purchase The Unicode Standard, Version 10.0 - Core Specification

Thursday, August 24, 2017

Gold Sponsor SMU Guildhall



The Unicode Consortium is pleased to announce that SMU Guildhall is now a gold sponsor for:


SMU Guildhall's  sponsorship directly funds the work of the Unicode Consortium in enabling modern software and computing systems to support the widest range of human languages. There are approximately 7,000 living human languages. Fewer than 100 of these languages are well-supported on computers, mobile phones, and other devices. AAC donations are used to improve support for digitally disadvantaged languages, and to help preserve the world’s linguistic heritage.
SMU Guildhall is the #1 graduate school for video game design, the first in the world to offer a master's degree in interactive technology, and the only program with specializations in all four cornerstones of game development: Art, Design, Programming, and Production. Their game industry faculty turn a passion for gaming into a viable and fulfilling career by mentoring students through the two year program and 3+ team game projects. Over 700 Guildhall alumni have worked at over 250 game studios globally.  — SMU Guildhall
The Unicode Consortium thanks SMU Guildhall for their support!

All sponsors are listed on Sponsors of Adopted Characters. More than 128,000 other characters are available for adoption — see Adopt a Character

Tuesday, August 22, 2017

Keynote Speaker Announced for IUC 41


Can We Escape Alphabetic Order? Chinese I.T. Before and After Unicode

Thomas S. Mullaney
Associate Professor of Chinese History, Stanford University

Drawing upon more than a decade of research in the fields of Chinese and Non-Western information technology, Stanford historian Tom Mullaney maps out the parallel and still-uncharted worlds of Chinese IT. Worlds where text encoding systems have never been hidden from the average user’s view, but instead where they are directly manipulated, command-line-style, by hundreds of millions of code-conscious users. Worlds where the mythology of “plaintext” was never allowed to take hold, and where everyone knows that ‘WYS’ is not ‘WYG’. With vivid examples pulled from the archives of telegraphy, computing, machine translation, digital typography, and more, he will give a guided tour of China‘s 200-year-old quest for a stable information order, posing a fundamental question along the way: Why has the Chinese script proven so difficult to encode, and what does this tell us about the fundamental inequalities that are still baked into our modern-day information order?

About IUC 41, October 16-18, 2017: For twenty-six years the Internationalization & Unicode® Conference (IUC) has been the preeminent event highlighting the latest innovations and best practices of global and multilingual software providers. Please join us for our 41st conference! This year's event is being held on October 16-18, 2017 in Santa Clara, California. Read more.