Thursday, September 19, 2019

New Public Review Issues for Unicode Technical Reports

The Unicode Consortium has recently opened several Public Review Issues for proposed updates to Unicode Standard Annexes and other technical reports. The closing date for comments on these open issues is September 30, 2019, for feedback to be reviewed at the UTC meeting.

Highlights include a major proposed update to UTS #51, Unicode Emoji, as well as significant updates to UAX #14, Unicode Line Breaking Algorithm; UTS #18, Unicode Regular Expressions; UAX #29, Unicode Text Segmentation; and UAX #38, Unicode Han Database.

Please see the Public Review Issues page for a full list of the items for review and links to the documents.

Over 136,000 characters are available for adoption, to help the Unicode Consortium’s work on digitally disadvantaged languages.


Friday, August 30, 2019

Internationalization & Unicode Conference #43: Keynote Speaker Announced


Don’t Believe a Word: Multilingual Typographic Systems and a 100-year Publishing Project

Dr. Rathna Ramanathan
Reader in Intercultural Communication and Dean of the School of Communication, Royal College of Art, London

The Murty Classical Library of India aims to make accessible modern translations of Indian texts in print and online. In the first five years, 22 volumes in 12 different languages have been published. Please join us as Dr. Ramanathan reflects on the delights and challenges of building a complex, multilingual typographic system for this unique 100-year publishing project. In addition, Dr. Ramanathan will discuss a subsequent research project which aims to create typographic guidelines for Indian languages and scripts.

A typographer, researcher and educator, known for her expertise in intercultural communication, typography and alternative publishing practices, Dr. Ramanathan has, for the past 20 years, run a design studio (based in Chennai and London) with a focus on research-led, intercultural, multi-platform graphic communication. Her practice evidences an interest in the research and design of marginalised content, endangered languages and practices in South Asia and an expertise in the design of multilingual communication.

See What’s Happening At IUC 43

For over 28 years the Internationalization & Unicode® Conference (IUC) has been the preeminent event highlighting the latest innovations and best practices of global and multilingual software providers. Join us in Santa Clara to promote your ideas and experiences working with natural languages, multicultural user interfaces, producing and supporting multinational and multilingual products, linguistic algorithms, applying internationalization across mobile and social media platforms, or advancements in relevant standards.

Join expert practitioners and industry leaders as they present detailed recommendations for businesses looking to expand to new international markets and those seeking to improve time to market and cost-efficiency of supporting existing markets. Recent conferences have provided specific advice on designing software for European countries, Latin America, China, India, Japan, Korea, the Middle East, and emerging markets.

For further information and to register, please visit the IUC website.




Wednesday, July 17, 2019

The Unicode Consortium Launches New Website in Celebration of World Emoji Day

The New Website Also Offers Emoji Enthusiasts the Chance to “Adopt a Character”

The Unicode Consortium, a nonprofit that maintains text standards to support all the world’s written languages across every device, today debuted a new look for its website. The redesigned website will make information about the emoji proposal process more easily accessible while encouraging public participation and engagement in all Unicode initiatives.

“Unicode is a global technology standard that is one of the core building blocks of the internet,” said Unicode board member Greg Welch. “Unicode has helped facilitate the work of programmers and linguists from around the world since the 1990s. But with the rise of mobile devices and public enthusiasm for emoji, we knew it was time to redesign the Unicode website to make information more easily accessible, and increase community involvement.”

Emoji were adopted into the Unicode Standard in 2010 in a move that made the characters available everywhere. Today, emoji have been used by 92% of the world’s online population. And while emoji encoding and standardization make up just one small part of the Consortium’s text standards work, the growing popularity and demand for emoji have put the organization in the international spotlight.

“We’ve been working with the Unicode Consortium for several years to open up the emoji proposals process by making it more accessible and understandable,” said Jennifer 8. Lee, co-founder of Emojination. “While I personally found the late-90s aesthetic of the developer-centric site very retro and nerd charming, the new site redesign is a reflection of Unicode’s deep desire to engage the public in its work.”

In addition to offering a clearer picture of the emoji submission and standardization process, the new website offers information about the Consortium and its mission to enable people everywhere in the world to use any language on any device.

“Emoji are just one element of our broader mission,” said Mark Davis, president and co-founder of the Unicode Consortium. “The Consortium is a team of largely volunteers who are dedicated to ensuring that people all over the world can use their language of choice in digital communication across any computer, phone or other device. From English and Chinese to Cherokee, Hindi and Rohingya, the Consortium is committed to preserving every language for the digital era.”

A team of designers from Adobe provided design and branding support, as well as free access to leading design tools, to bring Unicode’s new website to life.

“The Unicode Consortium’s work to keep digitally disadvantaged languages alive is incredibly important,” said Adobe Design Program Manager Lisa Pedee. “We collaborated closely with the Consortium to develop a unique visual brand and streamlined web interface that makes everything from contributing language data to proposing an emoji more accessible, inclusive and user-friendly.”

The Consortium’s recent language work includes adding language data for Cherokee, encoding the Hanifi Rohingya script, and developing the Mayan hieroglyphic script.

The Consortium invites emoji and language enthusiasts to celebrate World Emoji Day on July 17 and “Adopt a Character” to support its ongoing efforts. More than 136,000 characters are up for adoption, including new Emoji 12.0 additions such as the sloth, the sea otter, the waffle and Saturn.


Those who choose to adopt will receive a custom digital badge they can display to publicly show their support, whether on their website or social media. The Unicode Consortium is a 501(c)(3) charitable organization and “adoption fees” are tax-deductible in the U.S. Additionally, some companies may provide matching funds. Learn more and adopt your character here.

About the Unicode Consortium
The Unicode Consortium is a nonprofit on a mission to enable anyone to use any language across every device, globally. The Consortium develops, extends and promotes the use of the Unicode Standard, freely-available specifications and data that form the foundation for software internationalization in all major operating systems, search applications and the web.

The Unicode Consortium is open to all and comprises individuals, companies, academic institutions and governments. Members include Adobe, Apple, Emojipedia, Facebook, Google, IBM, Microsoft, Netflix, Oracle and SAP, among others. For more information, please visit

Tuesday, July 16, 2019

Unicode Technical Committee Considers Emoji Color Mechanism

The Unicode Technical Committee (UTC) is discussing a mechanism for color changes to existing emoji characters. Such a mechanism could be used for emoji representations of a black cat or a glass of white wine, for example. The color mechanism would use the emoji color characters (including the seven colored square characters at U+1F7E5..U+1F7EB) that were added to the Unicode Standard Version 12.0 in early 2019.
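
For readers who want to inspect the colored square characters mentioned above, the following is a minimal Python sketch. It assumes the seven squares occupy the contiguous range U+1F7E5 through U+1F7EB, as assigned in Unicode 12.0. The function name colored_squares is hypothetical and chosen only for illustration. The sketch does not implement the proposed color mechanism, which this announcement does not specify; it only enumerates the characters such a mechanism would draw on.

```python
import unicodedata

# Assumption: the seven colored squares are U+1F7E5..U+1F7EB (Unicode 12.0).
FIRST_SQUARE = 0x1F7E5
LAST_SQUARE = 0x1F7EB


def colored_squares():
    """Return (code point label, character, name) for each colored square."""
    rows = []
    for cp in range(FIRST_SQUARE, LAST_SQUARE + 1):
        ch = chr(cp)
        # A Python build whose character database predates Unicode 12.0
        # will not know these names, so fall back to a placeholder.
        name = unicodedata.name(ch, "<unknown in this Unicode version>")
        rows.append((f"U+{cp:04X}", ch, name))
    return rows


if __name__ == "__main__":
    for label, ch, name in colored_squares():
        print(label, ch, name)
```

Whether the names resolve depends on the Unicode version bundled with the Python interpreter; the code points themselves are the same either way.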

Emoji color mechanisms could potentially be defined as part of Unicode Emoji 13.0. The topic will be discussed at the upcoming July UTC meeting. Specific proposals for new colored emoji characters will not be taken up until the fundamental color mechanism has been established.

For more information, see the Working Draft for Proposed Update UTS #51: Unicode Emoji, section 2.9 “Color”.

The Unicode Standard is the foundation for all modern software and communications around the world, including operating systems, browsers, laptops, and smart phones—plus the Internet and Web (URLs, HTML, XML, CSS, JSON, etc.). The Unicode Standard and its associated standards and data form the foundation for CLDR and ICU releases.



Wednesday, June 12, 2019

Unicode 12.0 Paperback Available

The Unicode 12.0 core specification is now available in paperback book form with a new, original cover design by Monica Tang. This edition consists of a pair of modestly priced print-on-demand volumes containing the complete text of the core specification of Version 12.0 of the Unicode Standard.

Each of the two volumes is a compact 6×9 inch US trade paperback size. The two volumes may be purchased separately or together, although they are intended as a set. The cost for the pair is US $23.46, plus shipping and taxes (if applicable). Please visit the description page to order.

Note that these volumes do not include the Version 12.0 code charts, nor do they include the Version 12.0 Standard Annexes and Unicode Character Database, which are all freely available on the Unicode website.

Purchase The Unicode Standard, Version 12.0 - Core Specification
