Wednesday, July 30, 2025

Highlights from UTC Meeting #184

The Unicode Technical Committee (UTC) meeting #184 was held last week, July 22 – 24, in Redmond, Washington, hosted by Microsoft. Here are some highlights.

Finalizing Unicode 17.0

The top priority was to finalize technical decisions for Unicode 17.0 in preparation for a September 9 release. Beta feedback and a small number of new proposals were considered, and various decisions affecting Unicode 17.0 were taken. 

The most significant change from the Unicode 17.0 Beta is the removal of 44 characters, based on feedback requesting more time to review these characters and the associated proposals:

  • 09FF BENGALI LETTER SANSKRIT BA
  • 0B53 ORIYA SIGN DOT ABOVE
  • 0B54 ORIYA SIGN DOUBLE DOT ABOVE
  • 1FADD APPLE CORE
  • 40 Chisoi script characters and the Chisoi block at 16D80..16DAF

These characters have been postponed to Unicode 18.0. With this change, the total number of new characters for Unicode 17.0 will be 4,803, including CJK Extension J and four new scripts.
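
As a quick check of the figures above, the following minimal Python sketch tallies the postponed characters and computes the size of the Chisoi block from its range. The variable and function names are illustrative only and do not come from any Unicode data file.

```python
# Characters postponed from Unicode 17.0 to 18.0, as listed above.
# The names below are illustrative; they are not taken from any Unicode tool.
postponed_singletons = {
    0x09FF: "BENGALI LETTER SANSKRIT BA",
    0x0B53: "ORIYA SIGN DOT ABOVE",
    0x0B54: "ORIYA SIGN DOUBLE DOT ABOVE",
    0x1FADD: "APPLE CORE",
}
CHISOI_CHARACTERS = 40               # Chisoi script characters postponed
CHISOI_BLOCK = (0x16D80, 0x16DAF)    # first and last code point of the block


def block_size(first, last):
    """Number of code points in an inclusive range."""
    return last - first + 1


total_postponed = len(postponed_singletons) + CHISOI_CHARACTERS
print(total_postponed)               # 44
print(block_size(*CHISOI_BLOCK))     # 48
```

The block spans 48 code points, so the 40 Chisoi characters do not fill it.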

Glyph changes were also approved for 21 characters, all of which were encoded in earlier versions.

Certain character property changes were also approved. These include a change to the Word_Break property for 00B8 CEDILLA to accommodate orthographic usage for SENĆOŦEN, an indigenous language spoken in Western Canada. In relation to identifiers and security, the seven scripts added in Unicode 16.0 (Garay, Gurung Khema, Kirat Rai, Ol Onal, Sunuwar, Todhri, and Tulu-Tigalari) will be classified in UAX #31 as Excluded Scripts (Table 4), which means that these will not be included in the General Security Profile for secure identifiers.

First characters approved for Unicode 18.0

The tentative plan for new characters to be added in the next Unicode version is usually decided at the fall UTC meeting. The first approvals for Unicode 18.0, however, were decided last week at UTC #184. These include the 44 characters postponed from Unicode 17.0, mentioned above, as well as U+20CE UAE DIRHAM SIGN and 16 geometric symbols used in the manuscripts of the 17th-century polymath Gottfried Wilhelm Leibniz.

As typically happens at each UTC meeting, several code points were provisionally assigned for other new characters that will be candidates for future versions. 

For characters approved for Unicode 18.0 or provisionally assigned for future versions, see https://www.unicode.org/alloc/Pipeline.html#future.

Text Terminal Working Group progress

A temporary working group was created at UTC #175 to improve support for Unicode text in text-only terminal environments, particularly for scripts requiring advanced layout. Progress was initially hindered by changes in the availability of key participants, but the working group is now meeting regularly.

To scope the project, the working group will prioritize scripts classified in UAX #31 as Recommended. These include a number of scripts for which examples of fixed-width text have not been readily available, and the group would welcome contributions from anyone with knowledge of prior art for fixed-width Indic text.

For complete details from UTC #184, see the draft minutes.

About the Unicode Standard

The world relies on digital communications. The Unicode Standard is one of the building blocks for global digital communications, providing the encoding for more than 155,000 characters used by thousands of languages and scripts throughout the world. 


Each character (letter, diacritic, symbol, emoji, etc.) is represented by a unique numeric code and has defined property data that determine how it behaves in several text processing algorithms.
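
To make the idea of a code plus properties concrete, here is a small sketch using Python's standard `unicodedata` module to look up 00B8 CEDILLA. The `describe` helper is a hypothetical name introduced for illustration. Note that `unicodedata` reflects the Unicode version bundled with the Python interpreter, which may not be 17.0, and it exposes only a subset of properties (it does not include Word_Break, for example).

```python
import unicodedata


def describe(ch):
    """Return the code point, name, and General_Category of one character.

    Hypothetical helper for illustration; it uses only standard-library calls.
    """
    return {
        "code_point": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch, "<unnamed>"),
        "general_category": unicodedata.category(ch),
    }


info = describe("\u00B8")
print(info)
# {'code_point': 'U+00B8', 'name': 'CEDILLA', 'general_category': 'Sk'}
```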


With this combination, the Unicode Standard provides the foundation for implementations to support the world's writing systems, enabling billions of people across the globe to seamlessly communicate with one another across platforms and devices. The Standard is also the foundation for the suite of code, libraries, data, and products that the Unicode Consortium delivers for robust language support.
