Tuesday, September 8, 2009

[Unicode Announcement] Unicode Collation Algorithm 5.2.0 Beta Data Files Now Available

Version 5.2.0 of the Unicode Collation Algorithm (UCA) is being prepared
for release in parallel with Unicode 5.2. The UCA data files have
recently been updated and are ready for review. Please see the Public
Review Issue:
http://www.unicode.org/review/#pri143
as well as the beta data files and collation test files:
http://www.unicode.org/Public/UCA/5.2.0/

1. The data files contain weights for all newly assigned characters.
   a. There have been significant changes to the ordering of
      many combining marks. Many of those that are not in customary
      use in modern languages now have the same secondary weight,
      and will only be distinguished on a fourth level, by code
      point ordering.
   b. The ordering for Tamil and Malayalam has been improved,
      but would still need tailoring for the Tamil and Malayalam
      languages.
2. The text of UTS #10 has been updated. See the
   modifications section for details:
   http://www.unicode.org/reports/tr10/tr10-19.html#Modifications
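
For reviewers who want a concrete picture of the change described in 1a, the
following is a minimal Python sketch of a multi-level sort key with a final
code point tie-breaker. It is only an illustration: the table TOY_WEIGHTS, the
function sort_key, and every weight value are invented here and are not taken
from the UCA 5.2.0 data files. The choice of U+0351 and U+0357 as the "rarely
used" marks is likewise an assumption made for the example. The fourth level
is simplified to raw code points of the input string.

```python
# Toy collation elements: (primary, secondary, tertiary).
# ASSUMPTION: all weights below are made up for illustration only.
TOY_WEIGHTS = {
    "a": (0x10, 0x20, 0x02),
    "b": (0x11, 0x20, 0x02),
    "\u0301": (0, 0x32, 0x02),  # common mark: keeps its own secondary weight
    "\u0351": (0, 0x40, 0x02),  # rarely used mark: shared secondary weight
    "\u0357": (0, 0x40, 0x02),  # rarely used mark: same shared weight
}


def sort_key(text):
    """Build a four-level key: primary, secondary, tertiary, code points."""
    elements = [TOY_WEIGHTS[ch] for ch in text]
    levels = []
    for level in range(3):
        # Collect the weights of one level across the whole string,
        # skipping zero (ignorable) weights.
        levels.append(tuple(ce[level] for ce in elements if ce[level] != 0))
    # Fourth level: code point order, used only when levels 1-3 are equal.
    levels.append(tuple(ord(ch) for ch in text))
    return tuple(levels)


words = ["a\u0357", "b", "a\u0351", "a\u0301", "a"]
ordered = sorted(words, key=sort_key)
```

In this sketch the strings "a" + U+0351 and "a" + U+0357 have identical keys
on the first three levels, so their relative order is decided only by the
code point level, while "a" + U+0301 is still separated from them on the
secondary level.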

Time is very short for this beta review, which closes on September 23,
2009, so reviewers are urged to download and test the files as soon as
they can.

Feedback should be sent through the usual Error Reporting Form:
http://www.unicode.org/reporting.html

