Tuesday, March 6, 2012

PRI #182: Unicode Regular Expressions: new proposed update

UTS #18, Unicode Regular Expressions, provides the foundation for handling Unicode characters in regular expression engines, a key component of many programs and programming languages.

The new proposed update of this specification contains significant additions and changes: Name_Alias matching, matching rules from UAX #44, use of the new Script_Extensions property, new recommended properties, a compact form of \u{...}, alignment of rule RL1.4 with Appendix C, and the incorporation of text for PRI #179.
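
As a rough illustration of what Name_Alias matching means in practice, the sketch below resolves a character by its formal name and by a name alias and builds a pattern from the result. It assumes Python 3.3 or later, whose unicodedata.lookup accepts name aliases; the helper name resolve_name is hypothetical, and none of this is taken from the text of UTS #18 itself.

```python
import re
import unicodedata


def resolve_name(name):
    """Return the character for a Unicode name or name alias.

    Hypothetical helper for illustration; relies on unicodedata.lookup,
    which accepts name aliases in Python 3.3 and later.
    """
    return unicodedata.lookup(name)


# U+01A2 has the formal name LATIN CAPITAL LETTER OI and the
# corrected alias LATIN CAPITAL LETTER GHA.
by_name = resolve_name("LATIN CAPITAL LETTER OI")
by_alias = resolve_name("LATIN CAPITAL LETTER GHA")

# Build a pattern from the alias and check that it matches the
# character obtained through the formal name.
pattern = re.compile(re.escape(by_alias))
matched = pattern.fullmatch(by_name) is not None
```

An engine that supports Name_Alias matching would perform the equivalent lookup internally when it parses a named-character escape, so that either name selects the same code point.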

There are several review notes requesting feedback on particular issues. Please submit feedback on those and the rest of this document by May 1 for consideration at the UTC meeting starting on May 7. For details, see: