Thursday, July 19, 2012

Version 15 of UTS #18, Unicode Regular Expressions has been released by the Unicode Consortium. Regular expressions are used throughout much of the world's software for matching and manipulating text. UTS #18 provides the foundation for the handling of Unicode text in those expressions.

Version 15 is a major revision. Changes include:
  • Conformance clauses dealing with non 1:1 equivalences were either retracted or modified.
  • A Level 2 conformance clause for full properties was added.
  • New properties, including Name_Alias matching and Script_Extensions, were added.
  • A recommended compact form of Unicode escapes was added: \u{...}.
  • There were many clarifications of the text. See http://www.unicode.org/reports/tr18/tr18-15.html