Monday, 2 August 2004
Unicode Normalization in Java
Given recent discussions on the atom-syntax mailing list regarding normalization, I figured it'd be useful to point other Java folks (again) at IBM's International Components for Unicode for Java (ICU4J), and in particular its Normalizer class. So, if you needed to normalize using NFKC (compatibility decomposition followed by canonical composition), you could use the following method:
    static String normalize(String src, Normalizer.Mode mode)

with the mode of

    Normalizer.Mode.NFKC

Doesn't seem that terrible.
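To make the effect of NFKC a bit more concrete, here is a minimal sketch. Note that it swaps in the JDK's own java.text.Normalizer (available in Java 6 and later) rather than ICU4J, purely so it runs without an extra jar; the call shape is similar but not identical to the ICU4J method above. The class name NfkcSketch and the helper nfkc are made up for illustration.

```java
import java.text.Normalizer;

// Hypothetical illustration class; not part of ICU4J or the JDK.
class NfkcSketch {

    // Assumption: uses the JDK's java.text.Normalizer, not ICU4J's Normalizer.
    // Applies compatibility decomposition followed by canonical composition.
    static String nfkc(String src) {
        return Normalizer.normalize(src, Normalizer.Form.NFKC);
    }

    public static void main(String[] args) {
        String ligature = "\uFB01";   // U+FB01, the single-character "fi" ligature
        String combining = "e\u0301"; // "e" followed by U+0301 combining acute accent

        // The compatibility step breaks the ligature into plain "f" + "i".
        System.out.println(nfkc(ligature).equals("fi"));

        // The composition step folds "e" + accent into the single character U+00E9.
        System.out.println(nfkc(combining).equals("\u00E9"));

        // The raw ligature is not already in NFKC form.
        System.out.println(Normalizer.isNormalized(ligature, Normalizer.Form.NFKC));
    }
}
```

The two strings show both halves of NFKC: the ligature is decomposed by the compatibility mapping, while the base letter plus combining accent is recomposed into one code point. With ICU4J on the classpath, the same inputs should go through its normalize method in much the same way.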
Posted by at 10:44 AM in Internationalization
