Monday, 2 August 2004

Unicode Normalization in Java

Given recent discussions on the atom-syntax mailing list regarding normalization, I figured it'd be useful to point other Java folks (again) at IBM's International Components for Unicode for Java (ICU4J), and in particular at its Normalizer class. So, if you needed to normalize a string using NFKC (compatibility decomposition followed by canonical composition), you could use the following method:
static String normalize(String src, Normalizer.Mode mode) 
with the mode of
Normalizer.NFKC
Doesn't seem that terrible.
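To make that concrete, here's a minimal sketch of what NFKC does to a couple of strings. One loud assumption: so that it runs without any extra jars, it uses the JDK's own java.text.Normalizer (added in Java 6) as a stand-in rather than ICU4J; with ICU4J on the classpath the call would be the normalize method shown above, on com.ibm.icu.text.Normalizer. The class name NormalizationSketch and the helper nfkc are made up for illustration.

```java
import java.text.Normalizer;

// Illustrative only: NormalizationSketch and nfkc are hypothetical names.
// Uses the JDK's java.text.Normalizer as a stand-in for ICU4J's Normalizer.
final class NormalizationSketch {

    // Applies NFKC: compatibility decomposition followed by canonical composition.
    static String nfkc(String src) {
        return Normalizer.normalize(src, Normalizer.Form.NFKC);
    }

    public static void main(String[] args) {
        String ligature = "\uFB01";     // U+FB01 LATIN SMALL LIGATURE FI
        String combining = "e\u0301";   // 'e' followed by U+0301 COMBINING ACUTE ACCENT

        // The ligature is replaced by its compatibility equivalent, the two letters "fi".
        System.out.println(nfkc(ligature).equals("fi"));

        // The base letter and combining accent are composed into the single code point U+00E9.
        System.out.println(nfkc(combining).equals("\u00E9"));
    }
}
```

Both checks exercise the two halves of NFKC: the first shows the compatibility decomposition, the second the canonical composition.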
Posted by david at 10:44 AM in Internationalization

 
