Monday, 2 August 2004

Unicode Normalization in Java

Given recent discussions on the atom-syntax mailing list regarding normalization, I figured it'd be useful to point other Java folks (again) at IBM's International Components for Unicode for Java. In particular, the Normalizer. So, if you needed to do something like normalize using NFKC (Compatability decomposition followed by canonical composition), you could use the following method:
static String normalize(String src, Normalizer.Mode mode) 
with the mode of
Normalizer.Mode.NFKC 
Doesn't seem that terrible.
Posted by david at 10:44 AM in Internationalization

One strike and you're out!

Unfortunately, if you allow public signup for something like your blog or wiki or whatever, invariably you're going to get the random punter who thinks it's funny to post some off-color comment or jibberish Word document.

Although I suspect Elvis Presley could've added something to the blojsom documentation, as of this morning, Elvis has quote-unquote, left the building.

<elvis>Thank you very much!</elvis>
Posted by david at 10:29 AM in blojsom ... all blojsom
« First  « Prev   1 2 3 4 5   Next »  Last »
« August »
SunMonTueWedThuFriSat
1234567
891011121314
15161718192021
22232425262728
293031