Monthly Archives: May 2009

UTF-8 charset encoding update

The UTF-8 encoding is easy to abuse in some ways. Or rather, sometimes people use it in unexpected ways.

Recently the Java platform received an update to reject one malformed UTF-8 encoding sequence called “non-shortest form.” You can learn more about this fix and its implications for you in the article Overhauling the Java UTF-8 Charset.