UTF-8 charset encoding update
May 20th, 2009
The UTF-8 encoding is easy to abuse. Or rather, people sometimes use it in ways a decoder does not expect.
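Recently the Java platform received an update that rejects a class of malformed UTF-8 byte sequences known as “non-shortest form”: encodings that spend more bytes on a character than it needs, such as the two bytes C0 AF standing in for the single byte 2F (“/”). You can learn more about this fix and its implications for you in the article Overhauling the Java UTF-8 Charset.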
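To see what a non-shortest form sequence looks like in practice, here is a minimal sketch that asks the JDK's UTF-8 decoder whether a byte array is well formed. The class name NonShortestFormDemo and the helper isWellFormedUtf8 are made up for this illustration and do not come from the article; the sketch assumes Java 7 or later (for StandardCharsets) and a JDK that includes the fix.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of the JDK or the referenced article.
class NonShortestFormDemo {

    // Returns true if the strict JDK decoder accepts the bytes as UTF-8.
    static boolean isWellFormedUtf8(byte[] bytes) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                // REPORT makes the decoder throw instead of silently
                // substituting the replacement character.
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // "/" in its shortest (and only legal) form: one byte.
        byte[] shortest = {0x2F};
        // The same code point padded out to two bytes: non-shortest form.
        byte[] overlong = {(byte) 0xC0, (byte) 0xAF};

        System.out.println(isWellFormedUtf8(shortest)); // true
        System.out.println(isWellFormedUtf8(overlong)); // false on an updated JDK
    }
}
```

The sketch uses an explicit CharsetDecoder with REPORT because convenience paths such as new String(bytes, StandardCharsets.UTF_8) replace malformed input rather than signal it, which makes the rejection harder to observe.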