(Bug?) XMLStreamException: An invalid XML character
I'm trying to parse a wikipedia dump file, and so one should expect 'strange' character sets within the file.
I'm getting the following exception:
[javax.xml.stream.XMLStreamException: ParseError at [row,col]:[21828,9]
Message: An invalid XML character (Unicode: 0x10339) was found in the element content of the document.]
However, 0x10339 is a valid Unicode character, see http://www.unicode.org/charts/PDF/U10330.pdf
and also according to the Java API docs, see http://java.sun.com/j2se/1.5.0/docs/api/java/lang/Character.html.
Further, java.lang.Character.isLetter( 0x10339 ) returns 'true'.