(Note: Apache UIMA now requires Java 1.5, so this
issue is moot.)
The XML serialization support in Sun Java 1.4.2_12
does not write carriage return (CR) characters to
XML. As a result, if the document text contains CR
characters, XCAS or XMI serialization drops them,
and the annotation offsets no longer line up with
the text. This shows up in the DocumentAnalyzer,
where the highlighting is misplaced if the input
document contains CR characters.
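
The offset shift can be illustrated with a small
stand-alone sketch. It does not use the UIMA API or
the Java XML serializer; the class CrOffsetDemo and
the helper dropCarriageReturns are hypothetical names
that only simulate a serializer losing CR characters,
and the sample text is made up for illustration.

```java
class CrOffsetDemo {

    // Hypothetical stand-in for a serializer that loses CR characters.
    static String dropCarriageReturns(String text) {
        return text.replace("\r", "");
    }

    public static void main(String[] args) {
        // Sample document text with a CR LF line ending (assumed input).
        String original = "first line\r\nsecond line";

        // Offsets of an annotation over the word "second",
        // computed against the original text.
        int begin = original.indexOf("second");
        int end = begin + "second".length();

        // Text as it would look after the CR has been dropped.
        String roundTripped = dropCarriageReturns(original);

        // Against the original text the offsets select the right span.
        System.out.println("[" + original.substring(begin, end) + "]");

        // Against the shortened text the same offsets are off by one.
        System.out.println("[" + roundTripped.substring(begin, end) + "]");
    }
}
```

Each dropped CR shifts every later offset by one
character, so in a document with many CR LF line
endings the error grows toward the end of the text.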