In several documents, there are dates encoded in two slightly different ways:
<date value="1998">1998a</date>
<date value="1998a">1998a</date>
These are actually both from the same document. First, check whether this variance actually affects the output in any way; following that, pick one format and regularlize all instances.