I think individual XML files are easier for contributors to understand, especially if they contain comments in cases where the metadata is meant to be mixed (XML and annotations).
Using other JVM languages for testing could be a benefit in the long run, but for now I think a "standard" approach to the problem is best.
And of course splitting up the test cases is good; a big-bang approach is too difficult to maintain.