Could you provide the full stacktrace? If I understood correctly this you are using Tika in conjunction in the mass indexer, is that correct? I would expect that failures should go via the ErrorHandler which would allow you to handle failing documents.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira