As already discussed on GitHub (https://github.com/hibernate/hibernate-search/pull/1634)
While updating to a more recent version of Tika, we cannot rely on the default AutoDetectParser anymore, as we need to define a custom Tika instance with a special tika-config.xml to drop a specific parser (GrobIdParser as this has HUGE dependencies ). This is effectively suggested in
https://issues.apache.org/jira/browse/TIKA-2243
To reflect this customization in the TikaBridge as well, some additional handle for this is required the AutoDetectParser is hard-wired in TikaBridge.
|
|