Hi... I created a task to extract the Dublin Core properties from an image and a HTML code similar to the org.exoplatform.services.document.impl.MSExcelDocumentReader . I have a patch for it.  Can be it useful?

The task is:

https://jira.jboss.org/jira/browse/EXOJCR-624