[teiid-issues] [JBoss JIRA] (TEIID-3693) Add data source support for reading documents (i.e., RTF, DOC, PDF)

Steven Hawkins (JIRA) issues at jboss.org
Wed Sep 9 14:40:01 EDT 2015


    [ https://issues.jboss.org/browse/TEIID-3693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107115#comment-13107115 ] 

Steven Hawkins commented on TEIID-3693:
---------------------------------------

Each doc type will need to be a separate issue, unless you are simply indicating extraction at a blob level.

> Add data source support for reading documents (i.e., RTF, DOC, PDF)
> -------------------------------------------------------------------
>
>                 Key: TEIID-3693
>                 URL: https://issues.jboss.org/browse/TEIID-3693
>             Project: Teiid
>          Issue Type: Feature Request
>          Components: Misc. Connectors
>            Reporter: Van Halbert
>            Assignee: Steven Hawkins
>
> Add data source support for reading documents (i.e., RTF, DOC, PDF).
> From a runtime perspective the extraction logic is straightforward when there is a parsing library. Here's one from CA using JSoup - https://github.com/rokhmanov/teiid-translators/blob/master/translator-scrape/src/main/java/com/rokhmanov/teiid/translator/scrape/



--
This message was sent by Atlassian JIRA
(v6.4.11#64026)

