[teiid-issues] [JBoss JIRA] (TEIID-4594) Add ability to read Parquet Files

Steven Hawkins (Jira) issues at jboss.org
Fri Jul 10 08:16:00 EDT 2020


    [ https://issues.redhat.com/browse/TEIID-4594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14213700#comment-14213700 ] 

Steven Hawkins commented on TEIID-4594:
---------------------------------------

> What do you mean by this Steven Hawkins? The first sentence

The hive metastore is a common metadata source for hive, presto, spark, etc.  The create table syntax is quite extensive: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-CreateTable - it is not expected that we would support all of the options.  Alternatively we could consider requiring/leveraging a metastore instance, similar to presto, rather than coming up with our own metadata representation.  A single source, meaning an s3 bucket or a filesystem location, could have multiple sets of parquet files, each representing a separate table.
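
To illustrate the last point, here is a rough sketch of how one filesystem location could be mapped to several tables. It is only an assumption about one possible layout convention, not an agreed design: each top-level directory under the root is treated as a table, and every *.parquet file below it (including files in nested, hive-style partition directories) belongs to that table. The class and method names (ParquetTableDiscovery, discover) are hypothetical and not part of Teiid.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of Teiid: groups Parquet files found under a
// root location into candidate tables.
class ParquetTableDiscovery {

    // Returns table name -> sorted list of Parquet files.
    // Assumed convention: the table name is the first directory below root.
    // Files sitting directly in root are skipped because they have no table directory.
    static Map<String, List<Path>> discover(Path root) throws IOException {
        try (Stream<Path> paths = Files.walk(root)) {
            return paths
                .filter(Files::isRegularFile)
                .filter(p -> p.getFileName().toString().endsWith(".parquet"))
                .filter(p -> root.relativize(p).getNameCount() > 1)
                .sorted()
                .collect(Collectors.groupingBy(
                    p -> root.relativize(p).getName(0).toString(),
                    TreeMap::new,
                    Collectors.toList()));
        }
    }
}
```

With a root containing orders/part-1.parquet, orders/year=2020/part-0.parquet and customers/part-0.parquet, this would yield two tables, customers and orders. Column metadata would still have to come from somewhere else, such as the parquet file footers or a metastore, which is the open question above.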

> Add ability to read Parquet Files
> ---------------------------------
>
>                 Key: TEIID-4594
>                 URL: https://issues.redhat.com/browse/TEIID-4594
>             Project: Teiid
>          Issue Type: Feature Request
>          Components: Misc. Connectors
>    Affects Versions: 9.2
>            Reporter: Van Halbert
>            Assignee: Steven Hawkins
>            Priority: Major
>             Fix For: 15.0
>
>
> Integration with Parquet files on Gluster is an important requirement. RADAnalytics will be accessing data from Parquet, which is a common file format for Spark.



--
This message was sent by Atlassian Jira
(v7.13.8#713008)

