Steven Hawkins reassigned TEIID-3442:
-------------------------------------
Fix Version/s: Open To Community
(was: 9.x)
Assignee: (was: Steven Hawkins)
Apache Spark support via SparkSQL and DataFrames
------------------------------------------------
Key: TEIID-3442
URL: https://issues.jboss.org/browse/TEIID-3442
Project: Teiid
Issue Type: Feature Request
Components: Misc. Connectors
Affects Versions: 8.10
Reporter: John Muller
Labels: Connectors, Spark, Translators
Fix For: Open To Community
Original Estimate: 20 weeks
Remaining Estimate: 20 weeks
Soliciting comments on Apache Spark support. With the release of pandas-like
DataFrames, it is somewhat more feasible to translate directly to SparkSQL:
https://spark.apache.org/docs/latest/sql-programming-guide.html
Options in order of complexity:
1. Use the existing Hive connector / translator. Spark still uses the Hive metastore.
2. Use the Thrift JDBC driver. This is what MicroStrategy, Tableau, QlikView and others use;
it is the most rudimentary API for accessing Spark.
3. Native SparkSQL via building Spark jobs and submitting them to a running Spark driver.
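
As a rough illustration of option 2, here is a minimal sketch of querying Spark through its Thrift JDBC server using plain java.sql. It assumes the Thrift server is running and that a HiveServer2-compatible JDBC driver (for example the Hive JDBC driver) is on the classpath. The class name SparkThriftSketch, its helper methods, and the host, port and database values are hypothetical and chosen only for illustration; this is not an existing Teiid translator.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of option 2: talking to Spark via its Thrift JDBC server.
class SparkThriftSketch {

    // Builds a HiveServer2-style JDBC URL. The Spark Thrift server speaks the
    // HiveServer2 protocol, so the jdbc:hive2:// scheme is assumed here.
    static String thriftUrl(String host, int port, String database) {
        return "jdbc:hive2://" + host + ":" + port + "/" + database;
    }

    // Runs a query and returns the first column of every row as a string.
    // Assumes a reachable Thrift server and a suitable JDBC driver on the classpath;
    // without them, DriverManager.getConnection throws an SQLException.
    static List<String> firstColumn(String url, String sql) throws SQLException {
        List<String> values = new ArrayList<>();
        try (Connection conn = DriverManager.getConnection(url);
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(sql)) {
            while (rs.next()) {
                values.add(rs.getString(1));
            }
        }
        return values;
    }
}
```

A caller would do something like SparkThriftSketch.firstColumn(SparkThriftSketch.thriftUrl("localhost", 10000, "default"), "SELECT 1"), where port 10000 is the usual default for the Thrift server but may differ per deployment. Because this path goes through standard JDBC, it would likely fit Teiid's existing JDBC translator model more readily than option 3.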