[teiid-issues] [JBoss JIRA] (TEIID-2289) Create quickstart to demonstrate using Hive/Hadoop

Van Halbert (JIRA) jira-events at lists.jboss.org
Fri Nov 2 10:10:18 EDT 2012


    [ https://issues.jboss.org/browse/TEIID-2289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731226#comment-12731226 ] 

Van Halbert commented on TEIID-2289:
------------------------------------

Instructions to use:

This is a how-to for connecting to Hadoop through Teiid and issuing some queries against data files.

    Download Hadoop [1] and Hive [2] 
    Create a file 'ahoj.txt' with several lines, one letter on each line.
    Configure Hadoop for pseudo distributed operation (see [3]). Do the steps up to and including 'bin/start-all.sh'.
    In Hadoop bin issue:    hadoop fs -mkdir myfiles 
    In Hadoop bin issue:    hadoop fs -put ahoj.txt myfiles/a
    Put Hadoop/bin on your PATH and set HADOOP_HOME, then start Hive in this console (./hive)
    CREATE EXTERNAL TABLE myfiles(col STRING)
        ROW FORMAT DELIMITED FIELDS TERMINATED BY '44' LINES TERMINATED BY '\n'
        STORED AS TEXTFILE
        LOCATION '/user/fnguyen/myfiles';
    Check that the table from (8) works by running "FROM myfiles mf SELECT mf.col"
    Start the Hive JDBC interface by HIVE_HOME/bin/hive --service hiveserver
    Copy libraries HADOOP_HOME/lib, HIVE_HOME/lib and HADOOP_HOME/hive*jar to your production profile 'lib' directory
    Create Teiid user admin with admin role and password 'teiid'
    Deploy datasource (copy hive-ds.xml from soa-p-5.3.0.GA/eds/teiid/examples/jca).
    Deploy VDB (in the attachment)
    Start SOA-P. There should be no errors, and the VDB should become ACTIVE
    Now you can start the Teiid adminshell and test from there
    In adminshell issue 'sql = connect("jdbc:teiid:Hadoop@mm://127.0.0.1:31000","admin","teiid");'
    Then 'def rows = sql.rows("select * from myfiles;")'
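
The sample-file and HDFS steps above could be scripted roughly as follows. This is only a sketch: it assumes 'hadoop' is already on your PATH and that Hadoop is running in pseudo-distributed mode, and the letters written to ahoj.txt are arbitrary. Because 'myfiles' is a relative HDFS path, it resolves under /user/<your user name>, so the LOCATION in the CREATE EXTERNAL TABLE statement has to name that same directory (the '/user/fnguyen/myfiles' shown above is for the user 'fnguyen').

```shell
#!/bin/sh
# Sketch of the sample-file and HDFS steps; assumes Hadoop is already running.
set -eu

# One letter per line, as the instructions describe (the letters are arbitrary).
printf '%s\n' a h o j > ahoj.txt

if command -v hadoop >/dev/null 2>&1; then
    # Relative HDFS paths resolve under /user/<your user name>.
    hadoop fs -mkdir myfiles
    hadoop fs -put ahoj.txt myfiles/a
    # List the directory to confirm the upload.
    hadoop fs -ls myfiles
else
    echo "hadoop not found on PATH; skipping HDFS upload" >&2
fi
```

If 'hadoop' is not on the PATH, the script only creates the local file and prints a note, so it is safe to try before the Hadoop setup is complete.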

Steps (8) and (14) are the critical points and must work. If any problems arise, contact me.
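
Before moving on to the Teiid side, the Hive table check can also be run non-interactively. The helper below is a hypothetical sketch (the function name 'check_hive_table' is not part of Hive or Teiid); it assumes 'hive' is on your PATH and that the 'myfiles' table was created as described above.

```shell
#!/bin/sh
# Hypothetical helper: runs the test query from the instructions through the Hive CLI.
check_hive_table() {
    if ! command -v hive >/dev/null 2>&1; then
        echo "hive not found on PATH" >&2
        return 2
    fi
    # -S suppresses progress output; -e runs a single query string and exits.
    hive -S -e 'FROM myfiles mf SELECT mf.col'
}
```

Calling 'check_hive_table' should print one row for each line of ahoj.txt. If it prints nothing or fails, the table definition or its LOCATION is the first thing to recheck.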

[1] wget http://www.mirrorservice.org/sites/ftp.apache.org/hadoop/common/hadoop-1.0.3/hadoop-1.0.3.tar.gz
[2] wget http://apache.mirror.anlx.net/hive/stable/hive-0.8.1-bin.tar.gz
[3] http://hadoop.apache.org/docs/r1.0.3/single_node_setup.html
                
> Create quickstart to demonstrate using Hive/Hadoop
> --------------------------------------------------
>
>                 Key: TEIID-2289
>                 URL: https://issues.jboss.org/browse/TEIID-2289
>             Project: Teiid
>          Issue Type: Sub-task
>          Components: Build/Kits
>            Reporter: Van Halbert
>            Assignee: Steven Hawkins
>
> Create a quick start that can demonstrate using Hive/Hadoop.  This will contain instructions for setting up Hadoop.


