Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Hive >> mail # user >> Hive HBase integeration use case

G.S.Vijay raajaa 2013-02-01, 13:25
Copy link to this message
RE: Hive HBase integeration use case
You may want to try by creating UDF/UDTF hive function

From: G.S.Vijay raajaa [[EMAIL PROTECTED]]
Sent: 01 February 2013 18:55
Subject: Hive HBase integeration use case


       I would like to have HBase as a data storage and use Hive for data warehousing.

The issue with the integration is,

The HBase table is composed of composite keys following the below structure:
HBase ROWKEY:  Hash(customer_id)+customer_id+time+event_id


column: usage : value

The structure of Hbase rowkey makes every entry a distinct one.

Is it possible to split the rowkey and map them as columns of the hive table as explained below:

I am trying to create a hive table with the following column structure:


This will enable me aggregate data by grouping the column ( time or event_id)??

Any thoughts on the same??

If there isn't a direct handling by hive HBase integration, can you suggest any other means??

Vijay Raajaa G S

NOTE: This message may contain information that is confidential, proprietary, privileged or otherwise protected by law. The message is intended solely for the named addressee. If received in error, please destroy and notify the sender. Any use of this email is prohibited when received in error. Impetus does not represent, warrant and/or guarantee, that the integrity of this communication has been maintained nor that the communication is free of errors, virus, interception or interference.