Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> HBase => replication => Hive

Copy link to this message
HBase => replication => Hive

Since HBase has a mechanism to replicate edit logs to another HBase cluster, I
was wondering if people think it would be possible to implement HBase=>Hive
replication? (and really make the destination pluggable later on)

I'm asking because while one can integrate Hive and HBase by creating external
tables in Hive that actually point to tables in HBase, apparently Hive queries
run about x5 slower than queries that go against normal Hive tables.

And because all HBase export options are for 1 table at a time and not point in
time snapshots of the whole table, exporting data from HBase and importing into
Hive doesn't sound like a viable option.

Sematext :: http://sematext.com/ :: Solr - Lucene - Hadoop
Hadoop ecosystem search :: http://search-hadoop.com/