Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - MR on HDFS data inserted via HBase?


Copy link to this message
-
MR on HDFS data inserted via HBase?
Otis Gospodnetic 2010-01-14, 04:06
Hello,

If I import data into HBase, can I still run a hand-written MapReduce job over that data in HDFS?
That is, not using TableInputFormat to read the data back out via HBase.

Similarly, can one run Hive or Pig scripts against that data, but again, without Hive or Pig reading the data via HBase, but rather getting to it directly via HDFS?  I'm asking because I'm wondering whether storing data in HBase means I can no longer use Hive and Pig to run my ad-hoc jobs.

Thanks,
Otis
--
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch