Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
HBase >> mail # user >> Bulkload or hbase API

Lashing 2013-03-14, 16:27
Tariq 2013-03-14, 16:31
Lashing 2013-03-14, 16:54
Copy link to this message
Re: Bulkload or hbase API
Hello Lashing,

MapReduce would be great :

Each mapper addresses a different MySQL DB and "TableOutputFormat" to the
corresponding HTable.

maybe pig : UNION after LOAD on different MySQL DB and then STORE on the
différent table according to your policy (may need several M/R jobs all
managed by pig workflow).

The more efficient (1 job) would be pure home made Java MapReduce (mapper
only for each MySQL DB bulk loading on HTables)


Damien HARDY
Lashing 2013-03-14, 16:51
Damien Hardy 2013-03-14, 17:04
Lashing 2013-03-15, 16:00