HBase >> mail # user >> Bulkload or hbase API

Re: Bulkload or hbase API
Hello Lashing,

MapReduce would be a good fit:

Each mapper reads from a different MySQL DB and writes through
"TableOutputFormat" to the corresponding HTable.

Maybe Pig: LOAD from the different MySQL DBs, UNION the results, and then
STORE into the different tables according to your policy (this may need
several M/R jobs, all managed by the Pig workflow).

The most efficient option (1 job) would be a pure homemade Java MapReduce
job (map-only, with the mappers for each MySQL DB bulk loading into the
HTables).
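
To make the map-only idea concrete, here is a rough, dependency-free Java model of the data flow: one "mapper" per source DB, each turning rows into put-like records and routing them to a table chosen by a policy. Everything in it is hypothetical (the class names, the "t_" table prefix, the "d" column family), and it deliberately uses no Hadoop or HBase classes, so treat it as a sketch of the shape of the job rather than the job itself.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Plain-Java model of a map-only load. All names are hypothetical.
class MapOnlyLoadSketch {

    // Stand-in for one row read from a MySQL table.
    static final class SourceRow {
        final String id;
        final Map<String, String> columns;

        SourceRow(String id, Map<String, String> columns) {
            this.id = id;
            this.columns = columns;
        }
    }

    // Stand-in for an HBase Put: a row key plus "family:qualifier" -> value.
    static final class PutRecord {
        final String rowKey;
        final Map<String, String> cells;

        PutRecord(String rowKey, Map<String, String> cells) {
            this.rowKey = rowKey;
            this.cells = cells;
        }
    }

    // Assumed routing policy: the source DB name decides the target table.
    static String targetTable(String sourceDb) {
        return "t_" + sourceDb;
    }

    // The "map" step: one source row becomes one put-like record.
    static PutRecord map(SourceRow row) {
        Map<String, String> cells = new LinkedHashMap<>();
        for (Map.Entry<String, String> column : row.columns.entrySet()) {
            cells.put("d:" + column.getKey(), column.getValue());
        }
        return new PutRecord(row.id, cells);
    }

    // Runs each per-DB "mapper" in turn; there is no reduce step.
    static Map<String, List<PutRecord>> runJob(Map<String, List<SourceRow>> sources) {
        Map<String, List<PutRecord>> tables = new LinkedHashMap<>();
        for (Map.Entry<String, List<SourceRow>> source : sources.entrySet()) {
            String table = targetTable(source.getKey());
            List<PutRecord> puts = tables.computeIfAbsent(table, t -> new ArrayList<>());
            for (SourceRow row : source.getValue()) {
                puts.add(map(row));
            }
        }
        return tables;
    }

    public static void main(String[] args) {
        Map<String, List<SourceRow>> sources = new LinkedHashMap<>();
        sources.put("sales", List.of(new SourceRow("42", Map.of("amount", "10"))));
        sources.put("crm", List.of(new SourceRow("a", Map.of("name", "x"))));
        runJob(sources).forEach((table, puts) ->
                System.out.println(table + " <- " + puts.size() + " put(s)"));
    }
}
```

In a real job the map step would presumably emit HBase Put objects through TableOutputFormat (or MultiTableOutputFormat if a single job has to write to several tables), with the number of reduce tasks set to 0. How the rows are read from each MySQL DB is left open here.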


Damien HARDY