Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Bulkload or hbase API


Copy link to this message
-
Re: Bulkload or hbase API
Hello Lashing,

MapReduce would be great :

Each mapper addresses a different MySQL DB and "TableOutputFormat" to the
corresponding HTable.

maybe pig : UNION after LOAD on different MySQL DB and then STORE on the
différent table according to your policy (may need several M/R jobs all
managed by pig workflow).

The more efficient (1 job) would be pure home made Java MapReduce (mapper
only for each MySQL DB bulk loading on HTables)

Cheers,

--
Damien HARDY
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB