HBase >> mail # user >> Bulkload or hbase API


Lashing 2013-03-14, 16:27
Tariq 2013-03-14, 16:31
Lashing 2013-03-14, 16:54
Re: Bulkload or hbase API
Hello Lashing,

MapReduce would be a good fit:

Each mapper reads from a different MySQL DB and writes through
"TableOutputFormat" to the corresponding HTable.

Or maybe Pig: UNION after LOAD from the different MySQL DBs, then STORE into
the different tables according to your policy (this may need several M/R
jobs, all managed by the Pig workflow).

The most efficient option (1 job) would be a pure home-made Java MapReduce
job (map-only, with each mapper bulk loading one MySQL DB into the HTables).
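
To make the per-row work of such a mapper concrete, here is a minimal,
dependency-free sketch in plain Java. It is not HBase or Hadoop API: the class
name, the "PendingPut" stand-in for an HBase Put, the DB-to-table policy, and
the "d" column family are all hypothetical names chosen for illustration. In a
real job the same logic would sit inside map() and emit real Put objects
through "TableOutputFormat".

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch of the routing step only; no Hadoop/HBase classes are used.
final class MysqlToHBaseRouting {

    // Assumed policy: which HTable receives the rows of which MySQL DB.
    static final Map<String, String> TABLE_FOR_DB = new LinkedHashMap<>();
    static {
        TABLE_FOR_DB.put("db_orders", "orders");
        TABLE_FOR_DB.put("db_users", "users");
    }

    // Stand-in for an HBase Put: target table, row key, and
    // "family:qualifier" -> value cells.
    static final class PendingPut {
        final String table;
        final byte[] rowKey;
        final Map<String, byte[]> cells = new LinkedHashMap<>();

        PendingPut(String table, byte[] rowKey) {
            this.table = table;
            this.rowKey = rowKey;
        }
    }

    // What one map() call would do for one MySQL row.
    static PendingPut toPut(String sourceDb, String primaryKey,
                            Map<String, String> columns) {
        String table = TABLE_FOR_DB.get(sourceDb);
        if (table == null) {
            throw new IllegalArgumentException("No target table for " + sourceDb);
        }
        PendingPut put = new PendingPut(
                table, primaryKey.getBytes(StandardCharsets.UTF_8));
        for (Map.Entry<String, String> e : columns.entrySet()) {
            if (e.getValue() == null) {
                continue; // assumption: a SQL NULL is represented by writing no cell
            }
            put.cells.put("d:" + e.getKey(),
                    e.getValue().getBytes(StandardCharsets.UTF_8));
        }
        return put;
    }

    public static void main(String[] args) {
        Map<String, String> row = new LinkedHashMap<>();
        row.put("name", "alice");
        row.put("email", null);
        PendingPut put = toPut("db_users", "42", row);
        System.out.println(put.table + " "
                + new String(put.rowKey, StandardCharsets.UTF_8) + " "
                + put.cells.keySet());
    }
}
```

Keeping the policy in one lookup table means a single map-only job can serve
every MySQL DB, which is the point of the 1-job approach.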

Cheers,

--
Damien HARDY
Lashing 2013-03-14, 16:51
Damien Hardy 2013-03-14, 17:04
Lashing 2013-03-15, 16:00