Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
HBase >> mail # user >> Parallel Scan with TableMapReduceUtil


+
Guillermo Ortiz 2014-05-15, 16:26
Copy link to this message
-
Re: Parallel Scan with TableMapReduceUtil
Hi Guillermo,

You should see as many MR tasks as you have regions in your input table.
There will be one scan per task. They will all run in parallel is you have
enough MR slots. Else, some of them will run in parallel, and the others
will wait for an available slot. HBase will try to run those tasks on the
RS the regions are. So doing on the client side using multiple thread will
have a bigger impact on the resources usage since you will have a lot of
calls between the client and all the region servers.

JM
2014-05-07 8:34 GMT-04:00 Guillermo Ortiz <[EMAIL PROTECTED]>: