Re: improve performance of a MapReduce job with HBase input
Alok Kumar 2012-05-25, 18:24
You can use the setCaching method of your Scan object:
Scan objScan = new Scan();
objScan.setCaching(100); // pick an integer that suits your use case
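To get a feel for why a larger caching value helps, here is a rough back-of-the-envelope sketch in plain Java (no HBase dependency). It assumes each scanner round trip returns at most `caching` rows, so the number of round trips is about rows / caching. The class name, the method name, and the row count are made up for illustration and are not part of any HBase API.

```java
public class ScanCachingEstimate {

    // Approximate number of scanner round trips needed to read rowCount rows
    // when each round trip returns at most `caching` rows.
    // This is an illustrative estimate, not an HBase API call.
    static long roundTrips(long rowCount, int caching) {
        if (caching <= 0) {
            throw new IllegalArgumentException("caching must be positive");
        }
        return (rowCount + caching - 1) / caching; // ceiling division
    }

    public static void main(String[] args) {
        long rows = 1_000_000L; // hypothetical number of rows scanned by the job
        for (int caching : new int[] {1, 100, 500}) {
            System.out.println("caching=" + caching
                    + " -> about " + roundTrips(rows, caching) + " round trips");
        }
    }
}
```

Larger values are not free: each batch takes more memory on the client and the region server, and a mapper that spends a long time on one batch may run into scanner timeouts, so it is worth testing a few values. In a MapReduce job, a Scan configured this way is usually handed to TableInputFormat through TableMapReduceUtil.initTableMapperJob; please check the API of your HBase version.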
On Fri, May 25, 2012 at 11:33 PM, Ey-Chih chow <[EMAIL PROTECTED]> wrote:
> We have a MapReduce job whose input data comes from HBase. We would like
> to improve the performance of the job. According to the HBase book, we can
> do that by setting scan caching to a number higher than the default. We use
> TableInputFormat to read the data for the job. I looked at the
> implementation of the class; it does not set caching when the Scan object
> is created. Does anybody know how to externally set caching for the Scan
> created in TableInputFormat? Thanks.
> Ey-Chih Chow