Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Question about the time to execute joins in HBase!


Copy link to this message
-
Re: Question about the time to execute joins in HBase!
QQ what is your caching set to?
On Aug 22, 2013 11:25 AM, "Pavan Sudheendra" <[EMAIL PROTECTED]> wrote:

> Hi all,
>
> A serious question.. I know this isn't one of the best hbase practices but
> I really want to know..
>
> I am doing a join across 3 table in hbase.. One table contain 19m records,
> one contains 2m and another contains 1m records.
>
> I'm doing this inside the mapper function.. I know this can be done with
> pig and hive etc. Leaving the specifics out, how long would experts think
> it would take for the mapper to finish aggregating them across a 6 node
> cluster.. One is the job tracker and 5 are task trackers.. By the time I
> see the map reduce job status for input records reach 600,000 it's taking
> an hour.. It can't be right..
>
> Any tips? Please help.
>
> Thanks.
>
> --
> Regards-
> Pavan
>