Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Question about the time to execute joins in HBase!


Copy link to this message
-
Re: Question about the time to execute joins in HBase!
QQ what is your caching set to?
On Aug 22, 2013 11:25 AM, "Pavan Sudheendra" <[EMAIL PROTECTED]> wrote:

> Hi all,
>
> A serious question.. I know this isn't one of the best hbase practices but
> I really want to know..
>
> I am doing a join across 3 table in hbase.. One table contain 19m records,
> one contains 2m and another contains 1m records.
>
> I'm doing this inside the mapper function.. I know this can be done with
> pig and hive etc. Leaving the specifics out, how long would experts think
> it would take for the mapper to finish aggregating them across a 6 node
> cluster.. One is the job tracker and 5 are task trackers.. By the time I
> see the map reduce job status for input records reach 600,000 it's taking
> an hour.. It can't be right..
>
> Any tips? Please help.
>
> Thanks.
>
> --
> Regards-
> Pavan
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB