As long as the smaller table size is in the range of a few MBs. It is a good candidate for map join.
If the smaller table size is still more then you can take a look at bucketed map joins.
Sent from remote device, Please excuse typos
From: Santosh Achhra <[EMAIL PROTECTED]>
Date: Wed, 9 Jan 2013 00:11:37
To: <[EMAIL PROTECTED]>
Reply-To: [EMAIL PROTECTED]
Subject: Re: Map Reduce Local Task
Thank you Dean,
One of our table is very small, it has only 16,000 rows and other big table
has 45 million plus records. Wont doing a loacl task help in this case ?
Good wishes,always !
On Tue, Jan 8, 2013 at 11:59 PM, Dean Wampler <
[EMAIL PROTECTED]> wrote:
> more aggressive about trying to convert a join to a local task, where it
> bypasses the job tracker. When you're experimenting with queries on a small
> data set, it can make things much faster, but won't be useful for large
> data sets where you need the cluster.