The benefit of the partitioned approach is described very well in the O'Reilly book "Programming Hive". (Thanks for writing it, Edward.)
For me, being able to drop a single partition when there is any doubt about the quality of the data from just one job is a large benefit.
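As a rough illustration of dropping one job's data, here is a minimal HiveQL sketch. The table name `events`, the partition column `source`, and the value 'a' are hypothetical and do not come from this thread.

```sql
-- Hypothetical table `events`, partitioned by `source`.
-- Removes only the partition written by the suspect job;
-- the other partitions of the table are left untouched.
ALTER TABLE events DROP IF EXISTS PARTITION (source = 'a');
```

For a managed table this also deletes the partition's data, so the job can simply be rerun to reload it.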
From: Edward Capriolo [mailto:[EMAIL PROTECTED]]
Sent: Thursday, January 24, 2013 3:52 PM
To: [EMAIL PROTECTED]
Subject: Re: Loading a Hive table simultaneously from 2 different sources
Partition the table and load the data into different partitions. Alternatively, build the data outside the table and then use scripting to move it in, either with LOAD DATA INPATH or by copying the files.
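A minimal HiveQL sketch of the partitioned approach might look like the following. The table name, columns, partition column, and staging paths are all hypothetical and chosen only for illustration. Whether the two loads can really run at the same time depends on the Hive version and its locking configuration.

```sql
-- Hypothetical table with one partition per data source.
CREATE TABLE IF NOT EXISTS events (
  id      STRING,
  payload STRING
)
PARTITIONED BY (source STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t';

-- Job 1: moves the files staged for source A into its own partition.
LOAD DATA INPATH '/staging/source_a'
INTO TABLE events PARTITION (source = 'a');

-- Job 2: moves the files staged for source B into a different partition.
LOAD DATA INPATH '/staging/source_b'
INTO TABLE events PARTITION (source = 'b');
```

Because each job writes to its own partition directory, neither load touches the files written by the other.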
On Thu, Jan 24, 2013 at 9:44 AM, Krishnan K <[EMAIL PROTECTED]> wrote:
Could you please let me know what would happen if we try to load a table from 2 different sources at the same time?
I tried this earlier: one load job failed with an error, while the other job loaded the data into the table successfully.
I guess it was because of a lock acquired on the table by the first load process.
Is there any way to handle this?
Please give your insights.