Eran Kutner 2011-03-02, 14:40
Hi, I'm trying to copy data from an older cluster running 0.89 (CDH3b3) to a new one running 0.91 (CDH3b4) with the CopyTable MR job, but it always hangs at "map 0% reduce 0%" until Hadoop eventually kills the job for not responding after 600 seconds. I verified that it works fine when copying from one table to another on the same cluster, and that the servers in the source cluster have network access to those in the destination cluster.
Any idea what could be causing it?
-eran
-
Re: CopyTable MR job hangs
Eran Kutner 2011-03-15, 08:11
No idea anyone?
-eran
-
Re: CopyTable MR job hangs
Lars George 2011-03-15, 14:15
Hi Eran,
We need more details. It sounds like an issue with the ZooKeeper quorum, in other words the job cannot connect to the ZK servers. This often shows up in the task failure logs as attempts to connect to localhost. Could you grab more logs and upload them to pastebin or some such?
Lars
-
Re: CopyTable MR job hangs
Jean-Daniel Cryans 2011-03-15, 17:10
Strangely enough I did answer that question the day you sent it but it doesn't show up on the mailing list aggregators even tho gmail marks it as sent... anyways here's what I said:
It won't work because those versions aren't wire-compatible.
What you can do instead is run an Export, distcp the files, then run an Import. If the Hadoop versions are different, use the hftp interface as the distcp documentation recommends.
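For anyone finding this thread later, here is a rough sketch of that Export / distcp / Import sequence, written as a small Python helper that only assembles the three command lines and does not run anything. The table name, export directory, host names and ports are made-up placeholders, not values from this thread. It assumes the stock org.apache.hadoop.hbase.mapreduce.Export and Import drivers, that distcp is launched on the destination cluster (hftp is read-only), and that the target table has already been created there.

```python
import shlex


def build_migration_commands(table, export_dir, src_namenode_http, dst_namenode):
    """Return the Export / distcp / Import commands as argv lists.

    All arguments are hypothetical placeholders:
      table             -- HBase table name (same on both clusters)
      export_dir        -- HDFS directory holding the exported files
      src_namenode_http -- host:port of the source NameNode HTTP interface
      dst_namenode      -- host:port of the destination NameNode RPC interface
    """
    src_uri = "hftp://" + src_namenode_http + export_dir
    dst_uri = "hdfs://" + dst_namenode + export_dir
    return [
        # 1. On the source cluster: dump the table into files under export_dir.
        ["hbase", "org.apache.hadoop.hbase.mapreduce.Export", table, export_dir],
        # 2. On the destination cluster: pull the files over read-only hftp.
        ["hadoop", "distcp", src_uri, dst_uri],
        # 3. On the destination cluster: load the files into the existing table.
        ["hbase", "org.apache.hadoop.hbase.mapreduce.Import", table, export_dir],
    ]


if __name__ == "__main__":
    # Placeholder values for illustration only.
    commands = build_migration_commands(
        "mytable",
        "/export/mytable",
        "old-nn.example.com:50070",
        "new-nn.example.com:8020",
    )
    for argv in commands:
        print(" ".join(shlex.quote(arg) for arg in argv))
```

Printing the commands instead of executing them makes it easy to review each step and run it by hand on the right cluster.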
J-D
-
Re: CopyTable MR job hangs
Eran Kutner 2011-03-16, 13:12
Double thanks (one for each reply) J-D, I'll use distcp as you suggest.
-eran