HBase, mail # user - One Region Server fails - all M/R jobs crash.


Re: One Region Server fails - all M/R jobs crash.
David Koch 2013-11-22, 17:35
Here you go:

Task log: http://pastebin.com/VePTLHEk
Region Server log: http://pastebin.com/iu8y0VYL
On Fri, Nov 22, 2013 at 6:27 PM, Ted Yu <[EMAIL PROTECTED]> wrote:

> Attachment didn't go through.
>
> Can you pastebin their contents ?
>
> Thanks
>
> On Nov 23, 2013, at 12:55 AM, David Koch <[EMAIL PROTECTED]> wrote:
>
> > Sorry for the previous message. I attach the required log files.
> >
> > Regards,
> >
> > David
> >
> >
> > On Fri, Nov 22, 2013 at 5:53 PM, David Koch <[EMAIL PROTECTED]> wrote:
> >>
> >>
> >>
> >> On Fri, Nov 22, 2013 at 4:17 PM, Ted Yu <[EMAIL PROTECTED]> wrote:
> >>> Can you pastebin snippet of:
> >>> 1. task logs which show failure
> >>> 2. region server log shortly before the crash
> >>>
> >>> Thanks
> >>>
> >>>
> >>> On Fri, Nov 22, 2013 at 7:14 AM, David Koch <[EMAIL PROTECTED]> wrote:
> >>>
> >>> > Hello,
> >>> >
> >>> > We experience reliability problems when running M/R jobs over HBase
> >>> > tables. Specifically, it suffices for one Region Server to crash in
> >>> > order to fail all M/R jobs.
> >>> >
> >>> > My guess is that this is not normal with a replication factor of 3.
> >>> >
> >>> > The HBase version is 0.94.6, installed as part of Cloudera 4.4. HBase
> >>> > settings are pre-sets. Cluster size is 30 machines.
> >>> >
> >>> > What steps can I follow to improve the situation?
> >>> >
> >>> > Thank you,
> >>> >
> >>> > /David
> >>> >
> >
>