|
Devaraj Das
2012-08-07, 06:12
Stack
2012-08-07, 07:23
Devaraj Das
2012-08-08, 16:05
Stack
2012-08-16, 15:56
Shrijeet Paliwal
2012-08-17, 18:16
Stack
2012-08-17, 18:40
Devaraj Das
2012-08-21, 07:25
Stack
2012-08-24, 18:19
Shrijeet Paliwal
2012-08-24, 18:57
Stack
2012-08-24, 19:28
Dave Wang
2012-08-24, 19:55
Shrijeet Paliwal
2012-08-24, 20:52
Shrijeet Paliwal
2012-08-24, 20:58
Shrijeet Paliwal
2012-08-24, 21:02
Stack
2012-08-24, 21:14
Devaraj Das
2012-08-24, 21:15
Stack
2012-08-24, 21:20
Stack
2012-08-24, 21:25
Shrijeet Paliwal
2012-08-24, 21:27
Ted Yu
2012-08-28, 18:36
Ted Yu
2012-08-29, 18:03
Stack
2012-08-30, 06:14
Ted Yu
2012-08-30, 20:52
Dave Wang
2012-08-30, 20:55
Ted Yu
2012-08-30, 21:07
N Keywal
2012-08-30, 21:18
Ted Yu
2012-08-30, 21:37
Dave Wang
2012-08-30, 22:14
Ted Yu
2012-08-31, 15:48
Stack
2012-08-31, 17:10
Ted Yu
2012-08-31, 17:26
Stack
2012-08-31, 23:43
Stack
2012-09-01, 06:02
|
-
[Discuss] Release 0.92.2?Devaraj Das 2012-08-07, 06:12
Folks, since the last release from the 0.92 branch, there have been quite a few fixes (131) on the branch. Should we consider making a 0.92.2 release at this point?
Thanks, Devaraj.
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-07, 07:23
On Tue, Aug 7, 2012 at 7:12 AM, Devaraj Das <[EMAIL PROTECTED]> wrote:
> Folks, since the last release from the 0.92 branch, there have been quite a few fixes (131) on the branch. Should we consider making a 0.92.2 release at this point? > Yes. I tried to put up an RC a while back. Will have another go at it... St.Ack
-
Re: [Discuss] Release 0.92.2?Devaraj Das 2012-08-08, 16:05
Thanks, Stack. I can spend some time in the testing of the RC.
Devaraj. On Aug 7, 2012, at 12:23 AM, Stack wrote: > On Tue, Aug 7, 2012 at 7:12 AM, Devaraj Das <[EMAIL PROTECTED]> wrote: >> Folks, since the last release from the 0.92 branch, there have been quite a few fixes (131) on the branch. Should we consider making a 0.92.2 release at this point? >> > > Yes. I tried to put up an RC a while back. Will have another go at it... > > St.Ack
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-16, 15:56
On Wed, Aug 8, 2012 at 9:05 AM, Devaraj Das <[EMAIL PROTECTED]> wrote:
> Thanks, Stack. I can spend some time in the testing of the RC. > FYI, I'm running builds up on jenkins of current 0.92 trunk. I'm finding a few tests are flakey. Looking into it. Will cut an RC after I get things to settle some. St.Ack
-
Re: [Discuss] Release 0.92.2?Shrijeet Paliwal 2012-08-17, 18:16
Hi Stack ,
Should I hold my breath ? Let me know if you need a hand. Looking forward towards 0.92.2 RC. -Shrijeet On Thu, Aug 16, 2012 at 8:56 AM, Stack <[EMAIL PROTECTED]> wrote: > On Wed, Aug 8, 2012 at 9:05 AM, Devaraj Das <[EMAIL PROTECTED]> wrote: >> Thanks, Stack. I can spend some time in the testing of the RC. >> > > FYI, I'm running builds up on jenkins of current 0.92 trunk. I'm > finding a few tests are flakey. Looking into it. Will cut an RC > after I get things to settle some. > > St.Ack
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-17, 18:40
On Fri, Aug 17, 2012 at 11:16 AM, Shrijeet Paliwal
<[EMAIL PROTECTED]> wrote: > Hi Stack , > Should I hold my breath ? Let me know if you need a hand. Looking > forward towards 0.92.2 RC. > We have a couple of flakey tests. See https://builds.apache.org/job/HBase-0.92/. They fail the odd time. Should I put up an RC in spite of this? Or should we try and fix them first? St.Ack
-
Re: [Discuss] Release 0.92.2?Devaraj Das 2012-08-21, 07:25
On Aug 17, 2012, at 11:40 AM, Stack wrote: > On Fri, Aug 17, 2012 at 11:16 AM, Shrijeet Paliwal > <[EMAIL PROTECTED]> wrote: >> Hi Stack , >> Should I hold my breath ? Let me know if you need a hand. Looking >> forward towards 0.92.2 RC. >> > > > We have a couple of flakey tests. See > https://builds.apache.org/job/HBase-0.92/. They fail the odd time. > Should I put up an RC in spite of this? Or should we try and fix > them first? > St.Ack I'll take a look at the flaky tests. I'll post updates.
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-24, 18:19
On Tue, Aug 21, 2012 at 12:25 AM, Devaraj Das <[EMAIL PROTECTED]> wrote:
> I'll take a look at the flaky tests. I'll post updates. I think we're making pretty good progress on the flakeys. There is still a log splitting and replication test that we could nail and then I'd say we should cut the RC. St.Ack
-
Re: [Discuss] Release 0.92.2?Shrijeet Paliwal 2012-08-24, 18:57
0.92.2 has following error messages in region server logs (while it is
initializing RegionServerMetrics). Some one reported it here https://issues.apache.org/jira/browse/HBASE-6514 . 1591 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: unknown metrics type: org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram 1592 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: unknown metrics type: org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram 1593 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: unknown metrics type: org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram 1594 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: unknown metrics type: org.apache.hadoop.hbase.metrics.ExactCounterMetric 1595 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: unknown metrics type: org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram 1596 2012-08-22 20:08:28,107 ERROR org.apache.hadoop.metrics.MetricsUtil: unknown metrics type: org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram Is this known? On Fri, Aug 24, 2012 at 11:19 AM, Stack <[EMAIL PROTECTED]> wrote: > On Tue, Aug 21, 2012 at 12:25 AM, Devaraj Das <[EMAIL PROTECTED]> wrote: >> I'll take a look at the flaky tests. I'll post updates. > > I think we're making pretty good progress on the flakeys. There is > still a log splitting and replication test that we could nail and then > I'd say we should cut the RC. > St.Ack
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-24, 19:28
On Fri, Aug 24, 2012 at 11:57 AM, Shrijeet Paliwal
<[EMAIL PROTECTED]> wrote: > 0.92.2 has following error messages in region server logs (while it is > initializing RegionServerMetrics). Some one reported it here > https://issues.apache.org/jira/browse/HBASE-6514 . > > 1591 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: > unknown metrics type: > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > 1592 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: > unknown metrics type: > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > 1593 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: > unknown metrics type: > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > 1594 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: > unknown metrics type: org.apache.hadoop.hbase.metrics.ExactCounterMetric > 1595 2012-08-22 20:08:28,106 ERROR org.apache.hadoop.metrics.MetricsUtil: > unknown metrics type: > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > 1596 2012-08-22 20:08:28,107 ERROR org.apache.hadoop.metrics.MetricsUtil: > unknown metrics type: > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > Is this known? > I pulled it in as a blocker on 0.92.2. Will take a looksee. Thanks Shrijeet. St.Ack
-
Re: [Discuss] Release 0.92.2?Dave Wang 2012-08-24, 19:55
I believe this would be solved by a backport of HBASE-6211 into 0.92.x.
- Dave On Fri, Aug 24, 2012 at 12:28 PM, Stack <[EMAIL PROTECTED]> wrote: > On Fri, Aug 24, 2012 at 11:57 AM, Shrijeet Paliwal > <[EMAIL PROTECTED]> wrote: > > 0.92.2 has following error messages in region server logs (while it is > > initializing RegionServerMetrics). Some one reported it here > > https://issues.apache.org/jira/browse/HBASE-6514 . > > > > 1591 2012-08-22 20:08:28,106 ERROR > org.apache.hadoop.metrics.MetricsUtil: > > unknown metrics type: > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > 1592 2012-08-22 20:08:28,106 ERROR > org.apache.hadoop.metrics.MetricsUtil: > > unknown metrics type: > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > 1593 2012-08-22 20:08:28,106 ERROR > org.apache.hadoop.metrics.MetricsUtil: > > unknown metrics type: > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > 1594 2012-08-22 20:08:28,106 ERROR > org.apache.hadoop.metrics.MetricsUtil: > > unknown metrics type: org.apache.hadoop.hbase.metrics.ExactCounterMetric > > 1595 2012-08-22 20:08:28,106 ERROR > org.apache.hadoop.metrics.MetricsUtil: > > unknown metrics type: > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > 1596 2012-08-22 20:08:28,107 ERROR > org.apache.hadoop.metrics.MetricsUtil: > > unknown metrics type: > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > > > Is this known? > > > > I pulled it in as a blocker on 0.92.2. Will take a looksee. Thanks > Shrijeet. > St.Ack >
-
Re: [Discuss] Release 0.92.2?Shrijeet Paliwal 2012-08-24, 20:52
Hi,
I wanted to report one more issue. Recently I upgraded three data centers to our own checkout of 0.92.2, last commit : commit 5accb6a1be4776630126ac21d07adb652b74df95 Author: Zhihong Yu <[EMAIL PROTECTED]> Date: Mon Aug 20 18:19:45 2012 +0000 24 HBASE-6608 Fix for HBASE-6160, META entries from daughters can be deleted before parent entries, shouldn't compare HRegionInfo's (Enis) On Fri, Aug 24, 2012 at 12:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > I believe this would be solved by a backport of HBASE-6211 into 0.92.x. > > - Dave > > On Fri, Aug 24, 2012 at 12:28 PM, Stack <[EMAIL PROTECTED]> wrote: > > > On Fri, Aug 24, 2012 at 11:57 AM, Shrijeet Paliwal > > <[EMAIL PROTECTED]> wrote: > > > 0.92.2 has following error messages in region server logs (while it is > > > initializing RegionServerMetrics). Some one reported it here > > > https://issues.apache.org/jira/browse/HBASE-6514 . > > > > > > 1591 2012-08-22 20:08:28,106 ERROR > > org.apache.hadoop.metrics.MetricsUtil: > > > unknown metrics type: > > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > > 1592 2012-08-22 20:08:28,106 ERROR > > org.apache.hadoop.metrics.MetricsUtil: > > > unknown metrics type: > > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > > 1593 2012-08-22 20:08:28,106 ERROR > > org.apache.hadoop.metrics.MetricsUtil: > > > unknown metrics type: > > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > > 1594 2012-08-22 20:08:28,106 ERROR > > org.apache.hadoop.metrics.MetricsUtil: > > > unknown metrics type: > org.apache.hadoop.hbase.metrics.ExactCounterMetric > > > 1595 2012-08-22 20:08:28,106 ERROR > > org.apache.hadoop.metrics.MetricsUtil: > > > unknown metrics type: > > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > > 1596 2012-08-22 20:08:28,107 ERROR > > org.apache.hadoop.metrics.MetricsUtil: > > > unknown metrics type: > > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram > > > > > > Is this known? > > > > > > > I pulled it in as a blocker on 0.92.2. Will take a looksee. Thanks > > Shrijeet. > > St.Ack > > >
-
Re: [Discuss] Release 0.92.2?Shrijeet Paliwal 2012-08-24, 20:58
Sorry sent too early by mistake.
Continuing.. I upgraded three data centers to our own checkout of 0.92.2. Two went fine, upgrade to one data center failed. Failed in the sense that ROOT and META assignment took unusually long. Panic struck I restarted master and all region servers and managed to get ROOT assigned. But META assignment got stuck badly. The log is here : https://raw.github.com/gist/3455435/adebd118b47aa3d715201010aa09e5eb8930033c/npe_rs_0.92.2.log Notice how region server was stuck in a loop of NPE (grep processBatchCallback). There is one more NPE related to zookeeper constructor. JD was there at irc channel and he thought it could be regression. On Fri, Aug 24, 2012 at 1:52 PM, Shrijeet Paliwal <[EMAIL PROTECTED]>wrote: > Hi, > > I wanted to report one more issue. Recently I upgraded three data centers > to our own checkout of 0.92.2, last commit : > > commit 5accb6a1be4776630126ac21d07adb652b74df95 > Author: Zhihong Yu <[EMAIL PROTECTED]> > Date: Mon Aug 20 18:19:45 2012 +0000 > 24 > HBASE-6608 Fix for HBASE-6160, META entries from daughters can be deleted > before parent entries, shouldn't compare HRegionInfo's (Enis) > > > > On Fri, Aug 24, 2012 at 12:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > >> I believe this would be solved by a backport of HBASE-6211 into 0.92.x. >> >> - Dave >> >> On Fri, Aug 24, 2012 at 12:28 PM, Stack <[EMAIL PROTECTED]> wrote: >> >> > On Fri, Aug 24, 2012 at 11:57 AM, Shrijeet Paliwal >> > <[EMAIL PROTECTED]> wrote: >> > > 0.92.2 has following error messages in region server logs (while it is >> > > initializing RegionServerMetrics). Some one reported it here >> > > https://issues.apache.org/jira/browse/HBASE-6514 . >> > > >> > > 1591 2012-08-22 20:08:28,106 ERROR >> > org.apache.hadoop.metrics.MetricsUtil: >> > > unknown metrics type: >> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >> > > 1592 2012-08-22 20:08:28,106 ERROR >> > org.apache.hadoop.metrics.MetricsUtil: >> > > unknown metrics type: >> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >> > > 1593 2012-08-22 20:08:28,106 ERROR >> > org.apache.hadoop.metrics.MetricsUtil: >> > > unknown metrics type: >> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >> > > 1594 2012-08-22 20:08:28,106 ERROR >> > org.apache.hadoop.metrics.MetricsUtil: >> > > unknown metrics type: >> org.apache.hadoop.hbase.metrics.ExactCounterMetric >> > > 1595 2012-08-22 20:08:28,106 ERROR >> > org.apache.hadoop.metrics.MetricsUtil: >> > > unknown metrics type: >> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >> > > 1596 2012-08-22 20:08:28,107 ERROR >> > org.apache.hadoop.metrics.MetricsUtil: >> > > unknown metrics type: >> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >> > > >> > > Is this known? >> > > >> > >> > I pulled it in as a blocker on 0.92.2. Will take a looksee. Thanks >> > Shrijeet. >> > St.Ack >> > >> > >
-
Re: [Discuss] Release 0.92.2?Shrijeet Paliwal 2012-08-24, 21:02
Finally I should add , going back to 0.92.1 got cluster back on feet. It
does not prove anything though, I might've got lucky! On Fri, Aug 24, 2012 at 1:58 PM, Shrijeet Paliwal <[EMAIL PROTECTED]>wrote: > Sorry sent too early by mistake. > Continuing.. > > I upgraded three data centers to our own checkout of 0.92.2. Two went > fine, upgrade to one data center failed. Failed in the sense that > ROOT and META assignment took unusually long. Panic struck I restarted > master and all region servers and managed to get ROOT assigned. > But META assignment got stuck badly. > > The log is here : > https://raw.github.com/gist/3455435/adebd118b47aa3d715201010aa09e5eb8930033c/npe_rs_0.92.2.log > > Notice how region server was stuck in a loop of NPE (grep processBatchCallback). > There is one more NPE related to zookeeper constructor. > > JD was there at irc channel and he thought it could be regression. > > On Fri, Aug 24, 2012 at 1:52 PM, Shrijeet Paliwal <[EMAIL PROTECTED] > > wrote: > >> Hi, >> >> I wanted to report one more issue. Recently I upgraded three data centers >> to our own checkout of 0.92.2, last commit : >> >> commit 5accb6a1be4776630126ac21d07adb652b74df95 >> Author: Zhihong Yu <[EMAIL PROTECTED]> >> Date: Mon Aug 20 18:19:45 2012 +0000 >> 24 >> HBASE-6608 Fix for HBASE-6160, META entries from daughters can be deleted >> before parent entries, shouldn't compare HRegionInfo's (Enis) >> >> >> >> On Fri, Aug 24, 2012 at 12:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: >> >>> I believe this would be solved by a backport of HBASE-6211 into 0.92.x. >>> >>> - Dave >>> >>> On Fri, Aug 24, 2012 at 12:28 PM, Stack <[EMAIL PROTECTED]> wrote: >>> >>> > On Fri, Aug 24, 2012 at 11:57 AM, Shrijeet Paliwal >>> > <[EMAIL PROTECTED]> wrote: >>> > > 0.92.2 has following error messages in region server logs (while it >>> is >>> > > initializing RegionServerMetrics). Some one reported it here >>> > > https://issues.apache.org/jira/browse/HBASE-6514 . >>> > > >>> > > 1591 2012-08-22 20:08:28,106 ERROR >>> > org.apache.hadoop.metrics.MetricsUtil: >>> > > unknown metrics type: >>> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >>> > > 1592 2012-08-22 20:08:28,106 ERROR >>> > org.apache.hadoop.metrics.MetricsUtil: >>> > > unknown metrics type: >>> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >>> > > 1593 2012-08-22 20:08:28,106 ERROR >>> > org.apache.hadoop.metrics.MetricsUtil: >>> > > unknown metrics type: >>> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >>> > > 1594 2012-08-22 20:08:28,106 ERROR >>> > org.apache.hadoop.metrics.MetricsUtil: >>> > > unknown metrics type: >>> org.apache.hadoop.hbase.metrics.ExactCounterMetric >>> > > 1595 2012-08-22 20:08:28,106 ERROR >>> > org.apache.hadoop.metrics.MetricsUtil: >>> > > unknown metrics type: >>> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >>> > > 1596 2012-08-22 20:08:28,107 ERROR >>> > org.apache.hadoop.metrics.MetricsUtil: >>> > > unknown metrics type: >>> > > org.apache.hadoop.hbase.metrics.histogram.MetricsHistogram >>> > > >>> > > Is this known? >>> > > >>> > >>> > I pulled it in as a blocker on 0.92.2. Will take a looksee. Thanks >>> > Shrijeet. >>> > St.Ack >>> > >>> >> >> >
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-24, 21:14
On Fri, Aug 24, 2012 at 1:58 PM, Shrijeet Paliwal
<[EMAIL PROTECTED]> wrote: > Notice how region server was stuck in a loop of NPE (grep > processBatchCallback). > There is one more NPE related to zookeeper constructor. > > JD was there at irc channel and he thought it could be regression. > FIle a blocker Shrijeet w/ what info you have. Will take a looksee. St.Ack
-
Re: [Discuss] Release 0.92.2?Devaraj Das 2012-08-24, 21:15
What's the log splitting one?
I am looking at the replication one .. if anyone else (including you, Stack) is willing to look at it, welcome.. On Aug 24, 2012, at 11:19 AM, Stack wrote: > On Tue, Aug 21, 2012 at 12:25 AM, Devaraj Das <[EMAIL PROTECTED]> wrote: >> I'll take a look at the flaky tests. I'll post updates. > > I think we're making pretty good progress on the flakeys. There is > still a log splitting and replication test that we could nail and then > I'd say we should cut the RC. > St.Ack
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-24, 21:20
On Fri, Aug 24, 2012 at 2:15 PM, Devaraj Das <[EMAIL PROTECTED]> wrote:
> What's the log splitting one? > I am looking at the replication one .. if anyone else (including you, Stack) is willing to look at it, welcome.. > org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster.testSplitBeforeSettingSplittingInZK in https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519 I've seen it once or twice before that. Will take a look... St.Ack
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-24, 21:25
On Fri, Aug 24, 2012 at 12:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote:
> I believe this would be solved by a backport of HBASE-6211 into 0.92.x. > I did the backport David but the little test program in HBASE-6211 still throws the ERROR loggings. Elliott is taking a look. St.Ack
-
Re: [Discuss] Release 0.92.2?Shrijeet Paliwal 2012-08-24, 21:27
I have filed https://issues.apache.org/jira/browse/HBASE-6660 . Sorry for
spoiling the party :( On Fri, Aug 24, 2012 at 2:25 PM, Stack <[EMAIL PROTECTED]> wrote: > On Fri, Aug 24, 2012 at 12:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > > I believe this would be solved by a backport of HBASE-6211 into 0.92.x. > > > > I did the backport David but the little test program in HBASE-6211 > still throws the ERROR loggings. Elliott is taking a look. > St.Ack >
-
Re: [Discuss] Release 0.92.2?Ted Yu 2012-08-28, 18:36
About test failures, I saw the following in
https://builds.apache.org/job/HBase-0.92-security<https://builds.apache.org/job/HBase-0.92-security/ws/trunk/target/surefire-reports>build 118: Failed tests: queueFailover(org.apache. hadoop.hbase.replication.TestReplication): Waited too much time for queueFailover replication. Waited 34183ms. I want to get people's opinion on the above test whose failure spans 0.92, 0.94 and 0.96 Cheers On Fri, Aug 24, 2012 at 2:27 PM, Shrijeet Paliwal <[EMAIL PROTECTED]>wrote: > I have filed https://issues.apache.org/jira/browse/HBASE-6660 . Sorry for > spoiling the party :( > > On Fri, Aug 24, 2012 at 2:25 PM, Stack <[EMAIL PROTECTED]> wrote: > > > On Fri, Aug 24, 2012 at 12:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > > > I believe this would be solved by a backport of HBASE-6211 into 0.92.x. > > > > > > > I did the backport David but the little test program in HBASE-6211 > > still throws the ERROR loggings. Elliott is taking a look. > > St.Ack > > >
-
Re: [Discuss] Release 0.92.2?Ted Yu 2012-08-29, 18:03
According to:
https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ The test failure was due to: java.lang.RuntimeException: java.io.FileNotFoundException: /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml (Too many open files) I think currently only HBASE-6649 (TestReplication.queueFailover) is open TestReplication actually hangs in trunk. So it should be fixed across 0.92, 0.94 and trunk. Cheers On Fri, Aug 24, 2012 at 2:20 PM, Stack <[EMAIL PROTECTED]> wrote: > On Fri, Aug 24, 2012 at 2:15 PM, Devaraj Das <[EMAIL PROTECTED]> wrote: > > What's the log splitting one? > > I am looking at the replication one .. if anyone else (including you, > Stack) is willing to look at it, welcome.. > > > > > org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster.testSplitBeforeSettingSplittingInZK > in https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519 > > I've seen it once or twice before that. > > Will take a look... > > St.Ack >
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-30, 06:14
On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> wrote:
> According to: > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > The test failure was due to: > > java.lang.RuntimeException: java.io.FileNotFoundException: > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > (Too many open files) > > I think currently only HBASE-6649 (TestReplication.queueFailover) is open > > TestReplication actually hangs in trunk. So it should be fixed across 0.92, > 0.94 and trunk. > I haven't seen 'too many open files' in recent runs. I saw TestDrainingServer fail here https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ weirdly in setUpBeforeClass TestSplitLogManager here: https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ It seems like TestMetaReaderEditor failed in 540..... Lets try fix a few of the above such that 0.92 passes more often that it fails (Its better already w/ fixes that have gone in lately in that it usually passes when I do a build locally). St.Ack
-
Re: [Discuss] Release 0.92.2?Ted Yu 2012-08-30, 20:52
Looking at TestDrainingServer failure:
admin.enableTable(TABLENAME); // Assert that every regionserver has some regions on it. MiniHBaseCluster cluster = TEST_UTIL.getMiniHBaseCluster(); for (int i = 0; i < cluster.getRegionServerThreads().size(); i++) { HRegionServer hrs = cluster.getRegionServer(i); Assert.assertFalse(hrs.getOnlineRegions().isEmpty()); } When table is enabled, it was possible that some region server didn't carry any region. It seems the above assertion can be removed. Cheers On Wed, Aug 29, 2012 at 11:14 PM, Stack <[EMAIL PROTECTED]> wrote: > On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> wrote: > > According to: > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > > > The test failure was due to: > > > > java.lang.RuntimeException: java.io.FileNotFoundException: > > > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > > (Too many open files) > > > > I think currently only HBASE-6649 (TestReplication.queueFailover) is open > > > > TestReplication actually hangs in trunk. So it should be fixed across > 0.92, > > 0.94 and trunk. > > > > I haven't seen 'too many open files' in recent runs. > > I saw TestDrainingServer fail here > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ > weirdly in setUpBeforeClass > > TestSplitLogManager here: > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ > > It seems like TestMetaReaderEditor failed in 540..... > > Lets try fix a few of the above such that 0.92 passes more often that > it fails (Its better already w/ fixes that have gone in lately in that > it usually passes when I do a build locally). > > St.Ack >
-
Re: [Discuss] Release 0.92.2?Dave Wang 2012-08-30, 20:55
Does HBASE-5992 fix this? That fix is only in trunk right now.
- Dave On Thu, Aug 30, 2012 at 1:52 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > Looking at TestDrainingServer failure: > > admin.enableTable(TABLENAME); > // Assert that every regionserver has some regions on it. > MiniHBaseCluster cluster = TEST_UTIL.getMiniHBaseCluster(); > for (int i = 0; i < cluster.getRegionServerThreads().size(); i++) { > HRegionServer hrs = cluster.getRegionServer(i); > Assert.assertFalse(hrs.getOnlineRegions().isEmpty()); > } > > When table is enabled, it was possible that some region server didn't carry > any region. > It seems the above assertion can be removed. > > Cheers > > On Wed, Aug 29, 2012 at 11:14 PM, Stack <[EMAIL PROTECTED]> wrote: > > > On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > According to: > > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > > > > > The test failure was due to: > > > > > > java.lang.RuntimeException: java.io.FileNotFoundException: > > > > > > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > > > (Too many open files) > > > > > > I think currently only HBASE-6649 (TestReplication.queueFailover) is > open > > > > > > TestReplication actually hangs in trunk. So it should be fixed across > > 0.92, > > > 0.94 and trunk. > > > > > > > I haven't seen 'too many open files' in recent runs. > > > > I saw TestDrainingServer fail here > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ > > weirdly in setUpBeforeClass > > > > TestSplitLogManager here: > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ > > > > It seems like TestMetaReaderEditor failed in 540..... > > > > Lets try fix a few of the above such that 0.92 passes more often that > > it fails (Its better already w/ fixes that have gone in lately in that > > it usually passes when I do a build locally). > > > > St.Ack > > >
-
Re: [Discuss] Release 0.92.2?Ted Yu 2012-08-30, 21:07
Looking at 5992.v11.patch, it is quite big, touching master,
AssignmentManager, etc. At this moment, I want to limit the scope of changes going into 0.92 branch. Cheers On Thu, Aug 30, 2012 at 1:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > Does HBASE-5992 fix this? That fix is only in trunk right now. > > - Dave > > On Thu, Aug 30, 2012 at 1:52 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > Looking at TestDrainingServer failure: > > > > admin.enableTable(TABLENAME); > > // Assert that every regionserver has some regions on it. > > MiniHBaseCluster cluster = TEST_UTIL.getMiniHBaseCluster(); > > for (int i = 0; i < cluster.getRegionServerThreads().size(); i++) { > > HRegionServer hrs = cluster.getRegionServer(i); > > Assert.assertFalse(hrs.getOnlineRegions().isEmpty()); > > } > > > > When table is enabled, it was possible that some region server didn't > carry > > any region. > > It seems the above assertion can be removed. > > > > Cheers > > > > On Wed, Aug 29, 2012 at 11:14 PM, Stack <[EMAIL PROTECTED]> wrote: > > > > > On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > > According to: > > > > > > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > > > > > > > The test failure was due to: > > > > > > > > java.lang.RuntimeException: java.io.FileNotFoundException: > > > > > > > > > > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > > > > (Too many open files) > > > > > > > > I think currently only HBASE-6649 (TestReplication.queueFailover) is > > open > > > > > > > > TestReplication actually hangs in trunk. So it should be fixed across > > > 0.92, > > > > 0.94 and trunk. > > > > > > > > > > I haven't seen 'too many open files' in recent runs. > > > > > > I saw TestDrainingServer fail here > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ > > > weirdly in setUpBeforeClass > > > > > > TestSplitLogManager here: > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ > > > > > > It seems like TestMetaReaderEditor failed in 540..... > > > > > > Lets try fix a few of the above such that 0.92 passes more often that > > > it fails (Its better already w/ fixes that have gone in lately in that > > > it usually passes when I do a build locally). > > > > > > St.Ack > > > > > >
-
Re: [Discuss] Release 0.92.2?N Keywal 2012-08-30, 21:18
Well, if I remember well the test was flaky because of multiple bugs on
draining servers management... (cf. the mail on test speed :-) ) On Thu, Aug 30, 2012 at 11:07 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > Looking at 5992.v11.patch, it is quite big, touching master, > AssignmentManager, etc. > > At this moment, I want to limit the scope of changes going into 0.92 > branch. > > Cheers > > On Thu, Aug 30, 2012 at 1:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > > > Does HBASE-5992 fix this? That fix is only in trunk right now. > > > > - Dave > > > > On Thu, Aug 30, 2012 at 1:52 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > > > Looking at TestDrainingServer failure: > > > > > > admin.enableTable(TABLENAME); > > > // Assert that every regionserver has some regions on it. > > > MiniHBaseCluster cluster = TEST_UTIL.getMiniHBaseCluster(); > > > for (int i = 0; i < cluster.getRegionServerThreads().size(); i++) { > > > HRegionServer hrs = cluster.getRegionServer(i); > > > Assert.assertFalse(hrs.getOnlineRegions().isEmpty()); > > > } > > > > > > When table is enabled, it was possible that some region server didn't > > carry > > > any region. > > > It seems the above assertion can be removed. > > > > > > Cheers > > > > > > On Wed, Aug 29, 2012 at 11:14 PM, Stack <[EMAIL PROTECTED]> wrote: > > > > > > > On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> > wrote: > > > > > According to: > > > > > > > > > > > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > > > > > > > > > The test failure was due to: > > > > > > > > > > java.lang.RuntimeException: java.io.FileNotFoundException: > > > > > > > > > > > > > > > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > > > > > (Too many open files) > > > > > > > > > > I think currently only HBASE-6649 (TestReplication.queueFailover) > is > > > open > > > > > > > > > > TestReplication actually hangs in trunk. So it should be fixed > across > > > > 0.92, > > > > > 0.94 and trunk. > > > > > > > > > > > > > I haven't seen 'too many open files' in recent runs. > > > > > > > > I saw TestDrainingServer fail here > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ > > > > weirdly in setUpBeforeClass > > > > > > > > TestSplitLogManager here: > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ > > > > > > > > It seems like TestMetaReaderEditor failed in 540..... > > > > > > > > Lets try fix a few of the above such that 0.92 passes more often that > > > > it fails (Its better already w/ fixes that have gone in lately in > that > > > > it usually passes when I do a build locally). > > > > > > > > St.Ack > > > > > > > > > >
-
Re: [Discuss] Release 0.92.2?Ted Yu 2012-08-30, 21:37
5992.v11.patch couldn't be cleanly applied to 0.92
I got: ./src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java.rej ./src/main/java/org/apache/hadoop/hbase/master/handler/ServerShutdownHandler.java.rej ./src/main/java/org/apache/hadoop/hbase/master/HMaster.java.rej ./src/main/java/org/apache/hadoop/hbase/master/ServerManager.java.rej ./src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java.rej ./src/test/java/org/apache/hadoop/hbase/TestDrainingServer.java.rej On Thu, Aug 30, 2012 at 2:18 PM, N Keywal <[EMAIL PROTECTED]> wrote: > Well, if I remember well the test was flaky because of multiple bugs on > draining servers management... (cf. the mail on test speed :-) ) > > On Thu, Aug 30, 2012 at 11:07 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > Looking at 5992.v11.patch, it is quite big, touching master, > > AssignmentManager, etc. > > > > At this moment, I want to limit the scope of changes going into 0.92 > > branch. > > > > Cheers > > > > On Thu, Aug 30, 2012 at 1:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > > > > > Does HBASE-5992 fix this? That fix is only in trunk right now. > > > > > > - Dave > > > > > > On Thu, Aug 30, 2012 at 1:52 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > > > > > Looking at TestDrainingServer failure: > > > > > > > > admin.enableTable(TABLENAME); > > > > // Assert that every regionserver has some regions on it. > > > > MiniHBaseCluster cluster = TEST_UTIL.getMiniHBaseCluster(); > > > > for (int i = 0; i < cluster.getRegionServerThreads().size(); > i++) { > > > > HRegionServer hrs = cluster.getRegionServer(i); > > > > Assert.assertFalse(hrs.getOnlineRegions().isEmpty()); > > > > } > > > > > > > > When table is enabled, it was possible that some region server didn't > > > carry > > > > any region. > > > > It seems the above assertion can be removed. > > > > > > > > Cheers > > > > > > > > On Wed, Aug 29, 2012 at 11:14 PM, Stack <[EMAIL PROTECTED]> wrote: > > > > > > > > > On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> > > wrote: > > > > > > According to: > > > > > > > > > > > > > > > > > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > > > > > > > > > > > The test failure was due to: > > > > > > > > > > > > java.lang.RuntimeException: java.io.FileNotFoundException: > > > > > > > > > > > > > > > > > > > > > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > > > > > > (Too many open files) > > > > > > > > > > > > I think currently only HBASE-6649 (TestReplication.queueFailover) > > is > > > > open > > > > > > > > > > > > TestReplication actually hangs in trunk. So it should be fixed > > across > > > > > 0.92, > > > > > > 0.94 and trunk. > > > > > > > > > > > > > > > > I haven't seen 'too many open files' in recent runs. > > > > > > > > > > I saw TestDrainingServer fail here > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ > > > > > weirdly in setUpBeforeClass > > > > > > > > > > TestSplitLogManager here: > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ > > > > > > > > > > It seems like TestMetaReaderEditor failed in 540..... > > > > > > > > > > Lets try fix a few of the above such that 0.92 passes more often > that > > > > > it fails (Its better already w/ fixes that have gone in lately in > > that > > > > > it usually passes when I do a build locally). > > > > > > > > > > St.Ack > > > > > > > > > > > > > > >
-
Re: [Discuss] Release 0.92.2?Dave Wang 2012-08-30, 22:14
+1 on limiting changes into 0.92. Just wanted to point out a patch that we
may be able to pull part of to fix the test. - Dave On Thu, Aug 30, 2012 at 2:07 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > Looking at 5992.v11.patch, it is quite big, touching master, > AssignmentManager, etc. > > At this moment, I want to limit the scope of changes going into 0.92 > branch. > > Cheers > > On Thu, Aug 30, 2012 at 1:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > > > Does HBASE-5992 fix this? That fix is only in trunk right now. > > > > - Dave > > > > On Thu, Aug 30, 2012 at 1:52 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > > > Looking at TestDrainingServer failure: > > > > > > admin.enableTable(TABLENAME); > > > // Assert that every regionserver has some regions on it. > > > MiniHBaseCluster cluster = TEST_UTIL.getMiniHBaseCluster(); > > > for (int i = 0; i < cluster.getRegionServerThreads().size(); i++) { > > > HRegionServer hrs = cluster.getRegionServer(i); > > > Assert.assertFalse(hrs.getOnlineRegions().isEmpty()); > > > } > > > > > > When table is enabled, it was possible that some region server didn't > > carry > > > any region. > > > It seems the above assertion can be removed. > > > > > > Cheers > > > > > > On Wed, Aug 29, 2012 at 11:14 PM, Stack <[EMAIL PROTECTED]> wrote: > > > > > > > On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> > wrote: > > > > > According to: > > > > > > > > > > > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > > > > > > > > > The test failure was due to: > > > > > > > > > > java.lang.RuntimeException: java.io.FileNotFoundException: > > > > > > > > > > > > > > > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > > > > > (Too many open files) > > > > > > > > > > I think currently only HBASE-6649 (TestReplication.queueFailover) > is > > > open > > > > > > > > > > TestReplication actually hangs in trunk. So it should be fixed > across > > > > 0.92, > > > > > 0.94 and trunk. > > > > > > > > > > > > > I haven't seen 'too many open files' in recent runs. > > > > > > > > I saw TestDrainingServer fail here > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ > > > > weirdly in setUpBeforeClass > > > > > > > > TestSplitLogManager here: > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ > > > > > > > > It seems like TestMetaReaderEditor failed in 540..... > > > > > > > > Lets try fix a few of the above such that 0.92 passes more often that > > > > it fails (Its better already w/ fixes that have gone in lately in > that > > > > it usually passes when I do a build locally). > > > > > > > > St.Ack > > > > > > > > > >
-
Re: [Discuss] Release 0.92.2?Ted Yu 2012-08-31, 15:48
Thanks for the response Dave.
I exchanged a few emails with Lars H last night and we felt we shouldn't block 0.92.2 / 0.94.2 releases due to a few flaky tests. Let's fix flaky tests continuously. On Thu, Aug 30, 2012 at 3:14 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > +1 on limiting changes into 0.92. Just wanted to point out a patch that we > may be able to pull part of to fix the test. > > - Dave > > On Thu, Aug 30, 2012 at 2:07 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > Looking at 5992.v11.patch, it is quite big, touching master, > > AssignmentManager, etc. > > > > At this moment, I want to limit the scope of changes going into 0.92 > > branch. > > > > Cheers > > > > On Thu, Aug 30, 2012 at 1:55 PM, Dave Wang <[EMAIL PROTECTED]> wrote: > > > > > Does HBASE-5992 fix this? That fix is only in trunk right now. > > > > > > - Dave > > > > > > On Thu, Aug 30, 2012 at 1:52 PM, Ted Yu <[EMAIL PROTECTED]> wrote: > > > > > > > Looking at TestDrainingServer failure: > > > > > > > > admin.enableTable(TABLENAME); > > > > // Assert that every regionserver has some regions on it. > > > > MiniHBaseCluster cluster = TEST_UTIL.getMiniHBaseCluster(); > > > > for (int i = 0; i < cluster.getRegionServerThreads().size(); > i++) { > > > > HRegionServer hrs = cluster.getRegionServer(i); > > > > Assert.assertFalse(hrs.getOnlineRegions().isEmpty()); > > > > } > > > > > > > > When table is enabled, it was possible that some region server didn't > > > carry > > > > any region. > > > > It seems the above assertion can be removed. > > > > > > > > Cheers > > > > > > > > On Wed, Aug 29, 2012 at 11:14 PM, Stack <[EMAIL PROTECTED]> wrote: > > > > > > > > > On Wed, Aug 29, 2012 at 11:03 AM, Ted Yu <[EMAIL PROTECTED]> > > wrote: > > > > > > According to: > > > > > > > > > > > > > > > > > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/519/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitBeforeSettingSplittingInZK/ > > > > > > > > > > > > The test failure was due to: > > > > > > > > > > > > java.lang.RuntimeException: java.io.FileNotFoundException: > > > > > > > > > > > > > > > > > > > > > /home/hudson/hudson-slave/workspace/HBase-0.92/trunk/target/classes/hbase-default.xml > > > > > > (Too many open files) > > > > > > > > > > > > I think currently only HBASE-6649 (TestReplication.queueFailover) > > is > > > > open > > > > > > > > > > > > TestReplication actually hangs in trunk. So it should be fixed > > across > > > > > 0.92, > > > > > > 0.94 and trunk. > > > > > > > > > > > > > > > > I haven't seen 'too many open files' in recent runs. > > > > > > > > > > I saw TestDrainingServer fail here > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/543/ > > > > > weirdly in setUpBeforeClass > > > > > > > > > > TestSplitLogManager here: > > > > > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/542/ > > > > > > > > > > It seems like TestMetaReaderEditor failed in 540..... > > > > > > > > > > Lets try fix a few of the above such that 0.92 passes more often > that > > > > > it fails (Its better already w/ fixes that have gone in lately in > > that > > > > > it usually passes when I do a build locally). > > > > > > > > > > St.Ack > > > > > > > > > > > > > > >
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-31, 17:10
On Fri, Aug 31, 2012 at 8:48 AM, Ted Yu <[EMAIL PROTECTED]> wrote:
> Thanks for the response Dave. > > I exchanged a few emails with Lars H last night and we felt we shouldn't > block 0.92.2 / 0.94.2 releases due to a few flaky tests. > > Let's fix flaky tests continuously. > That seems basically fine except I'd think that we'd pass most of the time rather than as it is now, where we fail and its the exception that tests pass. Whatever about Jenkins, I'd think that when I build locally, again, most of the time the build should pass but its not been the case during my builds of 0.92 locally. Has triage on 0.92 failing tests been done to figure if tests are flakey or whether its hbase that is the problem (I've seen JIRA traffic on 0.92 unit tests so this may have been done already). Can we not disable tests that we've labelled flakey and not issues in hbase? St.Ack
-
Re: [Discuss] Release 0.92.2?Ted Yu 2012-08-31, 17:26
bq. Can we not disable tests that we've labelled flakey and not issues in
hbase? No tests were disabled this past week. I will continue to work on finding root cause for flay tests, with the help from people which related JIRAs are assigned to. Cheers On Fri, Aug 31, 2012 at 10:10 AM, Stack <[EMAIL PROTECTED]> wrote: > On Fri, Aug 31, 2012 at 8:48 AM, Ted Yu <[EMAIL PROTECTED]> wrote: > > Thanks for the response Dave. > > > > I exchanged a few emails with Lars H last night and we felt we shouldn't > > block 0.92.2 / 0.94.2 releases due to a few flaky tests. > > > > Let's fix flaky tests continuously. > > > > That seems basically fine except I'd think that we'd pass most of the > time rather than as it is now, where we fail and its the exception > that tests pass. Whatever about Jenkins, I'd think that when I build > locally, again, most of the time the build should pass but its not > been the case during my builds of 0.92 locally. > > Has triage on 0.92 failing tests been done to figure if tests are > flakey or whether its hbase that is the problem (I've seen JIRA > traffic on 0.92 unit tests so this may have been done already). Can > we not disable tests that we've labelled flakey and not issues in > hbase? > > St.Ack >
-
Re: [Discuss] Release 0.92.2?Stack 2012-08-31, 23:43
On Fri, Aug 31, 2012 at 10:26 AM, Ted Yu <[EMAIL PROTECTED]> wrote:
> bq. Can we not disable tests that we've labelled flakey and not issues in > hbase? > > No tests were disabled this past week. > ? Because 0.92 branch had no flakey tests this week? The last two builds of 0.92 succeeded as did my most recent local build. Maybe its not as bad as I thought. I'll run some more builds up on jenkins and see how it goes. St.Ack
-
Re: [Discuss] Release 0.92.2?Stack 2012-09-01, 06:02
On Fri, Aug 31, 2012 at 4:43 PM, Stack <[EMAIL PROTECTED]> wrote:
> On Fri, Aug 31, 2012 at 10:26 AM, Ted Yu <[EMAIL PROTECTED]> wrote: >> bq. Can we not disable tests that we've labelled flakey and not issues in >> hbase? >> >> No tests were disabled this past week. >> > > ? Because 0.92 branch had no flakey tests this week? > > The last two builds of 0.92 succeeded as did my most recent local > build. Maybe its not as bad as I thought. I'll run some more builds > up on jenkins and see how it goes. > The build https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/552/ failed TestReplication and TestSplitTransactionOnCluster, a couple of old favorites. St.Ack |