Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # dev >> Re: Build failed in Jenkins: HBase-TRUNK #3498


Copy link to this message
-
Re: Build failed in Jenkins: HBase-0.94 #589
Agreed. Looking at TestSplitLogManager.testUnassignedTimeout.
It has some pretty tight timeouts (500ms), which is likely to be a problem on slow (or overloaded) build machines.
I'm doubling the timeouts.
Anyway, the current run just got past these too tests, let's hope there're no other failures.
Then we can tackle these tests in 0.94.4 and 0.96.

-- Lars
----- Original Message -----
From: Ted Yu <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]>
Cc:
Sent: Wednesday, November 14, 2012 12:30 PM
Subject: Re: Build failed in Jenkins: HBase-0.94 #589

I noticed the test failures in TestSplitTransactionOnCluster

0.94.3 has fix for region splitting issue. I think we should pay a little
attention fixing TestSplitTransactionOnCluster so that it passes more often.

Cheers

On Wed, Nov 14, 2012 at 12:18 PM, lars hofhansl <[EMAIL PROTECTED]> wrote:

> Here're the test that failed recently without a fix:
>
>
> TestSplitLogManager.testUnassignedTimeout x 3
> TestSplitLogManager.testMultipleResubmits
> TestSplitTransactionOnCluster.testShutdownFixupWhenDaughterHasSplit x 2
> TestSplitTransactionOnCluster.testMasterRestartWhenSplittingIsPartial
>
> TestSplitTransactionOnCluster.testShouldThrowIOExceptionIfStoreFileSizeIsEmptyAndSHouldSuccessfullyExecuteRollback
> TestCatalogTrackerOnCluster.testBadOriginalRootLocation
> TestDistributedLogSplitting.testDelayedDeleteOnFailure
> TestScannerTimeout.test3686a
> TestReplication.testVerifyRepJob
> TestReplication.queueFailover
> TestFromClientSideWithCoprocessor.testPoolBehavior
> TestColumnSeeking.testDuplicateVersions
>
>
>
> Based on that at least TestSplitLogManager.testUnassignedTimeout should
> get the axe (or be investigated)
>
> -- Lars
>
> ----- Original Message -----
> From: Jimmy Xiang <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]>
> Cc:
> Sent: Wednesday, November 14, 2012 12:12 PM
> Subject: Re: Build failed in Jenkins: HBase-0.94 #589
>
> I agree. +1
>
> We can keep a list of flaky tests so that we can fix them later on.
>
> Thanks,
> Jimmy
>
> On Wed, Nov 14, 2012 at 11:55 AM, lars hofhansl <[EMAIL PROTECTED]>
> wrote:
> > Sigh.
> >
> > It seems we're back at having a successful build being the exception
> rather than the rule.
> > In this case it was some(?) timeout (all tests ran and passed), but many
> previous runs had at least one test failing.
> >
> > Flaky tests are useless. They do not add confidence to a run, and worse
> they add noise, which requires us to manually filter the good from the bad
> runs but looking at the results.
> >
> > There was talk about separating the flaky tests from the good ones.
> > Short term I propose to disable or remove every test that failed more
> than once in the last 10 runs.
> >
> >
> > This is getting quite frustrating.
> >
> > -- Lars
> >
> >
> > ----- Original Message -----
> > From: Apache Jenkins Server <[EMAIL PROTECTED]>
> > To: [EMAIL PROTECTED]
> > Cc:
> > Sent: Wednesday, November 14, 2012 11:13 AM
> > Subject: Build failed in Jenkins: HBase-0.94 #589
> >
> > See <https://builds.apache.org/job/HBase-0.94/589/>
> >
> > ------------------------------------------
> > [...truncated 581 lines...]
> > Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 4.706 sec
> > Running org.apache.hadoop.hbase.regionserver.TestAtomicOperation
> > Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 15.647
> sec
> > Running
> org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster
> > Tests run: 10, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 117.495
> sec
> > Running org.apache.hadoop.hbase.regionserver.TestRegionServerMetrics
> > Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 114.374
> sec
> > Running org.apache.hadoop.hbase.regionserver.metrics.TestSchemaMetrics
> > Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 15.448
> sec
> > Running org.apache.hadoop.hbase.regionserver.TestHRegionOnCluster
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB