Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # dev >> Re: Build failed in Jenkins: HBase-TRUNK #3498


Copy link to this message
-
Re: Build failed in Jenkins: HBase-0.94 #589
I noticed the test failures in TestSplitTransactionOnCluster

0.94.3 has fix for region splitting issue. I think we should pay a little
attention fixing TestSplitTransactionOnCluster so that it passes more often.

Cheers

On Wed, Nov 14, 2012 at 12:18 PM, lars hofhansl <[EMAIL PROTECTED]> wrote:

> Here're the test that failed recently without a fix:
>
>
> TestSplitLogManager.testUnassignedTimeout x 3
> TestSplitLogManager.testMultipleResubmits
> TestSplitTransactionOnCluster.testShutdownFixupWhenDaughterHasSplit x 2
> TestSplitTransactionOnCluster.testMasterRestartWhenSplittingIsPartial
>
> TestSplitTransactionOnCluster.testShouldThrowIOExceptionIfStoreFileSizeIsEmptyAndSHouldSuccessfullyExecuteRollback
> TestCatalogTrackerOnCluster.testBadOriginalRootLocation
> TestDistributedLogSplitting.testDelayedDeleteOnFailure
> TestScannerTimeout.test3686a
> TestReplication.testVerifyRepJob
> TestReplication.queueFailover
> TestFromClientSideWithCoprocessor.testPoolBehavior
> TestColumnSeeking.testDuplicateVersions
>
>
>
> Based on that at least TestSplitLogManager.testUnassignedTimeout should
> get the axe (or be investigated)
>
> -- Lars
>
> ----- Original Message -----
> From: Jimmy Xiang <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]>
> Cc:
> Sent: Wednesday, November 14, 2012 12:12 PM
> Subject: Re: Build failed in Jenkins: HBase-0.94 #589
>
> I agree. +1
>
> We can keep a list of flaky tests so that we can fix them later on.
>
> Thanks,
> Jimmy
>
> On Wed, Nov 14, 2012 at 11:55 AM, lars hofhansl <[EMAIL PROTECTED]>
> wrote:
> > Sigh.
> >
> > It seems we're back at having a successful build being the exception
> rather than the rule.
> > In this case it was some(?) timeout (all tests ran and passed), but many
> previous runs had at least one test failing.
> >
> > Flaky tests are useless. They do not add confidence to a run, and worse
> they add noise, which requires us to manually filter the good from the bad
> runs but looking at the results.
> >
> > There was talk about separating the flaky tests from the good ones.
> > Short term I propose to disable or remove every test that failed more
> than once in the last 10 runs.
> >
> >
> > This is getting quite frustrating.
> >
> > -- Lars
> >
> >
> > ----- Original Message -----
> > From: Apache Jenkins Server <[EMAIL PROTECTED]>
> > To: [EMAIL PROTECTED]
> > Cc:
> > Sent: Wednesday, November 14, 2012 11:13 AM
> > Subject: Build failed in Jenkins: HBase-0.94 #589
> >
> > See <https://builds.apache.org/job/HBase-0.94/589/>
> >
> > ------------------------------------------
> > [...truncated 581 lines...]
> > Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 4.706 sec
> > Running org.apache.hadoop.hbase.regionserver.TestAtomicOperation
> > Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 15.647
> sec
> > Running
> org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster
> > Tests run: 10, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 117.495
> sec
> > Running org.apache.hadoop.hbase.regionserver.TestRegionServerMetrics
> > Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 114.374
> sec
> > Running org.apache.hadoop.hbase.regionserver.metrics.TestSchemaMetrics
> > Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 15.448
> sec
> > Running org.apache.hadoop.hbase.regionserver.TestHRegionOnCluster
> > Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 43.172
> sec
> > Running org.apache.hadoop.hbase.regionserver.TestBlocksRead
> > Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.987 sec
> > Running
> org.apache.hadoop.hbase.regionserver.TestStoreFileBlockCacheSummary
> > Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 34.313
> sec
> > Running org.apache.hadoop.hbase.regionserver.wal.TestLogRollingNoCluster
> > Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 2.529 sec
> > Running org.apache.hadoop.hbase.regionserver.TestHRegion