My information here may be stale.
I remember we increased the timeout interval from 3 to 30 minutes, because
the master injecting itself mid-assignment often triggered races and led to
double assignments and other bad stuff. At 30 minutes, the timeout is not
useful IMO. As an operator I'd run hbck to sort it out long before then.
On Thursday, December 6, 2012, Nicolas Liochon wrote:
> See comments in HBASE-7247: the master checks the time spent by the
> regionserver on an open, and assigns the region to another if it takes
> too long. From Stack: "I'm currently of the opinion that this expensive facility of
> master failing an open because it has been taking too long on a particular
> regionserver has been of no use – worse, it has only caused headache – but
> I may be just not remembering and others out on dev list will have better
> recall than I."
> So, opinions & memories are more than welcome.
> Removing this feature would be a huge simplification!
Problems worthy of attack prove their worth by hitting back. - Piet Hein
(via Tom White)