No take on this one?
In Zookeeper the heartbeats happen on every third of the timeout. If I am
not mistaken, recomended timeout is more than 2 minutes to avoid false
But I still cannot see the relationship on HDFS between heartbeat interval
and timeout. Okay 10 minutes seems to be a conservative value to avoid
false positives in a big cluster. But that means 200 hearbeats. Heartbeats
on HDFS are not only used for liveness detection but also to send
information about free space and load and to receive commands from
NameNode. So they are also essential for block placement decisions and for
ensuring the replication levels. Would that then be reason why heartbeats
are so frequent? A lot can happen to a DataNode in just three seconds?
On Thu, Mar 7, 2013 at 10:37 PM, André Oriani <[EMAIL PROTECTED]> wrote:
> Is there any particular reason why the default heartbeat interval is 3
> seconds and the timeout is 10 minutes? Everywhere I looked (code, Google,
> ..) only mentions the values but no clue on why those values were chosen.
> Thanks in advance,
> André Oriani