[jboss-jira] [JBoss JIRA] Resolved: (JGRP-746) FD: messages from members other than ping_dest causes missing-heartbeat count to be reset
Bela Ban (JIRA)
jira-events at lists.jboss.org
Mon Apr 28 05:56:22 EDT 2008
[ http://jira.jboss.com/jira/browse/JGRP-746?page=all ]
Bela Ban resolved JGRP-746.
---------------------------
Resolution: Done
Fixed in 2.6 and 2.7
> FD: messages from members other than ping_dest causes missing-heartbeat count to be reset
> -----------------------------------------------------------------------------------------
>
> Key: JGRP-746
> URL: http://jira.jboss.com/jira/browse/JGRP-746
> Project: JGroups
> Issue Type: Bug
> Reporter: Bela Ban
> Assigned To: Bela Ban
> Priority: Critical
> Fix For: 2.6.3, 2.7
>
>
> [email from John Smith]
> I'm not sure FD is behaving like it should.
> I started a group with two members. I then suspended one instance with a kill -SIGSTOP. After a while I expected the FD protocol to suspect the suspended jvm but it did not do it.
> I looked at FD code and it seems like messages that do not come from ping_dest reset num_tries and thus prevent the member from being suspected. Is this intended? Why would a message from self reset num_tries?
> I'm using jgroups 2.6.2.
> Here is and the relevant part of the jgroups logs:
> 10:10:48,291 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:10:48,291 DEBUG [FD] heartbeat missing from 192.168.128.105:47870
> (number=0)
> 10:11:18,293 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:11:18,293 DEBUG [FD] heartbeat missing from 192.168.128.105:47870
> (number=1)
> 10:11:48,294 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:11:48,294 DEBUG [FD] heartbeat missing from 192.168.128.105:47870
> (number=2)
> 10:12:18,296 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:12:18,296 DEBUG [FD] heartbeat missing from 192.168.128.105:47870
> (number=3)
> 10:12:48,299 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:12:48,299 DEBUG [FD] heartbeat missing from 192.168.128.105:47870
> (number=4)
> 10:12:51,265 DEBUG [FD] received msg from 192.168.128.129:57685 (counts as ack) 10:13:18,300 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:13:19,336 DEBUG [FD] received msg from 192.168.128.129:57685 (counts as ack)
> 10:13:45,988 DEBUG [FD] received msg from 192.168.128.129:57685 (counts as ack)
> 10:13:48,302 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:14:18,303 DEBUG [FD] sending are-you-alive msg to 192.168.128.105:47870 (own address=192.168.128.129:57685)
> 10:14:18,303 DEBUG [FD] heartbeat missing from 192.168.128.105:47870
> (number=0)
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.jboss.com/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
More information about the jboss-jira
mailing list