[jboss-jira] [JBoss JIRA] (JGRP-1855) FD_HOST: host failure detection protocol

Bela Ban (JIRA) issues at jboss.org
Thu Jul 31 02:18:31 EDT 2014


    [ https://issues.jboss.org/browse/JGRP-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12978208#comment-12978208 ] 

Bela Ban edited comment on JGRP-1855 at 7/31/14 2:17 AM:
---------------------------------------------------------

OK, first impl is done and has been tested with VirtualBox/Fedora 14 instances. The config would look like this:
{code:xml}
 <FD_HOST interval="10000" timeout="35000" />
{code}

The current impl uses {{InetAddress.isReachable()}}. Next: include ability to use external commands such as {{/sbin/ping}}.


was (Author: belaban):
OK, first impl is done and has been tested with VirtualBox/Fedora 14 instances. The config would look like this:
{code:xml}
 <FD_PING2 interval="10000" timeout="35000" />
{code}

The current impl uses {{InetAddress.isReachable()}}. Next: include ability to use external commands such as {{/sbin/ping}}.

> FD_HOST: host failure detection protocol
> ----------------------------------------
>
>                 Key: JGRP-1855
>                 URL: https://issues.jboss.org/browse/JGRP-1855
>             Project: JGroups
>          Issue Type: Feature Request
>      Security Level: Public(Everyone can see) 
>            Reporter: Bela Ban
>            Assignee: Bela Ban
>             Fix For: 3.4.5, 3.5
>
>
> A new protocol similar to FD_PING which detects entire host failures and suspects all members on the failed host.
> Features are:
> * Contrary to FD_PING which uses a ring structure, FD_HOST will have everyone ping everybody else (similar to FD_ALL)
> ** A structure keeps track of hosts (IP addresses) and members on those hosts
> *** Example
> ||192.168.1.2||192.168.1.3||192.168.1.5||
> |A,B,C|D,E,F|X,Y,Z|
> * We sort the members lexically and the *first* member runs a ping against each other IP address, e.g. A pings 192.168.1.3 and 192.168.1.5, D pings 192.168.1.2 and 192.168.1.5 etc
> * The ping command itself is pluggable and can be a Java class (e.g. using {{InetAddress.isReachable()}}, a script or a command (e.g. {{/sbin/ping}}).
> * When an entire host is suspected, we suspect *all* cluster members on it
> ** Example: if B suspects 192.168.1.5, members X, Y and Z are suspected and removed from the view



--
This message was sent by Atlassian JIRA
(v6.2.6#6264)


More information about the jboss-jira mailing list