[jboss-jira] [JBoss JIRA] (WFLY-6749) Cluster failover doesn't work on windows when network is disabled on a node

Paul Ferraro (JIRA) issues at jboss.org
Wed Jun 22 17:07:00 EDT 2016


    [ https://issues.jboss.org/browse/WFLY-6749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13256050#comment-13256050 ] 

Paul Ferraro commented on WFLY-6749:
------------------------------------

The default stack contains the following failure detection protocols:
* FD_SOCK
* FD_ALL
These protocols are described here:
http://www.jgroups.org/manual/index.html#FailureDetection
I suspect that your method of simulating a failure - by disabling the network of the host machine is not being detected by FD_SOCK.  It will however, be detected by FD_ALL, but only after 1 minute.  The heartbeat timeout used by FD_ALL can be manipulated via the timeout property.
e.g.
<protocol type="FD_ALL" ><property name="timeout">60000</property></protocol>

> Cluster failover doesn't work on windows when network is disabled on a node
> ---------------------------------------------------------------------------
>
>                 Key: WFLY-6749
>                 URL: https://issues.jboss.org/browse/WFLY-6749
>             Project: WildFly
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 8.2.0.Final
>            Reporter: Preeta Kuruvilla
>            Assignee: Paul Ferraro
>            Priority: Critical
>
> This is about a two VM Wildfly cluster on windows environment. In order to test the failover, the team has disabled the network on one node. However the failover is not happening and the application functionality on the cluster is hampered as a result.



--
This message was sent by Atlassian JIRA
(v6.4.11#64026)


More information about the jboss-jira mailing list