[jboss-jira] [JBoss JIRA] (WFLY-4748) Singleton service fails to start after repetitive cluster split with "Failed to reach quorum of 1"
Tomas Hofman (JIRA)
issues at jboss.org
Fri Jun 5 05:15:06 EDT 2015
[ https://issues.jboss.org/browse/WFLY-4748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tomas Hofman updated WFLY-4748:
-------------------------------
Steps to Reproduce: There is a not yet merged test case in https://github.com/TomasHofman/wildfly/tree/WFLY-4724. Just modify MERGE2 discovery protocol to MERGE3 in addTunnelingTransportStack.xsl, otherwise the test will fail due to another issue WFLY-4748.
> Singleton service fails to start after repetitive cluster split with "Failed to reach quorum of 1"
> --------------------------------------------------------------------------------------------------
>
> Key: WFLY-4748
> URL: https://issues.jboss.org/browse/WFLY-4748
> Project: WildFly
> Issue Type: Bug
> Components: Clustering
> Affects Versions: 10.0.0.Alpha2
> Reporter: Tomas Hofman
> Assignee: Tomas Hofman
>
> When cluster of two nodes with deployed singleton service (f.i. cluster-ha-singleton quickstart app) splits, merges, and splits again, one of the nodes fails to run the singleton service with error message "WFLYCLSV0006: Failed to reach *quorum of 1* for jboss.quickstart.ha.singleton.default2 service. No singleton master will be elected." - note the "quorum of 1".
> This only happens after the second and other successive splits. After the first split both nodes execute the service correctly.
> After analysis, it appears that nodes are never being added back to service providers cache upon cluster merge, because CacheServiceProviderRegistrationFactory#membershipChanged() is never called with 'merged' attribute set to 'true'.
> I presume that call should come from ChannelCommandDispatcherFactory#viewAccepted():
> {code}
> public void viewAccepted(View view) {
> // ...
> for (Listener listener: this.listeners) {
> listener.membershipChanged(oldNodes, newNodes, view instanceof MergeView);
> }
> }
> {code}
> This method gets called, but the problem is that the 'listeners' list is empty, so no listener is actually notified.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
More information about the jboss-jira
mailing list