[jboss-jira] [JBoss JIRA] (WFLY-5762) Messaging replication fails to check-for-live-server on restart

Srinivas ev (Jira) issues at jboss.org
Thu Nov 22 07:37:00 EST 2018


    [ https://issues.jboss.org/browse/WFLY-5762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13665141#comment-13665141 ] 

Srinivas ev commented on WFLY-5762:
-----------------------------------

Hi [~jmesnil], can you share the standalone-full-ha.xml 's of master and slave.
I am facing same issue and I am unable to point out the code like you. Enabled few debug logs.

Below are my assumptions based on logs observed -
1.  master is trying to reach the http-connector of slave.
2.  logs of Connection pair always displays the same host/ip address after restart.

I will attach the logs, please anyone can have a look on this. [^rotateserver_active.log]  [^rotateserver_backup.log] 

> Messaging replication fails to check-for-live-server on restart
> ---------------------------------------------------------------
>
>                 Key: WFLY-5762
>                 URL: https://issues.jboss.org/browse/WFLY-5762
>             Project: WildFly
>          Issue Type: Bug
>          Components: JMS
>    Affects Versions: 10.0.0.CR4
>            Reporter: Jeff Mesnil
>            Assignee: Andy Taylor
>            Priority: Major
>             Fix For: 10.0.0.CR5
>
>
> The attached configuration use JGroups.
> I had a look at the code and I suspect the issue is located somewhere when the server1 is restarted and calls its SharedNothingLiveActivation#isNodeIdUsed().
> This method returns false and the server completes its live activation instead of setting its HA policy to replicaPolicy.
> Digging into the code, I looks like DiscoveryGroup#received boolean is never set to true because its corresponding JGroupsBroadcastEndpoint never receives any JGroups message.
> I confirm that server2 is working at that time and does send JGroups message.
> I suspect that there is a bug in the wrapping of JGroups receiver/channel/etc. in org.apache.activemq.artemis.api.core.JGroupsBroadcastEndpoint and the endpoint in DiscoveryGroup never receives the message that is actually received by JGroups in the ReceiverAdapter instantiated by JGroupsBroadcastEndpoint.JChannelWrapper#connect.



--
This message was sent by Atlassian Jira
(v7.12.1#712002)


More information about the jboss-jira mailing list