[jboss-jira] [JBoss JIRA] (JGRP-2237) The single node in the cluster not become a coordinator after coordinator leave.

kfir avraham (JIRA) issues at jboss.org
Tue Nov 28 08:50:00 EST 2017


    [ https://issues.jboss.org/browse/JGRP-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13494655#comment-13494655 ] 

kfir avraham commented on JGRP-2237:
------------------------------------

I changed the 'VERIFY_SUSPECT' timeout to 500 (as in your configuration) and got the same issue (see log below).
When I changed 'port_range', the nodes did not discover each other on the first attempt.

*This also happens after the node has been shut down for a few minutes, not just after a restart.
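
For reference, the VERIFY_SUSPECT change described above would look roughly like the fragment below in a JGroups XML stack. This is only a sketch: the attached test.xml is not reproduced in this message, so the rest of the stack and the position of this element in it are assumptions, and the 'port_range' value that was tried is not stated here and is therefore left out.

```xml
<!-- Hypothetical fragment of a JGroups 4.0.x stack; the surrounding protocols are omitted. -->
<!-- timeout is in milliseconds: how long to wait for a reply from a suspected member. -->
<VERIFY_SUSPECT timeout="500"/>
```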

2017-11-28 01:33:47:0975	+ 1	Nov-28-2017 01:33:47 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-654: no members discovered after 30021 ms: creating cluster as first member
2017-11-28 01:33:47:0975	+ 2	Nov-28-2017 01:33:47 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-654: installing view [clm-tlv-spih62-654|0] (1) [clm-tlv-spih62-654]
2017-11-28 01:33:47:0975	+ 4	Nov-28-2017 01:33:47 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-654: created cluster (first member). My view is [clm-tlv-spih62-654|0], impl is org.jgroups.protocols.pbcast.CoordGmsImpl
2017-11-28 01:33:58:0747	+ 57	Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: joiners=[clm-tlv-spk9g4-11729], suspected=[], leaving=[], new view: [clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729]
2017-11-28 01:33:58:0747	+ 58	Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: mcasting view [clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729]
2017-11-28 01:33:58:0747	+ 59	Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: installing view [clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729]
2017-11-28 01:33:58:0747	+ 61	Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: sending join-rsp to clm-tlv-spk9g4-11729: view=[clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729] (2 mbrs)
2017-11-28 01:34:00:0294	+ 62	Nov-28-2017 01:33:59 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: got all ACKs (1) for view [clm-tlv-spih62-654|1] in 1331 ms
2017-11-28 01:34:57:0822	+ 66	Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: joiners=[], suspected=[], leaving=[clm-tlv-spih62-654], new view: [clm-tlv-spk9g4-11729|2] (1) [clm-tlv-spk9g4-11729]
2017-11-28 01:34:57:0822	+ 67	Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: sending LEAVE response to clm-tlv-spih62-654
2017-11-28 01:34:57:0822	+ 68	Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: mcasting view [clm-tlv-spk9g4-11729|2] (1) [clm-tlv-spk9g4-11729]
2017-11-28 01:34:57:0822	+ 69	Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: got all ACKs (1) for view [clm-tlv-spk9g4-11729|2] in 16 ms
2017-11-28 01:36:26:0919	+ 86	Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: discovery took 30023 ms, members: 1 rsps (0 coords) [done]
2017-11-28 01:36:26:0919	+ 87	Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: could not determine coordinator from rsps 1 rsps (0 coords) [done]
2017-11-28 01:36:26:0919	+ 88	Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: nodes to choose new coord from are: [clm-tlv-spih62-28153, clm-tlv-spk9g4-11729]
2017-11-28 01:36:26:0919	+ 89	Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: I (clm-tlv-spih62-28153) am the first of the nodes, will become coordinator
2017-11-28 01:36:26:0919	+ 90	Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: installing view [clm-tlv-spih62-28153|0] (1) [clm-tlv-spih62-28153]
2017-11-28 01:36:26:0919	+ 92	Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: created cluster (first member). My view is [clm-tlv-spih62-28153|0], impl is org.jgroups.protocols.pbcast.CoordGmsImpl


> The single node in the cluster not become a coordinator after coordinator leave.
> --------------------------------------------------------------------------------
>
>                 Key: JGRP-2237
>                 URL: https://issues.jboss.org/browse/JGRP-2237
>             Project: JGroups
>          Issue Type: Bug
>    Affects Versions: 4.0.2, 4.0.8
>            Reporter: kfir avraham
>            Assignee: Bela Ban
>            Priority: Minor
>         Attachments: test.xml
>
>
> I have a cluster with 2 members. Sometimes, when the first node (the coordinator) leaves the cluster, the second node does not become the coordinator.
> When the first node rejoins, it cannot determine the coordinator and selects a new one from the list of nodes.



--
This message was sent by Atlassian JIRA
(v7.5.0#75005)