[jboss-jira] [JBoss JIRA] (JGRP-2237) The single node in the cluster not become a coordinator after coordinator leave.
kfir avraham (JIRA)
issues at jboss.org
Tue Nov 28 08:50:00 EST 2017
[ https://issues.jboss.org/browse/JGRP-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13494655#comment-13494655 ]
kfir avraham commented on JGRP-2237:
------------------------------------
i changed 'VERIFY_SUSPECT' timeout to 500 (as in your configuration), and i got the same issue (see log below).
when i changed 'port_range', they not discovered each other from the first time.
*it is happened also after shutdown for a few minutes, and not just for restart.
2017-11-28 01:33:47:0975 + 1 Nov-28-2017 01:33:47 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-654: no members discovered after 30021 ms: creating cluster as first member
2017-11-28 01:33:47:0975 + 2 Nov-28-2017 01:33:47 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-654: installing view [clm-tlv-spih62-654|0] (1) [clm-tlv-spih62-654]
2017-11-28 01:33:47:0975 + 4 Nov-28-2017 01:33:47 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-654: created cluster (first member). My view is [clm-tlv-spih62-654|0], impl is org.jgroups.protocols.pbcast.CoordGmsImpl
2017-11-28 01:33:58:0747 + 57 Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: joiners=[clm-tlv-spk9g4-11729], suspected=[], leaving=[], new view: [clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729]
2017-11-28 01:33:58:0747 + 58 Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: mcasting view [clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729]
2017-11-28 01:33:58:0747 + 59 Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: installing view [clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729]
2017-11-28 01:33:58:0747 + 61 Nov-28-2017 01:33:58 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: sending join-rsp to clm-tlv-spk9g4-11729: view=[clm-tlv-spih62-654|1] (2) [clm-tlv-spih62-654, clm-tlv-spk9g4-11729] (2 mbrs)
2017-11-28 01:34:00:0294 + 62 Nov-28-2017 01:33:59 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-11,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: got all ACKs (1) for view [clm-tlv-spih62-654|1] in 1331 ms
2017-11-28 01:34:57:0822 + 66 Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: joiners=[], suspected=[], leaving=[clm-tlv-spih62-654], new view: [clm-tlv-spk9g4-11729|2] (1) [clm-tlv-spk9g4-11729]
2017-11-28 01:34:57:0822 + 67 Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: sending LEAVE response to clm-tlv-spih62-654
2017-11-28 01:34:57:0822 + 68 Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: mcasting view [clm-tlv-spk9g4-11729|2] (1) [clm-tlv-spk9g4-11729]
2017-11-28 01:34:57:0822 + 69 Nov-28-2017 01:34:57 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [ViewHandler-14,clm-tlv-spih62-654] [INFO] - clm-tlv-spih62-654: got all ACKs (1) for view [clm-tlv-spk9g4-11729|2] in 16 ms
2017-11-28 01:36:26:0919 + 86 Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: discovery took 30023 ms, members: 1 rsps (0 coords) [done]
2017-11-28 01:36:26:0919 + 87 Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: could not determine coordinator from rsps 1 rsps (0 coords) [done]
2017-11-28 01:36:26:0919 + 88 Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: nodes to choose new coord from are: [clm-tlv-spih62-28153, clm-tlv-spk9g4-11729]
2017-11-28 01:36:26:0919 + 89 Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: I (clm-tlv-spih62-28153) am the first of the nodes, will become coordinator
2017-11-28 01:36:26:0919 + 90 Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: installing view [clm-tlv-spih62-28153|0] (1) [clm-tlv-spih62-28153]
2017-11-28 01:36:26:0919 + 92 Nov-28-2017 01:36:26 GMT-12:00 [org.jgroups.protocols.pbcast.GMS] [main] [INFO] - clm-tlv-spih62-28153: created cluster (first member). My view is [clm-tlv-spih62-28153|0], impl is org.jgroups.protocols.pbcast.CoordGmsImpl
> The single node in the cluster not become a coordinator after coordinator leave.
> --------------------------------------------------------------------------------
>
> Key: JGRP-2237
> URL: https://issues.jboss.org/browse/JGRP-2237
> Project: JGroups
> Issue Type: Bug
> Affects Versions: 4.0.2, 4.0.8
> Reporter: kfir avraham
> Assignee: Bela Ban
> Priority: Minor
> Attachments: test.xml
>
>
> I got cluster with 2 members, sometimes when the first node (coordinator) leave the cluster the second one is not become a coordinator.
> When the first one is rejoin, he could not determine coordinator and select new one from the nodes list.
--
This message was sent by Atlassian JIRA
(v7.5.0#75005)
More information about the jboss-jira
mailing list