[jboss-jira] [JBoss JIRA] (JGRP-2293) Graceful concurrent leaving of coordinator(s) leaves the cluster with stale views

Mon Sep 24 09:10:00 EDT 2018

    [ https://issues.jboss.org/browse/JGRP-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13637475#comment-13637475 ] 

Bela Ban edited comment on JGRP-2293 at 9/24/18 9:09 AM:
---------------------------------------------------------

Reducing the timeout on FD_ALL helps: {{new FD_ALL().timeout(3000).interval(1000)}}.

I have to investigate whether this is a bug or not, as the first and second in line leaving at the same time is something that will probably have to solved by failure detection.

I'll look into this this week.

was (Author: belaban):
Reducing the timeout on FD_ALL helps: {{ new FD_ALL().timeout(3000).interval(1000)}}.

I have to investigate whether this is a bug or not, as the first and second in line leaving at the same time is something that will probably have to solved by failure detection.

I'll look into this this week.

> Graceful concurrent leaving of coordinator(s) leaves the cluster with stale views
> ---------------------------------------------------------------------------------
>
>                 Key: JGRP-2293
>                 URL: https://issues.jboss.org/browse/JGRP-2293
>             Project: JGroups
>          Issue Type: Bug
>    Affects Versions: 4.0.14
>            Reporter: Radoslav Husar
>            Assignee: Bela Ban
>            Priority: Critical
>             Fix For: 4.0.15
>
>
> JGroups does not handle concurrent leaving of nodes correctly. This is a typical use case in cloud environment when scaled down with an autoscaler/manually which we need to handle.
> A simple test can be devised which fails first n (where n>1) nodes from a cluster, reproducer PR https://github.com/belaban/JGroups/pull/397

--
This message was sent by Atlassian JIRA
(v7.5.0#75005)