New subject: [JBoss JIRA] Updated: (ISPN-493) Harden rehash leave process

Thursday, 10 June 2010

Harden rehash leave process
---------------------------

                 Key: ISPN-493
                 URL: https://jira.jboss.org/browse/ISPN-493
             Project: Infinispan
          Issue Type: Task
    Affects Versions: 4.1.0.BETA2, 4.0.0.Final
            Reporter: Vladimir Blagojevic
            Assignee: Vladimir Blagojevic
             Fix For: 5.0.0.BETA1, 5.0.0.Final

We need to make sure that leave rehash process properly handles massive and rapid node
failure. 

Massive failures:
JGroups detects multiple node failures and pushes up to Infinispan views that are more
"volatile" than we currently assumed (only one member at the time can leave).
For example, if we have view V1={A,B,C,D,E} and massive failure causes {C,D,E} to fail,
JGroups failure detection and GMS are going to install a view V2={A,B} to surviving
members. LeaveTask does not handle this scenario.

Rapid node failure:
We need to revisit how LeaveTasks are queued up and executed/canceled during rapid node
failures. Do we always cancel currently running leave tasks? At what stage are we allowed
to cancel it and at what stage of a leave tasks is it better to wait for a completion of a
task. 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
https://jira.jboss.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

[JBoss JIRA] Created: (ISPN-493) Harden rehash leave process