[infinispan-issues] [JBoss JIRA] (ISPN-8962) PreferAvailabilityStrategy: Rely less on the stable topology

Dan Berindei (JIRA) issues at jboss.org
Mon Mar 19 13:41:01 EDT 2018


Dan Berindei created ISPN-8962:
----------------------------------

             Summary: PreferAvailabilityStrategy: Rely less on the stable topology
                 Key: ISPN-8962
                 URL: https://issues.jboss.org/browse/ISPN-8962
             Project: Infinispan
          Issue Type: Bug
          Components: Core
    Affects Versions: 9.2.0.Final
            Reporter: Dan Berindei
            Assignee: Dan Berindei
             Fix For: 9.3.0.Final, 9.2.1.Final


{{PreferAvailabilityStrategy}} checks the size of the stable topology, and only considers cache topologies that are derived from the biggest topology (in size) when picking a post-merge topology.

Unfortunately, in some situations this algorithm fails pretty badly. If a node has a very long GC pause, when it comes back it will report the old topology *and* the old stable topology. If the rest of the cluster rebalanced, it now has both a smaller current topology and a smaller stable topology.

Furthermore, the stable topology is updated asynchronously, independent from the current topology. So even if there's a split and the minority partition installs a current topology with fewer members, it may take some time for its stable topology to be updated with fewer members. In fact, it appears that when a rebalance is not needed (e.g. because the partition has a single node), the stable topology is never updated!




--
This message was sent by Atlassian JIRA
(v7.5.0#75005)


More information about the infinispan-issues mailing list