[Red Hat JIRA] (ISPN-12350) Persistent UUIDs are only used for initial consistent hash

Sunday, 24 January 2021

     [ https://issues.redhat.com/browse/ISPN-12350?page=com.atlassian.jira.plugi... ]

Tristan Tarrant updated ISPN-12350:
-----------------------------------
    Fix Version/s: 12.1.0.Final
                       (was: 12.0.0.Final)

...
 Persistent UUIDs are only used for initial consistent hash
 ----------------------------------------------------------

                 Key: ISPN-12350
                 URL: https://issues.redhat.com/browse/ISPN-12350
             Project: Infinispan
          Issue Type: Bug
          Components: Core, State Transfer
    Affects Versions: 12.0.0.Dev03, 11.0.3.Final
            Reporter: Dan Berindei
            Assignee: Dan Berindei
            Priority: Major
             Fix For: 12.1.0.Final

 After a graceful restart, the persisted UUIDs are used to re-create the consistent hash
that the cache had before shutdown. This initial CH is not rebalanced, so there is no state
transfer immediately after cluster restart.
 However, if something later triggers a rebalance (e.g. a node join/leave), the persistent
UUIDs are ignored, and {{SyncConsistentHashFactory}} allocates segments based on the new
JGroups addresses instead of the persistent UUIDs.
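
 The effect can be illustrated with a toy model. The sketch below is not Infinispan code and
does not reproduce the {{SyncConsistentHashFactory}} algorithm: it uses a simple
highest-random-weight hash as a stand-in, the class and method names are hypothetical, and
the pre-restart addresses (Test-NodeA/B/C) are assumed for illustration. It only shows why
an allocation keyed by an identity that changes on restart moves segments, while one keyed
by a stable identity does not.

```java
import java.util.List;

class PersistentUuidSketch {
    // SplitMix64 finalizer: a well-known bit mixer, used here only to spread toy scores.
    static long mix(long x) {
        x ^= x >>> 30;
        x *= 0xbf58476d1ce4e5b9L;
        x ^= x >>> 27;
        x *= 0x94d049bb133111ebL;
        x ^= x >>> 31;
        return x;
    }

    // Toy allocation (highest-random-weight hashing, NOT Infinispan's algorithm):
    // segment s is owned by the member index whose identity key scores highest.
    static int[] assign(List<String> identityKeys, int numSegments) {
        int[] owners = new int[numSegments];
        for (int s = 0; s < numSegments; s++) {
            int best = 0;
            long bestScore = Long.MIN_VALUE;
            for (int m = 0; m < identityKeys.size(); m++) {
                long score = mix(((long) identityKeys.get(m).hashCode() << 32) | s);
                if (score > bestScore) {
                    bestScore = score;
                    best = m;
                }
            }
            owners[s] = best;
        }
        return owners;
    }

    // Number of segments whose owner index differs between two allocations.
    static int countMoved(int[] before, int[] after) {
        int moved = 0;
        for (int s = 0; s < before.length; s++) {
            if (before[s] != after[s]) {
                moved++;
            }
        }
        return moved;
    }

    public static void main(String[] args) {
        int segments = 256;
        // Hypothetical pre-restart addresses; the post-restart names appear in the trace.
        List<String> oldAddresses = List.of("Test-NodeA", "Test-NodeB", "Test-NodeC");
        List<String> newAddresses = List.of("Test-NodeD", "Test-NodeE", "Test-NodeF");
        // Persistent UUIDs as logged in this issue; they are the same before and after restart.
        List<String> uuids = List.of(
                "1ba71c04-a6b9-4a5c-9f51-e5e358081dc6",
                "6d3ff549-aafa-4d8a-8617-84ac6f119549",
                "f37f6a8c-32a4-4dda-b1b0-876c24f42c6a");

        int byAddress = countMoved(assign(oldAddresses, segments), assign(newAddresses, segments));
        int byUuid = countMoved(assign(uuids, segments), assign(uuids, segments));
        System.out.println("moved when keyed by address: " + byAddress);
        System.out.println("moved when keyed by persistent UUID: " + byUuid);
    }
}
```

 Keying by UUID trivially yields the same allocation on both sides of the restart, which is
the property the rebalance would need in order to avoid moving segments.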
 I modified {{ThreeNodeDistGlobalStateRestartTest}} to force a rebalance after restart,
and I got
 {noformat}
 11:24:07,424 TRACE (jgroups-7,Test-NodeD:[]) [ClusterCacheStatus] Cache testCache
topology updated: CacheTopology{id=1, phase=NO_REBALANCE, rebalanceId=1,
currentCH=DefaultConsistentHash{ns=256, owners = (3)[Test-NodeD: 83+0, Test-NodeE: 87+0,
Test-NodeF: 86+0]}, pendingCH=null, unionCH=null, actualMembers=[Test-NodeD, Test-NodeE,
Test-NodeF], persistentUUIDs=[1ba71c04-a6b9-4a5c-9f51-e5e358081dc6,
6d3ff549-aafa-4d8a-8617-84ac6f119549, f37f6a8c-32a4-4dda-b1b0-876c24f42c6a]}, members =
[Test-NodeD, Test-NodeE, Test-NodeF], joiners = []
 11:24:07,889 TRACE (testng-Test:[]) [ClusterCacheStatus] Rebalancing consistent hash for
cache testCache, members are [Test-NodeD, Test-NodeE, Test-NodeF]
 11:24:07,909 TRACE (testng-Test:[]) [ClusterCacheStatus] Updating cache testCache
topology for rebalance: CacheTopology{id=2, phase=READ_OLD_WRITE_ALL, rebalanceId=2,
currentCH=DefaultConsistentHash{ns=256, owners = (3)[Test-NodeD: 83+0, Test-NodeE: 87+0,
Test-NodeF: 86+0]}, pendingCH=DefaultConsistentHash{ns=256, owners = (3)[Test-NodeD: 87+0,
Test-NodeE: 83+0, Test-NodeF: 86+0]}, unionCH=null, actualMembers=[Test-NodeD, Test-NodeE,
Test-NodeF], persistentUUIDs=[1ba71c04-a6b9-4a5c-9f51-e5e358081dc6,
6d3ff549-aafa-4d8a-8617-84ac6f119549, f37f6a8c-32a4-4dda-b1b0-876c24f42c6a]}
 11:24:07,910 TRACE (testng-Test:[]) [ClusterCacheStatus] Moved segments: [Test-NodeD
added 72 removed 68, Test-NodeE added 49 removed 53, Test-NodeF added 59 removed 59]
 {noformat}
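
 As a cross-check of the trace above, the per-node counts reconcile with the "Moved
segments" line. The sketch below only redoes that arithmetic with the numbers copied from
the log; the class name is hypothetical, and reading {{83+0}} as "primary-owned + backup-owned"
segments is an assumption about the log format.

```java
class MovedSegmentsCheck {
    // Values copied from the trace, in member order Test-NodeD, Test-NodeE, Test-NodeF.
    static final int[] BEFORE = {83, 87, 86};   // currentCH owned segments
    static final int[] AFTER = {87, 83, 86};    // pendingCH owned segments
    static final int[] ADDED = {72, 49, 59};    // "added" in the Moved segments line
    static final int[] REMOVED = {68, 53, 59};  // "removed" in the Moved segments line

    // True if before + added - removed equals after for every node.
    static boolean reconciles() {
        for (int i = 0; i < BEFORE.length; i++) {
            if (BEFORE[i] + ADDED[i] - REMOVED[i] != AFTER[i]) {
                return false;
            }
        }
        return true;
    }

    static int sum(int[] values) {
        int total = 0;
        for (int v : values) {
            total += v;
        }
        return total;
    }

    public static void main(String[] args) {
        System.out.println("counts reconcile: " + reconciles());
        System.out.println("segments that gained a new owner: " + sum(ADDED)
                + " of " + sum(BEFORE));
    }
}
```

 Under that reading, 180 of the 256 segments change owner even though each node's total
changes by at most 4, so the balanced-looking totals hide most of the movement.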
 This issue does not affect caches using {{DefaultConsistentHashFactory}}, because it
doesn't care about member UUIDs. Since there is no
{{SyncScatteredConsistentHashFactory}}, scattered caches are not affected at all.
Replicated caches with the default {{SyncReplicatedConsistentHashFactory}} will change
primary owners, but they won't need any state transfer.
 {{TestingUtil.waitForNoRebalance()}} works around the issue by not checking whether the
initial consistent hash (with topologyId==1) is balanced. 

--
This message was sent by Atlassian Jira
(v8.13.1#813001)
