[
https://issues.jboss.org/browse/ISPN-9173?page=com.atlassian.jira.plugin....
]
Dan Berindei commented on ISPN-9173:
------------------------------------
Unfortunately there are still other random test failures in the partition handling tests
related to ISPN-9291. This is the kind of test failure the non-atomic availability update
causes, because the availability mode changed to DEGRADED but the actual members are still
a majority:
{noformat}
Test failed:
org.infinispan.partitionhandling.TwoWaySplitAndMergeTest.testSplitAndMerge1[DIST_SYNC]
java.lang.AssertionError: Should have thrown an
org.infinispan.partitionhandling.AvailabilityException
at org.infinispan.test.Exceptions.assertException(Exceptions.java:18)
at org.infinispan.test.Exceptions.expectException(Exceptions.java:99)
at
org.infinispan.partitionhandling.BasePartitionHandlingTest.assertKeyNotAvailableForRead(BasePartitionHandlingTest.java:400)
at
org.infinispan.partitionhandling.BasePartitionHandlingTest$Partition.assertKeyNotAvailableForRead(BasePartitionHandlingTest.java:346)
at
org.infinispan.partitionhandling.BasePartitionHandlingTest$Partition.assertKeysNotAvailableForRead(BasePartitionHandlingTest.java:341)
at
org.infinispan.partitionhandling.TwoWaySplitAndMergeTest.testSplitAndMerge(TwoWaySplitAndMergeTest.java:92)
at
org.infinispan.partitionhandling.TwoWaySplitAndMergeTest.testSplitAndMerge1(TwoWaySplitAndMergeTest.java:31)
{noformat}
Availability mode should be updated atomically with the actual
members
----------------------------------------------------------------------
Key: ISPN-9173
URL:
https://issues.jboss.org/browse/ISPN-9173
Project: Infinispan
Issue Type: Bug
Components: Core
Affects Versions: 9.3.0.Beta1
Reporter: Dan Berindei
Assignee: Dan Berindei
Labels: testsuite_stability
Fix For: 9.3.0.Final
This is a follow-up on ISPN-7682, which asks for the topology itself to be updated
atomically.
{{LocalTopologyManagerImpl}} has additional logic to update the availability mode first
when the cache becomes degraded and to update it last when the cache becomes available,
which means any delay between the updates cannot cause data inconsistencies.
But that logic doesn't really belong in {{LocalTopologyManagerImpl}}, and it's
easy to forget it's there (and in fact we had a bug there related to the new rebalance
phases).
In addition, tests that want to check the cache behaviour in degraded mode and wait only
for the availability mode change will fail if there's a big delay between the
availability mode change. I actually hit this while testing my ISPN-8731/ISPN-7682
changes, and I had added a random delay in {{StateConsumerImpl}} before
{{distributionManager.setCacheTopology()}}.
--
This message was sent by Atlassian JIRA
(v7.5.0#75005)