[JBoss JIRA] (ISPN-11297) Rejoining nodes with global state may have their caches corrupted if there is a config mismatch
by Tristan Tarrant (Jira)
[ https://issues.redhat.com/browse/ISPN-11297?page=com.atlassian.jira.plugi... ]
Tristan Tarrant commented on ISPN-11297:
----------------------------------------
Currently the error message is:
`ISPN-000500 Cannot create clustered configuration for cache '%s' because configuration %n%s%n is incompatible with the existing configuration %n%s`
> Rejoining nodes with global state may have their caches corrupted if there is a config mismatch
> -----------------------------------------------------------------------------------------------
>
> Key: ISPN-11297
> URL: https://issues.redhat.com/browse/ISPN-11297
> Project: Infinispan
> Issue Type: Bug
> Components: Configuration, Core
> Affects Versions: 10.1.1.Final
> Reporter: Tristan Tarrant
> Assignee: Tristan Tarrant
> Priority: Critical
>
> With persistent global state enabled, a node that was previously part of a cluster and rejoins it currently processes caches from the cluster state before the ones from its local state. As a result, if the local cache configuration is incompatible, it is overwritten with the one coming from the cluster.
> When joining, the node should perform compatibility checks between caches in the cluster state and the local state before creating them. If a mismatch is found, it should fail fast.
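A minimal sketch of the fail-fast check described in the issue, assuming configurations can be compared for compatibility before any cache is created. Every name here (`CacheConfigCheck`, `verifyBeforeJoin`, `IncompatibleConfigException`) is hypothetical and not part of Infinispan. Configurations are modelled as plain strings, with equality standing in for the real compatibility rule.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Objects;
import java.util.TreeMap;

// Hypothetical sketch: none of these types are Infinispan classes.
final class CacheConfigCheck {

    /** Thrown before any cache is created when local and cluster state disagree. */
    static final class IncompatibleConfigException extends RuntimeException {
        IncompatibleConfigException(String message) {
            super(message);
        }
    }

    /**
     * Compares every cache that exists in both states and throws if any differ.
     * String equality is an assumption standing in for a real compatibility rule.
     */
    static void verifyBeforeJoin(Map<String, String> localState, Map<String, String> clusterState) {
        List<String> mismatches = new ArrayList<>();
        // TreeMap gives a deterministic iteration order for the error message.
        for (Map.Entry<String, String> local : new TreeMap<>(localState).entrySet()) {
            String clusterConfig = clusterState.get(local.getKey());
            if (clusterConfig != null && !Objects.equals(clusterConfig, local.getValue())) {
                mismatches.add(local.getKey());
            }
        }
        if (!mismatches.isEmpty()) {
            // Fail fast: nothing has been created or overwritten yet.
            throw new IncompatibleConfigException("Incompatible configuration for caches " + mismatches);
        }
    }

    public static void main(String[] args) {
        Map<String, String> local = Map.of("users", "DIST_SYNC", "sessions", "REPL_SYNC");
        Map<String, String> cluster = Map.of("users", "DIST_SYNC", "sessions", "DIST_SYNC");
        try {
            verifyBeforeJoin(local, cluster);
        } catch (IncompatibleConfigException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

The point of the sketch is ordering: the comparison runs over both states up front, so a mismatch aborts the join instead of silently replacing the local configuration.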
--
This message was sent by Atlassian Jira
(v7.13.8#713008)
[JBoss JIRA] (ISPN-11298) Create TakeOfflineManager
by Pedro Ruivo (Jira)
[ https://issues.redhat.com/browse/ISPN-11298?page=com.atlassian.jira.plugi... ]
Pedro Ruivo updated ISPN-11298:
-------------------------------
Status: Open (was: New)
> Create TakeOfflineManager
> -------------------------
>
> Key: ISPN-11298
> URL: https://issues.redhat.com/browse/ISPN-11298
> Project: Infinispan
> Issue Type: Sub-task
> Components: Cross-Site Replication
> Reporter: Pedro Ruivo
> Assignee: Pedro Ruivo
> Priority: Major
>
> The code that takes sites offline is currently embedded in BackupSender. Since IRAC is complex on its own, it will be implemented in a separate class.
> The take-offline code is common to both, so it needs to be extracted into an independent component that IRAC and sync xsite can share.
--
[JBoss JIRA] (ISPN-11298) Create TakeOfflineManager
by Pedro Ruivo (Jira)
Pedro Ruivo created ISPN-11298:
----------------------------------
Summary: Create TakeOfflineManager
Key: ISPN-11298
URL: https://issues.redhat.com/browse/ISPN-11298
Project: Infinispan
Issue Type: Sub-task
Components: Cross-Site Replication
Reporter: Pedro Ruivo
Assignee: Pedro Ruivo
The code that takes sites offline is currently embedded in BackupSender. Since IRAC is complex on its own, it will be implemented in a separate class.
The take-offline code is common to both, so it needs to be extracted into an independent component that IRAC and sync xsite can share.
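The extraction described above might look roughly like the sketch below: one component owns the per-site failure tracking, and each sender delegates to it. This is an illustration only. The method names, the `Sender` stand-in, and the simple consecutive-failure threshold are assumptions, not the actual Infinispan design.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: names and the failure-count policy are illustrative only.
final class TakeOfflineSketch {

    /** Shared component: counts consecutive failures per site and reports offline status. */
    static final class TakeOfflineManager {
        private final int maxFailures;
        private final Map<String, AtomicInteger> failures = new ConcurrentHashMap<>();

        TakeOfflineManager(int maxFailures) {
            this.maxFailures = maxFailures;
        }

        void recordFailure(String site) {
            failures.computeIfAbsent(site, s -> new AtomicInteger()).incrementAndGet();
        }

        /** A successful request resets the failure count for the site. */
        void recordSuccess(String site) {
            failures.remove(site);
        }

        boolean isOffline(String site) {
            AtomicInteger count = failures.get(site);
            return count != null && count.get() >= maxFailures;
        }
    }

    /** Stand-in for a backup sender (sync or async) that delegates to the shared manager. */
    static final class Sender {
        private final TakeOfflineManager manager;

        Sender(TakeOfflineManager manager) {
            this.manager = manager;
        }

        /** 'delivered' simulates the outcome of the remote call. */
        boolean send(String site, boolean delivered) {
            if (manager.isOffline(site)) {
                return false; // skip sites that have been taken offline
            }
            if (delivered) {
                manager.recordSuccess(site);
            } else {
                manager.recordFailure(site);
            }
            return delivered;
        }
    }

    public static void main(String[] args) {
        TakeOfflineManager manager = new TakeOfflineManager(2);
        Sender syncSender = new Sender(manager);
        Sender asyncSender = new Sender(manager);
        syncSender.send("NYC", false);
        asyncSender.send("NYC", false);
        System.out.println("NYC offline: " + manager.isOffline("NYC"));
    }
}
```

Because both senders hold the same manager instance, failures observed on either path count toward the same per-site threshold, which is the sharing the issue asks for.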
--
[JBoss JIRA] (ISPN-11291) MultipleCachesDuringConflictResolutionTest.testPartitionMergePolicy random failures
by Dan Berindei (Jira)
[ https://issues.redhat.com/browse/ISPN-11291?page=com.atlassian.jira.plugi... ]
Dan Berindei updated ISPN-11291:
--------------------------------
Status: Pull Request Sent (was: Open)
Git Pull Request: https://github.com/infinispan/infinispan/pull/7854
> MultipleCachesDuringConflictResolutionTest.testPartitionMergePolicy random failures
> -----------------------------------------------------------------------------------
>
> Key: ISPN-11291
> URL: https://issues.redhat.com/browse/ISPN-11291
> Project: Infinispan
> Issue Type: Bug
> Components: Core
> Affects Versions: 10.1.1.Final
> Reporter: Dan Berindei
> Assignee: Dan Berindei
> Priority: Major
> Labels: testsuite_stability
> Fix For: 11.0.0.Alpha1
>
>
> Actually it fails pretty constantly in CI:
> {noformat}
> 18:59:06,673 ERROR (testng-Test:[]) [TestSuiteProgress] Test failed: org.infinispan.conflict.impl.MultipleCachesDuringConflictResolutionTest.testPartitionMergePolicy[DIST_SYNC, DENY_READ_WRITES]
> java.lang.RuntimeException: Cache Test timed out waiting for rebalancing to complete on node Test-NodeA, current topology is CacheTopology{id=8, phase=CONFLICT_RESOLUTION, rebalanceId=4, currentCH=PartitionerConsistentHash:DefaultConsistentHash{ns=256, owners = (2)[Test-NodeB: 256+0, Test-NodeA: 0+256]}, pendingCH=null, unionCH=null, actualMembers=[Test-NodeB, Test-NodeA], persistentUUIDs=[95c5afaf-de70-49f5-a494-4ae634425e9b, 73824c63-18cb-4de3-a714-b1b1b3fe5713]}. rebalanceInProgress=true, currentChIsBalanced=true
> at org.infinispan.test.TestingUtil.waitForNoRebalance(TestingUtil.java:435) ~[test-classes/:?]
> at org.infinispan.test.TestingUtil.waitForNoRebalance(TestingUtil.java:502) ~[test-classes/:?]
> at org.infinispan.partitionhandling.BasePartitionHandlingTest$Partition.waitForPartitionToForm(BasePartitionHandlingTest.java:336) ~[test-classes/:?]
> at org.infinispan.partitionhandling.BasePartitionHandlingTest$Partition.merge(BasePartitionHandlingTest.java:316) ~[test-classes/:?]
> at org.infinispan.partitionhandling.BasePartitionHandlingTest$Partition.merge(BasePartitionHandlingTest.java:305) ~[test-classes/:?]
> at org.infinispan.conflict.impl.MultipleCachesDuringConflictResolutionTest.testPartitionMergePolicy(MultipleCachesDuringConflictResolutionTest.java:60) ~[test-classes/:?]
> {noformat}
--