[
https://issues.jboss.org/browse/ISPN-2892?page=com.atlassian.jira.plugin....
]
Dan Berindei commented on ISPN-2892:
------------------------------------
Dennis, can you describe the scenario in more detail? Are we talking about restarting
cache managers together with their JChannels, cache managers but not their JChannels, or
just individual caches?
Normally, if CacheViewsManagerImpl.isRunning() returns false, it means that the current
node is shutting down and surviving nodes will pick up another JGroups coordinator, which
should restart the cache view installation (with a higher view id). Since that's not
happening, I'm thinking that maybe the cache manager on the coordinator is shut down,
but the JGroups channel keeps running (which we don't support AFAIK).
It's also odd that the remote node would throw a "Received cache view prepare
request after the local node has already shut down" exception while it's joining,
because the first thing CacheViewsManagerImpl.join() does is install the
StateTransferManager as a listener, and the listener is not removed until the cache is
stopped. I'm not sure what to make of that, but if it's a race condition I believe
waiting for a few seconds between stop and start should work around the problem.
View installation loop when restarting cache on multiple nodes
--------------------------------------------------------------
Key: ISPN-2892
URL:
https://issues.jboss.org/browse/ISPN-2892
Project: Infinispan
Issue Type: Bug
Affects Versions: 5.1.7.Final
Reporter: Dennis Reed
Assignee: Mircea Markus
Restarting a cache on multiple nodes at the same time can cause the following error:
ERROR [org.infinispan.cacheviews.CacheViewsManagerImpl] (CacheViewInstaller-19,node1/web)
ISPN000172: Failed to prepare view CacheView{viewId=18, members=[node2/web]} for cache
default-host/test, rolling back to view CacheView{viewId=17, members=[node1/web,
node2/web]}: java.util.concurrent.ExecutionException: org.infinispan.CacheException:
java.lang.IllegalStateException: default-host/test: Received cache view prepare request
after the local node has already shut down
After the initial error, the following error began repeating every second for a few
minutes until BaseStateTransferManagerImpl.waitForJoinToComplete() timed out and the cache
failed to start:
ERROR [org.infinispan.cacheviews.CacheViewsManagerImpl] (CacheViewInstaller-19,node1/web)
ISPN000172: Failed to prepare view CacheView{viewId=21, members=[node2/web]} for cache
default-host/test, rolling back to view CacheView{viewId=20, members=[]}:
java.util.concurrent.ExecutionException: org.infinispan.CacheException:
java.lang.IllegalStateException: Cannot prepare new view CacheView{viewId=21,
members=[node2/web]} on cache default-host/test, we are currently preparing view
CacheView{viewId=18, members=[node2/web]}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:
http://www.atlassian.com/software/jira