[infinispan-issues] [JBoss JIRA] (ISPN-2357) New coordinator installs old CH containing crashed members

Erik Salter (JIRA) jira-events at lists.jboss.org
Sun Sep 30 21:31:02 EDT 2012


    [ https://issues.jboss.org/browse/ISPN-2357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12722690#comment-12722690 ] 

Erik Salter commented on ISPN-2357:
-----------------------------------

After looking at the trace logs, it looks like the coordinator dying abruptly fails to send a LEAVE ClusterControlCommand, so the first thing the new coordinator does is do a GET_STATUS without consulting the view it received.

Maybe the first thing the new coordinator should do is ask the nodes to reconcile their current views with the views received from JGroupsTransport?
                
> New coordinator installs old CH containing crashed members
> ----------------------------------------------------------
>
>                 Key: ISPN-2357
>                 URL: https://issues.jboss.org/browse/ISPN-2357
>             Project: Infinispan
>          Issue Type: Feature Request
>          Components: State transfer
>    Affects Versions: 5.2.0.Alpha4
>            Reporter: Erik Salter
>            Assignee: Mircea Markus
>
> After merging the round of pull requests today (including ISPN-2349), there still is an issue where a killed node can appear in the CH even after JGroups passes up the new view.  This appears to happen when the old coordinator was killed.
> Obviously, if the CH contains the old member, writes that map to the segments previously owned by the dead node will fail.
> Here are ISPN trace logs.  You are looking for erm-cluster.
> http://dl.dropbox.com/u/50401510/5.2.0.ALPHA4/relay2/10.30.12.83/server.log.gz (phl-dg1)
> http://dl.dropbox.com/u/50401510/5.2.0.ALPHA4/relay2/10.30.12.84/server.log.gz (phl-dg2)
> http://dl.dropbox.com/u/50401510/5.2.0.ALPHA4/relay2/10.30.12.83/server.log.gz (phl-dg3, the node that was killed)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


More information about the infinispan-issues mailing list