[jboss-jira] [JBoss JIRA] (AS7-4841) Poor handling of network failures between master Host Controller and slaves

Brian Stansberry (JIRA) jira-events at lists.jboss.org
Mon May 21 02:58:17 EDT 2012


Brian Stansberry created AS7-4841:
-------------------------------------

             Summary: Poor handling of network failures between master Host Controller and slaves
                 Key: AS7-4841
                 URL: https://issues.jboss.org/browse/AS7-4841
             Project: Application Server 7
          Issue Type: Bug
          Components: Domain Management
    Affects Versions: 7.1.2.Final (EAP), 7.1.1.Final
            Reporter: Brian Stansberry
            Assignee: Brian Stansberry
            Priority: Blocker
             Fix For: 7.1.3.Final (EAP), 7.2.0.Alpha1


The detection and handling of network disruptions between slave HCs and masters is not working properly.

1) Network disruptions are not detected until an operation is invoked from the master to the slave, and even then detection does not happen until the TCP layer gives up trying to transmit the message, which could take up to 20 minutes with default Linux TCP settings.

2) If no operation is invoked, or the TCP-level detection has not happened yet, the master still regards the slave as registered. So if the network heals and the slave tries to re-register, the re-registration attempt will be denied.

To solve this I am going to re-implement a variant of the ping-pong mechanism we had in 7.0.
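
As a rough illustration of the ping-pong idea, and not the actual AS7 code, the master could track the time of the last pong from each slave and drop any slave that has been silent for too long, so that a later re-registration is accepted. The class name, method names, and timeout value below are all hypothetical; the clock is injected only to make the sketch easy to exercise.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.LongSupplier;

/**
 * Hypothetical master-side registry of slave Host Controllers.
 * A sketch of heartbeat-based expiry, not the real AS7 implementation.
 */
public class SlaveHeartbeatRegistry {
    // Slave name -> timestamp (ms) of the last pong (or of registration).
    private final Map<String, Long> lastPongMillis = new ConcurrentHashMap<>();
    private final long timeoutMillis;
    private final LongSupplier clock;

    public SlaveHeartbeatRegistry(long timeoutMillis, LongSupplier clock) {
        this.timeoutMillis = timeoutMillis;
        this.clock = clock;
    }

    /**
     * Registers a slave. Stale entries are expired first, so a slave that
     * went silent during a network outage can register again afterwards.
     * Returns false if the slave is still regarded as registered.
     */
    public boolean register(String slave) {
        expireStale();
        return lastPongMillis.putIfAbsent(slave, clock.getAsLong()) == null;
    }

    /** Called when a pong arrives; ignored for slaves that are not registered. */
    public void pongReceived(String slave) {
        lastPongMillis.computeIfPresent(slave, (name, last) -> clock.getAsLong());
    }

    /** Removes every slave whose last pong is older than the timeout. */
    public void expireStale() {
        long now = clock.getAsLong();
        lastPongMillis.values().removeIf(last -> now - last > timeoutMillis);
    }

    public boolean isRegistered(String slave) {
        return lastPongMillis.containsKey(slave);
    }
}
```

In a real deployment expireStale() would presumably run on a timer alongside the pings themselves, so that failure detection no longer depends on an operation being invoked or on TCP retransmission timeouts.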

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.jboss.org/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira