[jboss-jira] [JBoss JIRA] (AS7-4841) Poor handling of network failures between master Host Controller and slaves
Brian Stansberry (JIRA)
jira-events at lists.jboss.org
Mon May 21 02:58:17 EDT 2012
Brian Stansberry created AS7-4841:
-------------------------------------
Summary: Poor handling of network failures between master Host Controller and slaves
Key: AS7-4841
URL: https://issues.jboss.org/browse/AS7-4841
Project: Application Server 7
Issue Type: Bug
Components: Domain Management
Affects Versions: 7.1.2.Final (EAP), 7.1.1.Final
Reporter: Brian Stansberry
Assignee: Brian Stansberry
Priority: Blocker
Fix For: 7.1.3.Final (EAP), 7.2.0.Alpha1
The detection and handling of network disruptions between slave HCs and masters is not working properly.
1) Network disruptions are not being detected until an operation is invoked from the master to the slave, and the detection will not happen until the TCP layer give up trying to transmit the message, which could take up to 20 mins with default linux tcp settings.
2) If no operation is invoked, or the TCP level detection has not happened yet, the master still regards the slave as registered. So if the network heals and the slave tries to reregister, the re-registration attempt will be denied.
To solve this I am going to re-implement a variant of the ping-pong stuff we had in 7.0.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.jboss.org/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
More information about the jboss-jira
mailing list