[jboss-remoting-issues] [JBoss JIRA] Commented: (JBREM-947) ConnectionValidator hangs when server dies

Ron Sigal (JIRA) jira-events at lists.jboss.org
Thu Apr 24 20:21:08 EDT 2008


    [ http://jira.jboss.com/jira/browse/JBREM-947?page=comments#action_12410631 ] 
            
Ron Sigal commented on JBREM-947:
---------------------------------

Actually, it's a new daemon Thread for each ping.  If this were on a critical path I would have done something more clever, but we're talking about one new object every few seconds, so I doubt that it will have much of an impact.  Nonetheless, I'll do some profiling before the final release to check for hot spots.

> ConnectionValidator hangs when server dies
> ------------------------------------------
>
>                 Key: JBREM-947
>                 URL: http://jira.jboss.com/jira/browse/JBREM-947
>             Project: JBoss Remoting
>          Issue Type: Bug
>      Security Level: Public(Everyone can see) 
>    Affects Versions: 2.2.2.SP7
>            Reporter: Tim Fox
>         Assigned To: Ron Sigal
>             Fix For: 2.4.0.CR2
>
>
> If the connection between client and server is pulled (pull the real cable) or the entire server suddenly dies, then the connection won't be closed from the server (unlike a kill -9 of the server where the OS will terminate that processses connections), so the client making the write() or read() on the socket won't receive an exception.
> In the eyes of TCP the connection is still alive and the read/write will block until the socket timeout is reached.
> Typically the socket timeout will be much higher than the desired failure detection time (the validation interval), but currently failure will never be detected in this situation before the socket timeout time.
> Remoting should not be dependent on the socket timeout for failure detection, the connetion validation and socket timeout should be possible to be configured separately.
> E.g. you might want to configure a socket timeout of 60 seconds, but a connection validation frequency (ping) of 5 seconds. Currently this is not possible.
> The current implementation gives inconsistent behaviour depending on how the server died - i.e. whether the process died (e.g. kill -9) or the cable was pulled or the entire server disappeared.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.jboss.com/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        



More information about the jboss-remoting-issues mailing list