[jboss-dev-forums] [Design of Clustering on JBoss (Clusters/JBoss)] - Re: Handling of 'deployments taking ~1 minute' scenario

bstansberry@jboss.com do-not-reply at jboss.com
Tue Jan 15 13:11:13 EST 2008


IIRC, the 1 min deployment scenario was due to a deadlock where the AS code used the JGroups up_handler to make an RPC, thus preventing the RPC response from arriving.  Wasn't this a bug that was fixed?

In that case, it was the node sending the RPC that was faulty.  In some other case where a remote node "isn't responding" all you could do would be to send a message to "commit suicide" -- there's no mechanism to evict a node from the group outside of JGroups' own failure detection.  But if the node isn't responding to RPCs, it likely wouldn't respond to the "commit suicide" either.

Logically, I could see some benefit in some sort of self-healing approach where cluster members detect faults and restart themselves or send commands to others telling them to restart.  But this will take a lot of thought.

View the original post : http://www.jboss.com/index.html?module=bb&op=viewtopic&p=4120188#4120188

Reply to the post : http://www.jboss.com/index.html?module=bb&op=posting&mode=reply&p=4120188



More information about the jboss-dev-forums mailing list