[mod_cluster-dev] Handling crashed/hung AS nodes

Bela Ban bela at jboss.com
Fri Mar 27 02:03:41 EDT 2009



Paul Ferraro wrote:
> Currently, the HAModClusterService (where httpd communication is
> coordinated by an HA singleton) does not react to crashed/hung members.
> Specifically, when the HA singleton gets a callback that the group
> membership changes, it does not send any REMOVE-APP messages to httpd on
> behalf of the member that just left. Currently, httpd will detect the
> failure (via a disconnected socket) on its own and sets its internal
> state accordingly, e.g. a STATUS message will return NOTOK.

I assume this feature is on the roadmap ? Brian and I have been talking 
about it and I think this is a crucial feature. I suggest we add this 
before GA. Thoughts ?

-- 
Bela Ban
Lead JGroups / JBoss Clustering team
JBoss - a division of Red Hat



More information about the mod_cluster-dev mailing list