[Clustering] - Re: Cluster falls apart: FD_SOCK errors - jboss-user

Monday, 12 October 2009

A tip for those who come this way after us: we found a large part of the problem was that
the cluster nodes rely on being in constant communication.

If one of them is under high load (say, running some reports or something) its CPU usage
may be so high it does not respond to the cluster ping quickly enough (within 3 seconds).
The cluster then treats it as dead and removes it from the cluster, even though it is not
dead it is just busy.

We increased the org.jgroups.protocols.pbcast.GMS timeout and it helped a great deal.

View the original post :
http://www.jboss.org/index.html?module=bb&op=viewtopic&p=4259958#...

Reply to the post :
http://www.jboss.org/index.html?module=bb&op=posting&mode=reply&a...

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

[Clustering] - Re: Cluster falls apart: FD_SOCK errors