On 3/18/11 10:35 PM, Dave wrote:
Won't be able to get CR4 uploaded, policy dictates that I wait
until final
release. However, I was able to get 431 nodes up and running as a replicated
cluster and 115 nodes up as a distributed cluster. For the 430 node cache, I
was able to get it started with no problems about 50% of the time. When they
formed multiple clusters they merged together only some of the time. It
really does appear to be a startup issue at this point. We have not pushed
it hard enough yet to see what happens at this scale under load.
Any idea when CR4 will be FINAL?
Are there any tools to help diagnose problems / performance at this scale (I
ended up writing my own monitor program)?
Yes, there's probe.sh at the JGroups level. I created a JIRA to provide
a sample for large clusters. You said you based your config on udp.xml,
correct ?
[1]
https://issues.jboss.org/browse/JGRP-1307
--
Bela Ban
Lead JGroups / Clustering Team
JBoss