Hi,
Starting a new topic just for visibility.
I am trying to build a jboss cluster and having some issues. I've downloaded fresh
copy of jboss-4.2.2 and deployed on 3 servers. I configured cluster-service.xml and
jboss-web-cluster/META-INF/jboss-service.xml and put udp ip and port in there. Here are
the portions of my xml files:
cluster-service.xml
<Config>
| <UDP mcast_addr="${jboss.partition.udpGroup:228.20.20.35}"
| mcast_port="${jboss.hapartition.mcast_port:45667}"
| tos="8"
| ucast_recv_buf_size="20000000"
| ucast_send_buf_size="640000"
| mcast_recv_buf_size="25000000"
| mcast_send_buf_size="640000"
| loopback="false"
| discard_incompatible_packets="true"
| enable_bundling="false"
| max_bundle_size="64000"
| max_bundle_timeout="30"
| use_incoming_packet_handler="true"
| use_outgoing_packet_handler="false"
| ip_ttl="${jgroups.udp.ip_ttl:8}"
| down_thread="false" up_thread="false"/>
| <PING timeout="2000"
| down_thread="false" up_thread="false"
num_initial_members="3"/>
| <MERGE2 max_interval="100000"
| down_thread="false" up_thread="false"
min_interval="20000"/>
| <FD_SOCK down_thread="false"
up_thread="false"/>
| <FD timeout="10000" max_tries="5"
down_thread="false" up_thread="false" shun="true"/>
| <VERIFY_SUSPECT timeout="1500" down_thread="false"
up_thread="false"/>
| <pbcast.NAKACK max_xmit_size="60000"
| use_mcast_xmit="false" gc_lag="0"
| retransmit_timeout="300,600,1200,2400,4800"
| down_thread="false" up_thread="false"
| discard_delivered_msgs="true"/>
| <UNICAST timeout="300,600,1200,2400,3600"
| down_thread="false" up_thread="false"/>
| <pbcast.STABLE stability_delay="1000"
desired_avg_gossip="50000"
| down_thread="false" up_thread="false"
| max_bytes="400000"/>
| <pbcast.GMS print_local_addr="true"
join_timeout="3000"
| down_thread="false" up_thread="false"
| join_retry_timeout="2000" shun="true"
| view_bundling="true"/>
| <FRAG2 frag_size="60000" down_thread="false"
up_thread="false"/>
| <pbcast.STATE_TRANSFER down_thread="false"
up_thread="false" use_flush="false"/>
| </Config>
|
jboss-web-cluster.sar/META-INF/jboss-service.xml
<config>
| <UDP
mcast_addr="${jboss.partition.udpGroup:230.20.20.74}"
| mcast_port="${jboss.webpartition.mcast_port:45679}"
| tos="8"
| ucast_recv_buf_size="20000000"
| ucast_send_buf_size="640000"
| mcast_recv_buf_size="25000000"
| mcast_send_buf_size="640000"
| loopback="false"
| discard_incompatible_packets="true"
| enable_bundling="false"
| max_bundle_size="64000"
| max_bundle_timeout="30"
| use_incoming_packet_handler="true"
| use_outgoing_packet_handler="false"
| ip_ttl="${jgroups.udp.ip_ttl:8}"
| down_thread="false"
up_thread="false"/>
| <PING timeout="2000"
| down_thread="false" up_thread="false"
num_initial_members="3"/>
| <MERGE2 max_interval="100000"
| down_thread="false" up_thread="false"
min_interval="20000"/>
| <FD_SOCK down_thread="false"
up_thread="false"/>
| <FD timeout="10000" max_tries="5"
down_thread="false" up_thread="false" shun="true"/>
| <VERIFY_SUSPECT timeout="1500"
down_thread="false" up_thread="false"/>
| <pbcast.NAKACK max_xmit_size="60000"
| use_mcast_xmit="false"
gc_lag="0"
|
retransmit_timeout="300,600,1200,2400,4800"
| down_thread="false"
up_thread="false"
| discard_delivered_msgs="true"/>
| <UNICAST timeout="300,600,1200,2400,3600"
| down_thread="false"
up_thread="false"/>
| <pbcast.STABLE stability_delay="1000"
desired_avg_gossip="50000"
| down_thread="false"
up_thread="false"
| max_bytes="400000"/>
| <pbcast.GMS print_local_addr="true"
join_timeout="3000"
| down_thread="false"
up_thread="false"
| join_retry_timeout="2000"
shun="true"
| view_bundling="true"
| view_ack_collection_timeout="5000"/>
| <FC max_credits="2000000"
down_thread="false" up_thread="false"
| min_threshold="0.10"/>
| <FRAG2 frag_size="60000"
down_thread="false" up_thread="false"/>
| <pbcast.STATE_TRANSFER down_thread="false"
up_thread="false" use_flush="false"/>
| </config>
When i start the servers in cluster, 2 of the servers join the cluster fine but adding
third one creates problem and is very slow during startup and gets halt for some time at:
2008-09-01 13:39:51,150 INFO
[org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Number of cluster
members: 3
| 2008-09-01 13:39:51,150 INFO
[org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Other members: 2
| 2008-09-01 13:39:51,151 INFO
[org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Fetching state (will wait
for 30000 milliseconds):
| 2008-09-01 13:39:51,194 INFO
[org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] state was retrieved
successfully (in 43 milliseconds)
|
I didn't deploy any application on the servers, these are just fresh servers. I've
tried the same with jboss-4.0.2 and it works fine on the same network with same clustering
configs.
Is there anything changed in new jboss-4.2.2 for the way jboss servers communicate for
clustering? any network configurations i can check? Please help
Thanks
View the original post :
http://www.jboss.com/index.html?module=bb&op=viewtopic&p=4173679#...
Reply to the post :
http://www.jboss.com/index.html?module=bb&op=posting&mode=reply&a...