[jboss-user] [Clustering/JBoss] - JBoss-4.2.2 Clustering

agohar do-not-reply at jboss.com
Tue Sep 2 04:31:40 EDT 2008


Hi,

Starting a new topic just for visibility.

I am trying to build a jboss cluster and having some issues. I've downloaded fresh copy of jboss-4.2.2 and deployed on 3 servers. I configured cluster-service.xml and jboss-web-cluster/META-INF/jboss-service.xml and put udp ip and port in there. Here are the portions of my xml files:

cluster-service.xml
         <Config>
  |             <UDP mcast_addr="${jboss.partition.udpGroup:228.20.20.35}"
  |                  mcast_port="${jboss.hapartition.mcast_port:45667}"
  |                  tos="8"
  |                  ucast_recv_buf_size="20000000"
  |                  ucast_send_buf_size="640000"
  |                  mcast_recv_buf_size="25000000"
  |                  mcast_send_buf_size="640000"
  |                  loopback="false"
  |                  discard_incompatible_packets="true"
  |                  enable_bundling="false"
  |                  max_bundle_size="64000"
  |                  max_bundle_timeout="30"
  |                  use_incoming_packet_handler="true"
  |                  use_outgoing_packet_handler="false"
  |                  ip_ttl="${jgroups.udp.ip_ttl:8}"
  |                  down_thread="false" up_thread="false"/>
  |             <PING timeout="2000"
  |                   down_thread="false" up_thread="false" num_initial_members="3"/>
  |             <MERGE2 max_interval="100000"
  |                     down_thread="false" up_thread="false" min_interval="20000"/>
  |             <FD_SOCK down_thread="false" up_thread="false"/>
  |             <FD timeout="10000" max_tries="5" down_thread="false" up_thread="false" shun="true"/>
  |             <VERIFY_SUSPECT timeout="1500" down_thread="false" up_thread="false"/>
  |             <pbcast.NAKACK max_xmit_size="60000"
  |                     use_mcast_xmit="false" gc_lag="0"
  |                     retransmit_timeout="300,600,1200,2400,4800"
  |                     down_thread="false" up_thread="false"
  |                     discard_delivered_msgs="true"/>
  |             <UNICAST timeout="300,600,1200,2400,3600"
  |                     down_thread="false" up_thread="false"/>
  |             <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000"
  |                     down_thread="false" up_thread="false"
  |                     max_bytes="400000"/>
  |             <pbcast.GMS print_local_addr="true" join_timeout="3000"
  |                     down_thread="false" up_thread="false"
  |                     join_retry_timeout="2000" shun="true"
  |                     view_bundling="true"/>
  |             <FRAG2 frag_size="60000" down_thread="false" up_thread="false"/>
  |             <pbcast.STATE_TRANSFER down_thread="false" up_thread="false" use_flush="false"/>
  |          </Config>
  | 

jboss-web-cluster.sar/META-INF/jboss-service.xml
            <config>
  |                 <UDP mcast_addr="${jboss.partition.udpGroup:230.20.20.74}"
  |                      mcast_port="${jboss.webpartition.mcast_port:45679}"
  |                              tos="8"
  |                              ucast_recv_buf_size="20000000"
  |                              ucast_send_buf_size="640000"
  |                              mcast_recv_buf_size="25000000"
  |                              mcast_send_buf_size="640000"
  |                              loopback="false"
  |                              discard_incompatible_packets="true"
  |                              enable_bundling="false"
  |                              max_bundle_size="64000"
  |                              max_bundle_timeout="30"
  |                              use_incoming_packet_handler="true"
  |                              use_outgoing_packet_handler="false"
  |                              ip_ttl="${jgroups.udp.ip_ttl:8}"
  |                              down_thread="false" up_thread="false"/>
  |                           <PING timeout="2000"
  |                              down_thread="false" up_thread="false" num_initial_members="3"/>
  |                           <MERGE2 max_interval="100000"
  |                              down_thread="false" up_thread="false" min_interval="20000"/>
  |                           <FD_SOCK down_thread="false" up_thread="false"/>
  |                           <FD timeout="10000" max_tries="5" down_thread="false" up_thread="false" shun="true"/>
  |                           <VERIFY_SUSPECT timeout="1500" down_thread="false" up_thread="false"/>
  |                           <pbcast.NAKACK max_xmit_size="60000"
  |                                    use_mcast_xmit="false" gc_lag="0"
  |                                    retransmit_timeout="300,600,1200,2400,4800"
  |                                    down_thread="false" up_thread="false"
  |                                    discard_delivered_msgs="true"/>
  |                           <UNICAST timeout="300,600,1200,2400,3600"
  |                              down_thread="false" up_thread="false"/>
  |                           <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000"
  |                                    down_thread="false" up_thread="false"
  |                                    max_bytes="400000"/>
  |                           <pbcast.GMS print_local_addr="true" join_timeout="3000"
  |                                    down_thread="false" up_thread="false"
  |                                    join_retry_timeout="2000" shun="true"
  |                                    view_bundling="true"
  |                                    view_ack_collection_timeout="5000"/>
  |                           <FC max_credits="2000000" down_thread="false" up_thread="false"
  |                               min_threshold="0.10"/>
  |                           <FRAG2 frag_size="60000" down_thread="false" up_thread="false"/>
  |                           <pbcast.STATE_TRANSFER down_thread="false" up_thread="false" use_flush="false"/>
  |            </config>

When i start the servers in cluster, 2 of the servers join the cluster fine but adding third one creates problem and is very slow during startup and gets halt for some time at:
2008-09-01 13:39:51,150 INFO  [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Number of cluster members: 3
  | 2008-09-01 13:39:51,150 INFO  [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Other members: 2
  | 2008-09-01 13:39:51,151 INFO  [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Fetching state (will wait for 30000 milliseconds):
  | 2008-09-01 13:39:51,194 INFO  [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] state was retrieved successfully (in 43 milliseconds)
  | 

I didn't deploy any application on the servers, these are just fresh servers. I've tried the same with jboss-4.0.2 and it works fine on the same network with same clustering configs. 

Is there anything changed in new jboss-4.2.2 for the way jboss servers communicate for clustering? any network configurations i can check? Please help

Thanks

View the original post : http://www.jboss.com/index.html?module=bb&op=viewtopic&p=4173679#4173679

Reply to the post : http://www.jboss.com/index.html?module=bb&op=posting&mode=reply&p=4173679



More information about the jboss-user mailing list