[jboss-user] [Clustering/JBoss] - Re: Nodes not join Cluster - UDP discarded Message

jboss_cody do-not-reply at jboss.com
Thu Nov 1 13:56:54 EDT 2007


Thank you for the reply Bela,

Before I answer your questions, I thought that I should also mention that I am using vmware Server w/Centos5  to develop my cluster. 

I have successfully configured two separate VM's for my cluster. (vmware Server only allows up to 4 virtual NICs per vm)

I started 3 nodes on one vm, and they form a cluster.

Then I start 3 more nodes on the other vm and they form a cluster.

I then changed the cluster-configurations on the 2nd set of nodes, to match those of the 1st set of nodes. 

I start the 1st set of nodes. (ips .11, .12, .13). I start the 2nd set of nodes (.21, .22, .23) on the other vm.

I am using UDP transport with the configurations of 'all'. I can see the traffic taking place, but my 2nd set of nodes are unable to JOIN the 1st set of nodes. 

I know my problem exists in my UDP configuration, but that's as far as I've gotten. 


anonymous wrote : Are the 3 additional boxes in the same subnet as the others ?

     -Yes, each share the same subnet addr, I simply copied the network - configurations and only made modifications to the actual host names and ip addrs.

anonymous wrote : Do they bind to correct addresses ? 127.0.0.1 is *not* one ! 

   -Yes, I have the -b option included in the startup scripts of each instance/node.

anonymous wrote : Any firewalls on ? If so, turn off to see whether the JOIN is successful

  -No, at the initial configuration of VMware server, we disabled SELinux and any other firewall that might interfere.

anonymous wrote : Do you use VLANs ? If so, the 3 additional boxes need to be in the same VLAN as the others. Make sure VLANs dont drop IP multicast packets 

  -??? How can I find out this info.

P.S. I've said this before, but just to make it clear, I am a newbie to all of these concepts, so please work with me here.

Sorry for any stupid, obvious questions.

anonymous wrote : If nothing else works, you can always fallback to TCP:TCPPING and
  | - list your 6 nodes in TCPPING explicitly

This was my next plan, but from what I've read, there is extra network traffic using this approach.


Oh yeah, Here is an excerpt from a node in the 1st set:


  | 2007-10-31 22:55:40,984 WARN  [org.jgroups.protocols.pbcast.NAKACK] 192.168.202.11:32789] discarded message from non-member 192.168.202.21:32796, my view is [192.168.202.11:32789|0] [192.168.202.11:32789]
  | 2007-10-31 22:55:41,516 WARN  [org.jgroups.protocols.pbcast.NAKACK] 192.168.202.11:32789] discarded message from non-member 192.168.202.21:32796, my view is [192.168.202.11:32789|0] [192.168.202.11:32789]
  | 2007-10-31 22:55:48,172 WARN  [org.jgroups.protocols.pbcast.NAKACK] 192.168.202.11:32787] discarded message from non-member 192.168.202.21:32794, my view is [192.168.202.11:32787|0] [192.168.202.11:32787]
  | 2007-10-31 22:55:48,832 WARN  [org.jgroups.protocols.pbcast.NAKACK] 192.168.202.11:32787] discarded message from non-member 192.168.202.21:32794, my view is [192.168.202.11:32787|0] [192.168.202.11:32787]
  | 2007-10-31 22:55:50,509 DEBUG [org.jboss.web.tomcat.service.session.JBossCacheManager] Looking for sessions that have expired ...
  | 2007-10-31 22:55:53,055 WARN  [org.jgroups.protocols.pbcast.GMS] merge responses from subgroup coordinators <= 1 ([]). Cancelling merge
  | 2007-10-31 22:55:53,056 ERROR [org.jgroups.protocols.pbcast.GMS] coords or merge_id == null
  | 2007-10-31 22:55:53,056 ERROR [org.jgroups.protocols.pbcast.GMS] coords or merge_id == null
  | 2007-10-31 22:55:55,512 WARN  [org.jgroups.protocols.pbcast.GMS] merge responses from subgroup coordinators <= 1 ([]). Cancelling merge
  | 2007-10-31 22:55:55,513 ERROR [org.jgroups.protocols.pbcast.GMS] coords or merge_id == null
  | 2007-10-31 22:55:55,513 ERROR [org.jgroups.protocols.pbcast.GMS] coords or merge_id == null
  | 2007-10-31 22:56:00,514 DEBUG [org.jboss.web.tomcat.service.session.JBossCacheManager] Looking for sessions that have expired ...
  | 2007-10-31 22:56:00,590 WARN  [org.jgroups.protocols.pbcast.GMS] merge responses from subgroup coordinators <= 1 ([]). Cancelling merge
  | 
  | 
  | 

 boolean isBad = (this.Network_traffic == "NOT GOOD") ;
 System.out.print(isBad.toString());

->NOT GOOD

Thanks again : )

View the original post : http://www.jboss.com/index.html?module=bb&op=viewtopic&p=4101024#4101024

Reply to the post : http://www.jboss.com/index.html?module=bb&op=posting&mode=reply&p=4101024



More information about the jboss-user mailing list