[jboss-jira] [JBoss JIRA] (JGRP-2040) Seeing a OOM in JGroup 3.4

Kshitiz Saxena (JIRA) issues at jboss.org
Mon May 9 02:03:00 EDT 2016


    [ https://issues.jboss.org/browse/JGRP-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202414#comment-13202414 ] 

Kshitiz Saxena commented on JGRP-2040:
--------------------------------------

Here is the output of the netstat command on each node:

	NODE1

	/home/gis/si525/properties>netstat -na | grep 35060                                      
	tcp        0      0 9.155.214.204:35060         0.0.0.0:*                   LISTEN       
	tcp        0      0 9.155.214.204:39696         9.155.214.109:35060         ESTABLISHED  
	/home/gis/si525/properties>netstat -na | grep 35061                                      
	tcp        0      0 9.155.214.204:35061         0.0.0.0:*                   LISTEN       
	tcp        0      0 9.155.214.204:35061         9.155.214.204:37203         ESTABLISHED  
	tcp        0      0 9.155.214.204:37203         9.155.214.204:35061         ESTABLISHED  
	tcp        0      0 9.155.214.204:37304         9.155.214.109:35061         ESTABLISHED  
	/home/gis/si525/properties>netstat -na | grep 35062                                      
  

	NODE2

	/home/gis>netstat -na | grep 35060                                                       
	tcp        0      0 9.155.214.109:35060         0.0.0.0:*                   LISTEN       
	tcp        0      0 9.155.214.109:35060         9.155.214.204:39696         ESTABLISHED  
	/home/gis>netstat -na | grep 35061                                                       
	tcp        0      0 9.155.214.109:35061         0.0.0.0:*                   LISTEN       
	tcp        0      0 9.155.214.109:35061         9.155.214.109:40967         ESTABLISHED  
	tcp        0      0 9.155.214.109:35061         9.155.214.204:37304         ESTABLISHED  
	tcp        0      0 9.155.214.109:40967         9.155.214.109:35061         ESTABLISHED  
	/home/gis>netstat -na | grep 35062       


Below are the JGroups properties that we have defined on each node:

NODE1
jgroups_cluster.property_string=TCP(bind_addr=9.155.214.204;bind_port=35061):TCPPING(initial_hosts=9.155.214.204[35061],9.155.214.109[35061];port_range=1;timeout=5000;num_initial_members=2):MERGE2(min_interval=3000;max_interval=5000):FD_ALL(interval=5000;timeout=20000):FD(timeout=5000;max_tries=48;):VERIFY_SUSPECT(timeout=1500):pbcast.NAKACK(retransmit_timeout=100,200,300,600,1200,2400,4800;discard_delivered_msgs=true):pbcast.STABLE(stability_delay=1000;desired_avg_gossip=20000;max_bytes=0):pbcast.GMS(print_local_addr=true;join_timeout=5000)

jgroups_cluster.distribution_property_string=TCP(bind_port=35060;thread_pool_rejection_policy=run):TCPPING(initial_hosts=up51to52[35060];port_range=1;timeout=5000;num_initial_members=2):MERGE2(min_interval=3000;max_interval=5000):FD_SOCK:FD(timeout=5000;max_tries=48;):VERIFY_SUSPECT(timeout=1500):pbcast.NAKACK(retransmit_timeout=3000;discard_delivered_msgs=true):pbcast.STABLE(stability_delay=1000;desired_avg_gossip=20000;max_bytes=0):pbcast.GMS(join_timeout=5000;print_local_addr=true)

jgroups_cluster.lock.protocolStack=TCP(bind_addr=9.155.214.204;bind_port=35062):TCPPING(initial_hosts=9.155.214.204[35062],9.155.214.109[35062];port_range=1;timeout=5000;num_initial_members=2):MERGE2(min_interval=3000;max_interval=5000):FD_ALL(interval=5000;timeout=20000):FD(timeout=5000;max_tries=48;):VERIFY_SUSPECT(timeout=1500):pbcast.NAKACK(retransmit_timeout=100,200,300,600,1200,2400,4800;discard_delivered_msgs=true):pbcast.STABLE(stability_delay=1000;desired_avg_gossip=20000;max_bytes=0):pbcast.GMS(print_local_addr=true;join_timeout=5000)


NODE2
jgroups_cluster.property_string=TCP(bind_addr=9.155.214.109;bind_port=35061):TCPPING(initial_hosts=9.155.214.109[35061],9.155.214.204[35061];port_range=1;timeout=5000;num_initial_members=2):MERGE2(min_interval=3000;max_interval=5000):FD_ALL(interval=5000;timeout=20000):FD(timeout=5000;max_tries=48;):VERIFY_SUSPECT(timeout=1500):pbcast.NAKACK(retransmit_timeout=100,200,300,600,1200,2400,4800;discard_delivered_msgs=true):pbcast.STABLE(stability_delay=1000;desired_avg_gossip=20000;max_bytes=0):pbcast.GMS(print_local_addr=true;join_timeout=5000)

jgroups_cluster.distribution_property_string=TCP(bind_port=35060;thread_pool_rejection_policy=run):TCPPING(initial_hosts=ac525n2[35060];port_range=1;timeout=5000;num_initial_members=2):MERGE2(min_interval=3000;max_interval=5000):FD_SOCK:FD(timeout=5000;max_tries=48;):VERIFY_SUSPECT(timeout=1500):pbcast.NAKACK(retransmit_timeout=3000;discard_delivered_msgs=true):pbcast.STABLE(stability_delay=1000;desired_avg_gossip=20000;max_bytes=0):pbcast.GMS(join_timeout=5000;print_local_addr=true)

jgroups_cluster.lock.protocolStack=TCP(bind_addr=9.155.214.109;bind_port=35062):TCPPING(initial_hosts=9.155.214.109[35062],9.155.214.204[35062];port_range=1;timeout=5000;num_initial_members=2):MERGE2(min_interval=3000;max_interval=5000):FD_ALL(interval=5000;timeout=20000):FD(timeout=5000;max_tries=48;):VERIFY_SUSPECT(timeout=1500):pbcast.NAKACK(retransmit_timeout=100,200,300,600,1200,2400,4800;discard_delivered_msgs=true):pbcast.STABLE(stability_delay=1000;desired_avg_gossip=20000;max_bytes=0):pbcast.GMS(print_local_addr=true;join_timeout=5000)
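The property strings above are long enough that differences between the two nodes are hard to spot by eye. As a rough aid, the sketch below splits such a string into protocol names and parameters so the two nodes can be compared. It assumes the plain "PROTO(key=value;key=value):PROTO(...)" layout seen above, with no colons inside values. The helper name parse_stack is hypothetical and is not part of JGroups.

```python
def parse_stack(stack):
    """Split a colon-separated protocol stack string into (name, params) pairs.

    Assumption: values never contain ':' (true for the strings in this
    comment, but not for IPv6 addresses, for example).
    """
    protocols = []
    for part in stack.split(":"):
        name, _, rest = part.partition("(")
        params = {}
        # Drop the closing parenthesis, then split "key=value" pairs on ';'.
        for pair in rest.rstrip(")").split(";"):
            if pair:  # skip the empty piece left by a trailing ';'
                key, _, value = pair.partition("=")
                params[key] = value
        protocols.append((name, params))
    return protocols


# Shortened copy of NODE1's jgroups_cluster.property_string from above.
stack = (
    "TCP(bind_addr=9.155.214.204;bind_port=35061):"
    "TCPPING(initial_hosts=9.155.214.204[35061],9.155.214.109[35061];port_range=1):"
    "FD_ALL(interval=5000;timeout=20000):"
    "FD(timeout=5000;max_tries=48;)"
)

for name, params in parse_stack(stack):
    print(name, params)
```

Running parse_stack on the same property from both nodes and comparing the results protocol by protocol shows which parameters differ, for example bind_addr or initial_hosts.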

> Seeing a OOM in JGroup 3.4
> --------------------------
>
>                 Key: JGRP-2040
>                 URL: https://issues.jboss.org/browse/JGRP-2040
>             Project: JGroups
>          Issue Type: Bug
>    Affects Versions: 3.4
>         Environment: Linux Operating System
>            Reporter: Kshitiz Saxena
>            Assignee: Bela Ban
>
> We are seeing an OOM in our application, and the thread dump points to JGroups.
> The thread dump shows the following:
> 3XEHSTTYPE     07:33:24:346241000 GMT j9vm.294 -  >setCurrentException index=11 constructorIndex=0 detailMessage=0000000000F61678 
> 3XEHSTTYPE     07:33:24:346183000 GMT j9mm.126 -   at 0000000050F8CD60 java/lang/Thread.run()V, jit 00007FCF323EA580, pc 00007FCF489E0A36 
> 3XEHSTTYPE     07:33:24:346179000 GMT j9mm.126 -   at 0000000053644748 *org/jgroups/blocks/TCPConnectionMap$TCPConnection$Receiver.run()*V, jit 0000000000000000, pc 00007FCF3354D334 
> 3XEHSTTYPE     07:33:24:346175000 GMT j9mm.101 -   J9AllocateIndexableObject() returning NULL! *1650814064 bytes* requested for object of class 0000000050F79700 from memory space 'Generational' id=00007FCF440427C0 
> The thread dump also contains this warning:
> WARNING : OutOfMemoryError possibly caused by 1650814064 bytes requested for object of class 0000000050F79700 from memory space 'Generational' id=00007FCF440427C0 
> Java Heap Information
> -Xmx (Maximum Java heap size) : 1280m
> -Xms (Initial Java heap size) : 640m
> -Xss (Maximum stack size for Java threads) : 256k 
> Total Java heap size: 1.25 GB
> Used Java heap size: 174.27 MB
> Free Java heap size: 1.08 GB
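The numbers in the dump can be checked with a little arithmetic. The sketch below compares the requested allocation with the -Xmx value, and also prints the request as four big-endian bytes. The second step is only a guess at a diagnostic: it assumes the receiver sizes its buffer from a 4-byte length prefix read off the socket, which this thread does not confirm.

```python
requested = 1650814064           # bytes requested, from the thread dump
max_heap = 1280 * 1024 * 1024    # -Xmx1280m expressed in bytes

# A single array larger than the maximum heap can never be allocated,
# no matter how much of the heap is currently free.
print(requested > max_heap)                  # True
print(round(requested / 1024 ** 3, 2))       # 1.54 (GiB)

# Assumption: the size came from a 4-byte big-endian length field.
# Viewing it as raw bytes shows what was on the wire in that case.
print(requested.to_bytes(4, "big"))          # b'belp'
```

The request is larger than the whole 1280m heap, so the 1.08 GB of free heap is not relevant to this failure. If the length-prefix assumption holds, the four bytes are all printable ASCII, which might indicate that something other than a JGroups peer wrote to the port. That is a hypothesis to verify and not a confirmed cause.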



--
This message was sent by Atlassian JIRA
(v6.4.11#64026)

