[JBoss JIRA] (JGRP-2266) RouterStubManager.run() endless reconnect loop burning a CPU

Friday, 27 April 2018

Emmeran Seehuber created JGRP-2266:
--------------------------------------

             Summary: RouterStubManager.run() endless reconnect loop burning a CPU
                 Key: JGRP-2266
                 URL: https://issues.jboss.org/browse/JGRP-2266
             Project: JGroups
          Issue Type: Bug
    Affects Versions: 4.0.11
         Environment: Small cluster (~20 nodes); some nodes are connected through OpenVPN
                      tunnels. macOS and Linux nodes.
            Reporter: Emmeran Seehuber
            Assignee: Bela Ban
         Attachments: cs_stack.xml

RouterStubManager.run() loops to reconnect all stubs that are currently not connected.
When, for whatever reason, one of these stubs cannot be connected, the method spins
in an endless loop and burns a CPU.

E.g. sometimes the VPN tunnel is down or one of the TCPGOSSIP hosts is down.

No idea if it is really required to loop here, but it should at least do a
Thread.yield() or sleep() between attempts. As this run() method is called periodically,
an endless loop should not be required here, should it? Maybe loop only e.g. three times
and then give up?
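
To illustrate the suggestion, here is a minimal sketch of a bounded retry that sleeps
between rounds. It is not the RouterStubManager code: the class BoundedReconnect, the
method reconnect() and all of its parameters are hypothetical names, and each stub is
modelled as a plain BooleanSupplier that returns true once it has connected.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BooleanSupplier;

/** Hypothetical helper for illustration only; not part of JGroups. */
final class BoundedReconnect {

    /**
     * Tries every pending connect action at most maxAttempts times and sleeps
     * between rounds, doubling the pause up to maxDelayMs.
     *
     * @return the number of targets that are still disconnected
     */
    static int reconnect(List<BooleanSupplier> pending, int maxAttempts,
                         long initialDelayMs, long maxDelayMs) {
        List<BooleanSupplier> remaining = new ArrayList<>(pending);
        long delay = initialDelayMs;
        for (int attempt = 1; attempt <= maxAttempts && !remaining.isEmpty(); attempt++) {
            // Run each connect action once; drop those that report success.
            remaining.removeIf(BooleanSupplier::getAsBoolean);
            if (remaining.isEmpty() || attempt == maxAttempts)
                break; // done, or give up until the next periodic run
            try {
                Thread.sleep(delay); // yield the CPU instead of spinning
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt flag
                break;
            }
            delay = Math.min(delay * 2, maxDelayMs);
        }
        return remaining.size();
    }
}
```

With maxAttempts set to 3, an unreachable host costs three connect attempts and two
short sleeps per periodic run instead of a busy loop. Whether giving up is acceptable
depends on how often run() is rescheduled, which this sketch does not model.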

As all nodes in the cluster are iMac workstations or dedicated Linux render slaves,
burning a CPU is very annoying. The CPU should rather be spent on the Blender render jobs
or on the interactive work people are doing on their iMacs. (JGroups is used here to
distribute render jobs within the cluster.)

--
This message was sent by Atlassian JIRA
(v7.5.0#75005)
