jbosscache-dev November 2008

jbosscache-dev@lists.jboss.org

4 participants
6 discussions

by Manik Surtani

FYI, I have just tagged 3.0.1.GA which contains some bug fixes and a performance tweak pertaining to passivation. * [ JBCACHE-1444 ] ObjectName's validation fails for Jbosscache 3.0 on WAS 6.1 due to ":" char in name. * [ JBCACHE-1445 ] Data gravitation cleanup does not happen when using single-phase commits. * [ JBCACHE-1448 ] Jdbm and BDBJE cache loader incorrectly reading database name from location String * [ JBCACHE-1446 ] Optimize activations by minimizing … [View More]

16 years, 4 months

2
1
0 / 0

Optimize use of CacheLoader.exists() with passivation

by Brian Stansberry

I've been profiling JBoss AS web session replication, and one of the significant hits I'm seeing is from calls to File.exists() from FileCacheLoader (see attached). In turn, those calls are due calls from LegacyActivationInterceptor, particularly the removeNodeFromCacheLoader() method, which is called with every invocation.[1] Many of these calls are on the most critical path[2] so speeding them can have large implications for overall cluster performance. I'm wondering if we can be … [View More]

16 years, 4 months

2
6
0 / 0

Re: [jbosscache-dev] Cache unable to write to cluster

by Manik Surtani

On 12 Nov 2008, at 10:25, Vladimir Blagojevic wrote: > Manik Surtani wrote: >> >> Yes, this was always an issue with the way we used FLUSH - that >> someone in the group could initiate a FLUSH and then die leaving >> other members' flushBlockGates closed. TBH, apart from adding >> timeouts to the flushBlockGate, I can't see how we would get around >> this. > > Me too. I am confused how all these issues started to pop out now. > How … [View More]

16 years, 4 months

3
6
0 / 0

Re: [jbosscache-dev] Cache unable to write to cluster

by Bela Ban

Can't we unlock the blocked flushgates when a node leaves/crashes ? Vladimir Blagojevic wrote: > Manik Surtani wrote: >> >> Yes, this was always an issue with the way we used FLUSH - that >> someone in the group could initiate a FLUSH and then die leaving >> other members' flushBlockGates closed. TBH, apart from adding >> timeouts to the flushBlockGate, I can't see how we would get around >> this. > > Me too. I am confused how all these issues … [View More]

16 years, 4 months

1
0
0 / 0

Cache unable to write to cluster

by Brian Stansberry

We just found an intermittent failure in the EJB3 testsuite[1] that's more a JBC or JGroups issue. This is with JBC 3.0.0.CR4 and JG 2.6.6. I'm speculating it relates to FLUSH work Vladimir's been doing[2][3]. Issue is an inability to replicate a put: Caused by: org.jboss.cache.lock.TimeoutException: State retrieval timed out waiting for flush unblock. at org.jboss.cache.RPCManagerImpl.callRemoteMethods(RPCManagerImpl.java:455) at .... org.jboss.cache.invocation.CacheInvocationDelegate.put(… [View More]CacheInvocationDelegate.java:560) at org.jboss.ha.cachemanager.CacheManagerManagedCache.put(CacheManagerManagedCache.java:285) at org.jboss.ejb3.cache.tree.StatefulTreeCache.putInCache(StatefulTreeCache.java:511) at org.jboss.ejb3.cache.tree.StatefulTreeCache.create(StatefulTreeCache.java:123) ... 70 more Looking at RPCManagerImpl.java:455 we have: if (channel.flushSupported() && !flushBlockGate.await(configuration.getStateRetrievalTimeout(), TimeUnit.MILLISECONDS)) { throw new TimeoutException("State retrieval timed out waiting for flush unblock."); } Basically, failing on flushBlockGate.await(). Looking at use of flushBlockGate, the gate is closed in block() and opened in unblock(). *Assuming* no bug in ReclosableLatch, seems like block() is getting called here with no subsequent call to unblock(). (Unfortunately, logs related to this failure are gone, so I can't prove that.) Questions: 1) Vladimir, could the JGRP-855 issue result in block() getting called with no subsequent call to unblock(), either on the flush coordinator or on one of the other nodes? If yes, your JGRP-855 fix will probably fix this as well. 2) Looking at RPCManagerImpl.start(), it does a connect+state transfer in a try/catch where any failure should result in a CacheException being thrown from start(). That CacheException should have prevented deployment of the ejb; i.e. the call shown in the stack trace above shouldn't have happened. Only way I see it could have happened is if the node that threw above exception wasn't the flush coordinator; i.e. its cache started fine, but a problem on another node led to its block() being called with no matching unblock(). That's a big issue too, as it means a failure in one node can take down the entire cluster by leaving everyone's flushBlockGate closed. [1] https://jira.jboss.org/jira/browse/EJBTHREE-1580 [2] https://jira.jboss.org/jira/browse/JGRP-855 [3] http://www.jboss.com/index.html?module=bb&op=viewtopic&t=145138 -- Brian Stansberry Lead, AS Clustering JBoss, a division of Red Hat brian.stansberry(a)redhat.com [View Less]

16 years, 4 months

2
1
0 / 0

Tree structure or flat structure for JBossCache ?

by Bela Ban

There's been recent discussions about whether a tree structure (as in the current JBossCache) or a flat (hashmap based) structure would be preferred by developers. We're interested in getting feedback from the community. There is a poll and discussion forum at http://www.jboss.com/index.html?module=bb&op=viewtopic&t=145055, we're glad for feedback and comments. Cheers, -- Bela Ban Lead JGroups / Clustering Team JBoss - a division of Red Hat

16 years, 5 months

1
0
0 / 0

← Newer
1
Older →

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

jbosscache-dev November 2008