[jbosscache-dev] Re: [Fwd: New case comment notification. Case Number 00017543]

Thu Aug 9 11:59:53 EDT 2007

Jimmy Wilson wrote:
> Good to know.  I hadn't thought of this b/c most everything I've ever 
> done was with Async.
> 
> JBC should throw an InvalidStateException when customers try to do this 
> (and are using Sync).  You think?  If you can screw up the config, we 
> should tell them authoritatively that it's screwed up during startup 
> (aka fail fast).

That's a good idea:

If SyncReplTimeout <= LockAcquisitionTimeout && SYNC (INVL or REPL)
throw IllegalStateException

Manik & the rest, what do you think? JIRA?

Jimmy, maybe you could implement this if Manik & the rest are happy with 
it?

> 
> Jimmy
> 
> -------- Original Message --------
> Subject: New case comment notification.  Case Number 00017543
> Date: Wed, 8 Aug 2007 14:48:02 +0000 (GMT)
> From: Galder Zamarreno <gzamarreno at jboss.com>
> To: jboss-support-cache at redhat.com <jboss-support-cache at redhat.com>
> 
> Galder Zamarreno has added a comment to case 00017543 : "JBoss cache 
> Write Lock Timeout".  Please read the comment below and then click on 
> the link to respond appropriately.
> 
> Comment:
> Comments on the information provided so far:
> 
> 1.- your SyncReplTimeout and LockAcquisitionTimeout are not set correctly:
> 
> The values of these two properties have to be set accordingly. 
> SyncReplTimeout
> should always be bigger than LockAcquisitionTimeout
> 
> If the values are the same or SyncReplTimeout is lower:
> 
> Node A sends out a replication.
> Node B blocks x seconds waiting for a lock.
> Node A times out on the replication and throws and exception.
> Node B times out on the lock and throws and exception.
> 
> If the SyncReplTimeout is longer, you get:
> 
> Node A sends out a replication.
> Node B blocks x seconds waiting for a lock.
> Node B times out on the lock and throws an TimeoutException.
> Exception propagates back to Node A and is rethrown.
> 
> Either way both nodes get a TimeoutException, but with the latter the 
> exception is more
>  meaningful. With the former we don't know why Node A threw a 
> TimeoutException --
> maybe the underlying cause was a cluster communication problem rather 
> than a lock
> conflict.
> 
> 2.- How many nodes are in the cluster?
> 
> 3.- What is the name of the node for which you provided the logs?is it 
> srv-node1?
> 
> 4.- Could you provide server.log files for other nodes if there're others?
> 
> 5.- If this is occurring in production, can you reproduce this issue 
> outside of production,
> i.e. in staging/pre-production environment?
> 
> https://na1.salesforce.com/50030000003Pqkq

-- 
Galder Zamarreño
Sr. Software Maintenance Engineer
JBoss, a division of Red Hat