[JBoss JIRA] (ISPN-4755) TransportConfigurationBuilder.read(...) assumes Transport instance has default constructor
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-4755?page=com.atlassian.jira.plugin.... ]
Dan Berindei commented on ISPN-4755:
------------------------------------
I wouldn't call it a bug, since we still have this disclaimer in the {{TransportConfigurationBuilder.transport(Transport)}} javadoc:
{noformat}
* NOTE: Currently Infinispan will not use the object instance, but instead instantiate a new
* instance of the class. Therefore, do not expect any state to survive, and provide a no-args
* constructor to any instance. This will be resolved in Infinispan 5.2.0
{noformat}
TBH I would rather allow the user to inject a Channel than a Transport instance. Too bad the {{channelLookup}} property only allows you to inject a class as well.
> TransportConfigurationBuilder.read(...) assumes Transport instance has default constructor
> ------------------------------------------------------------------------------------------
>
> Key: ISPN-4755
> URL: https://issues.jboss.org/browse/ISPN-4755
> Project: Infinispan
> Issue Type: Bug
> Components: Configuration
> Affects Versions: 7.0.0.Beta2
> Reporter: Paul Ferraro
>
> While upgrading WildFly to Infinispan 7.0, I encountered an InstantiationException in a call to GlobalConfigurationBuilder.read(...). The culprit is the following commit:
> https://github.com/infinispan/infinispan/commit/b6190774725ed38f67ee7787a...
> Specifically, during TransportConfigurationBuilder.read(...), a new Transport instance is constructed using the default constructor of the transport class. However, the Transport implementation might not have a default constructor, as is the case with WildFly's ChannelTransport.
> See: https://github.com/wildfly/wildfly/blob/master/clustering/infinispan/src/...
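To make the failure mode concrete, here is a minimal, self-contained sketch (plain Java, not Infinispan code; the ChannelBasedTransport class below is a hypothetical stand-in for a transport whose only constructor takes a channel, like WildFly's ChannelTransport). Class.newInstance() reports the missing no-arg constructor as exactly the InstantiationException seen above:

```java
public class ReflectionDemo {
    // Hypothetical stand-in for a Transport implementation whose only
    // constructor takes a channel, so it has no no-arg constructor.
    static class ChannelBasedTransport {
        private final Object channel;

        ChannelBasedTransport(Object channel) {
            this.channel = channel;
        }
    }

    // What a "class-only" configuration can do with the instance it was
    // given: throw away the state and re-instantiate by reflection.
    static String tryInstantiate(Class<?> clazz) {
        try {
            clazz.newInstance();
            return "ok";
        } catch (InstantiationException e) {
            // Thrown when the class has no accessible no-arg constructor.
            return "InstantiationException";
        } catch (IllegalAccessException e) {
            return "IllegalAccessException";
        }
    }

    public static void main(String[] args) {
        // Prints "InstantiationException": the class cannot be
        // re-instantiated without its constructor argument.
        System.out.println(tryInstantiate(ChannelBasedTransport.class));
    }
}
```

This is the behavior the javadoc disclaimer warns about: as long as only the class is kept, any injected transport must provide a no-args constructor.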
--
This message was sent by Atlassian JIRA
(v6.3.1#6329)
[JBoss JIRA] (ISPN-4831) Node cannot join cluster correctly after restart
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-4831?page=com.atlassian.jira.plugin.... ]
Dan Berindei commented on ISPN-4831:
------------------------------------
Marta, could you try with 6.0.2 (or even better, 7.0.0.CR2)?
5.2.6 is quite old, and we support only the latest released version (soon to be 7.0.0.Final).
> Node cannot join cluster correctly after restart
> ------------------------------------------------
>
> Key: ISPN-4831
> URL: https://issues.jboss.org/browse/ISPN-4831
> Project: Infinispan
> Issue Type: Bug
> Components: State Transfer
> Affects Versions: 5.2.6.Final
> Environment: 2-node cluster with the following configuration:
> <?xml version="1.0" encoding="UTF-8"?>
> <infinispan
> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
> xsi:schemaLocation="urn:infinispan:config:5.0 http://www.infinispan.org/schemas/infinispan-config-5.0.xsd"
> xmlns="urn:infinispan:config:5.0">
> <global>
> <transport clusterName="cluster">
> <properties>
> <property name="configurationFile" value="cluster-configuration.xml"/>
> </properties>
> </transport>
> <globalJmxStatistics enabled="true" jmxDomain="distCache"/>
> </global>
> <default>
> <locking isolationLevel="REPEATABLE_READ"
> lockAcquisitionTimeout="30000"
> writeSkewCheck="false"
> concurrencyLevel="512"
> useLockStriping="false"/>
> <clustering mode="distribution">
> <sync replTimeout="120000"/>
> <l1 enabled="true"/>
> <hash numOwners="2"/>
> </clustering>
> <jmxStatistics enabled="true"/>
> <invocationBatching enabled="true"/>
> </default>
> <namedCache name="eu.ysoft.safeq.core.cache.entity.CacheableJobInfo_distnamespace">
> </namedCache>
>
> <namedCache name="eu.ysoft.safeq.core.cache.entity.CacheableJobInfo_index">
> </namedCache>
>
> </infinispan>
> Reporter: Marta Sedlakova
> Attachments: configuration.txt, node1-configuration.xml, node1WithRestart.txt, node2-configuration.xml, node2WithoutRestart.log
>
>
> Node is not able to join the cluster again after a restart.
> Node 1 was restarted; afterwards the cluster appears to have been disconnected for a while, and node1 failed to become the coordinator.
> Node1 was then restarted again, but some state transfer requests timed out and the cluster stayed disconnected until both nodes were restarted:
> Node 1 (the one with restart at 3:05 and 3:07):
> Sep 29, 2014 3:05:21 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000094: Received new cluster view: [LOUAPPWPS984-56770|15] [LOUAPPWPS984-56770, LOUAPPWPS983-56765]
> Sep 29, 2014 3:05:21 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport startJGroupsChannelIfNeeded
> INFO: ISPN000079: Cache local address is LOUAPPWPS983-56765, physical addresses are [133.27.18.204:7800]
> Sep 29, 2014 3:05:21 AM org.infinispan.factories.GlobalComponentRegistry start
> INFO: ISPN000128: Infinispan version: Infinispan 'Delirium' 5.2.6.Final
> Sep 29, 2014 3:05:22 AM org.infinispan.factories.TransactionManagerFactory construct
> INFO: ISPN000161: Using a batchMode transaction manager
> Sep 29, 2014 3:05:22 AM org.infinispan.jmx.CacheJmxRegistration start
> INFO: ISPN000031: MBeans were successfully registered to the platform MBean server.
> Sep 29, 2014 3:05:23 AM org.infinispan.factories.TransactionManagerFactory construct
> INFO: ISPN000161: Using a batchMode transaction manager
> Sep 29, 2014 3:05:23 AM org.infinispan.jmx.CacheJmxRegistration start
> INFO: ISPN000031: MBeans were successfully registered to the platform MBean server.
> Sep 29, 2014 3:05:23 AM org.infinispan.factories.TransactionManagerFactory construct
> INFO: ISPN000161: Using a batchMode transaction manager
> Sep 29, 2014 3:05:23 AM org.infinispan.jmx.CacheJmxRegistration start
> INFO: ISPN000031: MBeans were successfully registered to the platform MBean server.
> Sep 29, 2014 3:06:18 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000094: Received new cluster view: [LOUAPPWPS983-56765|16] [LOUAPPWPS983-56765]
> Sep 29, 2014 3:06:38 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000093: Received new, MERGED cluster view: MergeView::[LOUAPPWPS983-56765|17] [LOUAPPWPS983-56765, LOUAPPWPS984-56770], subgroups=[LOUAPPWPS984-56770|15] [LOUAPPWPS984-56770], [LOUAPPWPS983-56765|16] [LOUAPPWPS983-56765]
> Sep 29, 2014 3:07:33 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport stop
> INFO: ISPN000080: Disconnecting and closing JGroups Channel
> Sep 29, 2014 3:07:33 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport stop
> INFO: ISPN000082: Stopping the RpcDispatcher
> Sep 29, 2014 3:08:45 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport start
> INFO: ISPN000078: Starting JGroups Channel
> Sep 29, 2014 3:08:46 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000094: Received new cluster view: [LOUAPPWPS984-56770|19] [LOUAPPWPS984-56770, LOUAPPWPS983-54866]
> Sep 29, 2014 3:08:47 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport startJGroupsChannelIfNeeded
> INFO: ISPN000079: Cache local address is LOUAPPWPS983-54866, physical addresses are [133.27.18.204:7800]
> Sep 29, 2014 3:08:47 AM org.infinispan.factories.GlobalComponentRegistry start
> INFO: ISPN000128: Infinispan version: Infinispan 'Delirium' 5.2.6.Final
> Sep 29, 2014 3:08:47 AM org.infinispan.factories.TransactionManagerFactory construct
> INFO: ISPN000161: Using a batchMode transaction manager
> Sep 29, 2014 3:08:47 AM org.infinispan.jmx.CacheJmxRegistration start
> INFO: ISPN000031: MBeans were successfully registered to the platform MBean server.
> Sep 29, 2014 3:12:47 AM org.apache.catalina.core.ApplicationContext log
> SEVERE: StandardWrapper.Throwable
> org.infinispan.CacheException: Unable to invoke method public void org.infinispan.statetransfer.StateTransferManagerImpl.waitForInitialStateTransferToComplete() throws java.lang.InterruptedException on object of type StateTransferManagerImpl
> at org.infinispan.util.ReflectionUtil.invokeAccessibly(ReflectionUtil.java:205)
> at org.infinispan.factories.AbstractComponentRegistry$PrioritizedMethod.invoke(AbstractComponentRegistry.java:886)
> at org.infinispan.factories.AbstractComponentRegistry.invokeStartMethods(AbstractComponentRegistry.java:657)
> at org.infinispan.factories.AbstractComponentRegistry.internalStart(AbstractComponentRegistry.java:646)
> at org.infinispan.factories.AbstractComponentRegistry.start(AbstractComponentRegistry.java:549)
> at org.infinispan.factories.ComponentRegistry.start(ComponentRegistry.java:217)
> at org.infinispan.CacheImpl.start(CacheImpl.java:582)
> at org.infinispan.manager.DefaultCacheManager.wireAndStartCache(DefaultCacheManager.java:686)
> at org.infinispan.manager.DefaultCacheManager.createCache(DefaultCacheManager.java:649)
> at org.infinispan.manager.DefaultCacheManager.getCache(DefaultCacheManager.java:545)
> at org.infinispan.rest.StartupListener$$anonfun$init$1.apply(StartupListener.scala:66)
> at org.infinispan.rest.StartupListener$$anonfun$init$1.apply(StartupListener.scala:65)
> at scala.collection.Iterator$class.foreach(Iterator.scala:727)
> at scala.collection.AbstractIterator.foreach(Iterator.scala:1156)
> at org.infinispan.rest.StartupListener.init(StartupListener.scala:65)
> at org.apache.catalina.core.StandardWrapper.initServlet(StandardWrapper.java:1280)
> at org.apache.catalina.core.StandardWrapper.loadServlet(StandardWrapper.java:1193)
> at org.apache.catalina.core.StandardWrapper.load(StandardWrapper.java:1088)
> at org.apache.catalina.core.StandardContext.loadOnStartup(StandardContext.java:5176)
> at org.apache.catalina.core.StandardContext.startInternal(StandardContext.java:5460)
> at org.apache.catalina.util.LifecycleBase.start(LifecycleBase.java:150)
> at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.java:901)
> at org.apache.catalina.core.ContainerBase.addChild(ContainerBase.java:877)
> at org.apache.catalina.core.StandardHost.addChild(StandardHost.java:633)
> at org.apache.catalina.startup.HostConfig.deployDirectory(HostConfig.java:1120)
> at org.apache.catalina.startup.HostConfig$DeployDirectory.run(HostConfig.java:1678)
> at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)
> at java.util.concurrent.FutureTask.run(Unknown Source)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
> at java.lang.Thread.run(Unknown Source)
> Caused by: org.infinispan.CacheException: Initial state transfer timed out for cache eu.ysoft.safeq.core.cache.entity.CacheableJobInfo_index on LOUAPPWPS983-54866
> at org.infinispan.statetransfer.StateTransferManagerImpl.waitForInitialStateTransferToComplete(StateTransferManagerImpl.java:216)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
> at java.lang.reflect.Method.invoke(Unknown Source)
> at org.infinispan.util.ReflectionUtil.invokeAccessibly(ReflectionUtil.java:203)
> ... 30 more
> Node 2 (without restart):
> Sep 29, 2014 3:03:54 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000094: Received new cluster view: [LOUAPPWPS984-56770|14] [LOUAPPWPS984-56770]
> Sep 29, 2014 3:05:21 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000094: Received new cluster view: [LOUAPPWPS984-56770|15] [LOUAPPWPS984-56770, LOUAPPWPS983-56765]
> Sep 29, 2014 3:06:38 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000093: Received new, MERGED cluster view: MergeView::[LOUAPPWPS983-56765|17] [LOUAPPWPS983-56765, LOUAPPWPS984-56770], subgroups=[LOUAPPWPS984-56770|15] [LOUAPPWPS984-56770], [LOUAPPWPS983-56765|16] [LOUAPPWPS983-56765]
> Sep 29, 2014 3:07:33 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000094: Received new cluster view: [LOUAPPWPS984-56770|18] [LOUAPPWPS984-56770]
> Sep 29, 2014 3:07:33 AM org.infinispan.topology.ClusterTopologyManagerImpl handleNewView
> ERROR: ISPN000196: Failed to recover cluster state after the current node became the coordinator
> java.lang.IllegalStateException: Trying to set a topology with invalid members for cache eu.ysoft.safeq.core.cache.entity.CacheableJobInfo_index: members = [LOUAPPWPS984-56770], topology = CacheTopology{id=37, currentCH=DefaultConsistentHash{numSegments=60, numOwners=1, members=[LOUAPPWPS984-56770, LOUAPPWPS983-56765]}, pendingCH=null}
> at org.infinispan.topology.ClusterCacheStatus.updateCacheTopology(ClusterCacheStatus.java:165)
> at org.infinispan.topology.ClusterTopologyManagerImpl.updateCacheStatusAfterMerge(ClusterTopologyManagerImpl.java:315)
> at org.infinispan.topology.ClusterTopologyManagerImpl.handleNewView(ClusterTopologyManagerImpl.java:236)
> at org.infinispan.topology.ClusterTopologyManagerImpl$ClusterViewListener.handleViewChange(ClusterTopologyManagerImpl.java:579)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
> at java.lang.reflect.Method.invoke(Unknown Source)
> at org.infinispan.notifications.AbstractListenerImpl$ListenerInvocation$1.run(AbstractListenerImpl.java:212)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
> at java.lang.Thread.run(Unknown Source)
> Sep 29, 2014 3:08:46 AM org.infinispan.remoting.transport.jgroups.JGroupsTransport viewAccepted
> INFO: ISPN000094: Received new cluster view: [LOUAPPWPS984-56770|19] [LOUAPPWPS984-56770, LOUAPPWPS983-54866]
> Sep 29, 2014 3:12:48 AM org.infinispan.statetransfer.StateProviderImpl getCacheTopology
> WARN: ISPN000211: Transactions were requested by node LOUAPPWPS983-54866 with topology 30, older than the local topology (35)
> Sep 29, 2014 3:12:48 AM org.infinispan.statetransfer.StateProviderImpl getCacheTopology
> WARN: ISPN000212: Segments were requested by node LOUAPPWPS983-54866 with topology 30, older than the local topology (35)
> Sep 29, 2014 12:42:27 PM org.infinispan.transaction.TransactionTable shutDownGracefully
> WARN: ISPN000100: Stopping, but there are 0 local transactions and 16 remote transactions that did not finish in time.
> Sep 29, 2014 12:42:27 PM org.infinispan.remoting.transport.jgroups.JGroupsTransport stop
> INFO: ISPN000080: Disconnecting and closing JGroups Channel
> Sep 29, 2014 12:42:29 PM org.jgroups.logging.JDKLogImpl warn
> WARNING: LOUAPPWPS984-56770: failed to collect all ACKs (expected=1) for view [LOUAPPWPS983-54866|20] after 2000ms, missing ACKs from [LOUAPPWPS983-54866]
> Sep 29, 2014 12:42:30 PM org.jgroups.logging.JDKLogImpl error
--
[JBoss JIRA] (ISPN-4872) During state transfer, a pessimistic transaction can fail due to a ClusteredGetCommand forcing a lock command
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-4872?page=com.atlassian.jira.plugin.... ]
Dan Berindei updated ISPN-4872:
-------------------------------
Status: Open (was: New)
> During state transfer, a pessimistic transaction can fail due to a ClusteredGetCommand forcing a lock command
> -------------------------------------------------------------------------------------------------------------
>
> Key: ISPN-4872
> URL: https://issues.jboss.org/browse/ISPN-4872
> Project: Infinispan
> Issue Type: Bug
> Components: Core
> Affects Versions: 5.2.7.Final, 6.0.2.Final, 7.0.0.CR2
> Reporter: Erik Salter
>
> This is a strange one:
> A pessimistic transaction started on the primary owner during state transfer can fail because a backup owner issues a ClusteredGetCommand to the primary while processing the write command. This can happen if the backup needs to perform a remote get (due to a DELTA_WRITE, a conditional operation, or reliable return values). In this case, it's a DELTA_WRITE.
> In this chain of events, state transfer is ongoing, and the union CH is (east-dht5, west-dht5, east-dht6). In this case, the pessimistic transaction that originated on east-dht5 must be invoked on west-dht5 and east-dht6:
> {code}
> 2014-10-21 02:15:01,309 TRACE [org.infinispan.transaction.TransactionTable] (OOB-1713,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) Created a new local transaction: LocalXaTransaction{xid=null} LocalTransaction{remoteLockedNodes=null, isMarkedForRollback=false, lockedKeys=null, backupKeyLocks=null, isFromStateTransfer=false} globalTx=GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local, topologyId=33, age(ms)=0
> 2014-10-21 02:15:01,309 TRACE [org.infinispan.util.concurrent.locks.containers.OwnableReentrantPerEntryLockContainer] (OOB-1713,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) Creating and acquiring new lock instance for key EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831]
> 2014-10-21 02:15:01,345 TRACE [org.infinispan.remoting.transport.jgroups.CommandAwareRpcDispatcher] (OOB-1713,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) Replication task sending PrepareCommand {modifications=[PutKeyValueCommand{key=EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831], value=QamResource [qamId=431831, currentBandwidth=3750000, maxBandwidth=38810000, maxPrograms=255, currentPrograms=1], flags=[SKIP_REMOTE_LOOKUP, DELTA_WRITE], putIfAbsent=false, lifespanMillis=-1, maxIdleTimeMillis=-1, successful=true, ignorePreviousValue=false}], onePhaseCommit=true, gtx=GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local, cacheName='qamAllocation', topologyId=33} to addresses [240-west-dht5.comcast.net-35898(CH2-Chicago-IL), 240-east-dht6.comcast.net-47049(CMC-Denver-CO)] with response mode GET_ALL
> {code}
> Let's examine east-dht6. It gets the write request. It needs to perform a remote get to pull the full key context, since it receives only the delta.
> {code}
> 2014-10-21 02:15:01,362 TRACE [org.infinispan.remoting.transport.jgroups.CommandAwareRpcDispatcher] (OOB-2149,session-resource-cluster,240-east-dht6.comcast.net-47049(CMC-Denver-CO)) Attempting to execute command: PrepareCommand {modifications=[PutKeyValueCommand{key=EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831], value=QamResourceDelta [qamId=431831, basePort=2049, currentBandwidth=3750000, maxBandwidth=38810000, maxPrograms=255, currentPrograms=1, baseProgramNumber=1, portStep=1, program=Program [portNumber=2053, programNumber=5, status=UNUSED, sessionToken=, lastUpdatedTime=1413872101310, bandwidth=0, inputId=-1]], flags=[SKIP_REMOTE_LOOKUP, DELTA_WRITE], putIfAbsent=false, lifespanMillis=-1, maxIdleTimeMillis=-1, successful=true, ignorePreviousValue=false}], onePhaseCommit=true, gtx=GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local, cacheName='qamAllocation', topologyId=33} [sender=240-east-dht5.comcast.net-26106(CMC-Denver-CO)]
> ...
> 2014-10-21 02:15:01,362 TRACE [org.infinispan.transaction.TransactionTable] (OOB-2149,session-resource-cluster,240-east-dht6.comcast.net-47049(CMC-Denver-CO)) Created and registered remote transaction RemoteTransaction{modifications=...
> globalTx=GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:remote, topologyId=33, age(ms)=0
> 2014-10-21 02:15:01,362 TRACE [org.infinispan.remoting.transport.jgroups.CommandAwareRpcDispatcher] (OOB-2149,session-resource-cluster,240-east-dht6.comcast.net-47049(CMC-Denver-CO)) Replication task sending ClusteredGetCommand{key=EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831], flags=[SKIP_REMOTE_LOOKUP, DELTA_WRITE]} to addresses [240-east-dht5.comcast.net-26106(CMC-Denver-CO), 240-west-dht5.comcast.net-35898(CH2-Chicago-IL)] with response mode GET_FIRST
> {code}
> east-dht5 receives the ClusteredGetCommand request. Because the condition at TxDistributionInterceptor line 325 evaluates to true, the acquireRemoteLock flag is set to true:
> {code}
> boolean acquireRemoteLock = false;
> if (ctx.isInTxScope()) {
>    TxInvocationContext txContext = (TxInvocationContext) ctx;
>    acquireRemoteLock = isWrite && isPessimisticCache && !txContext.getAffectedKeys().contains(key);
> }
> {code}
> Thus, when east-dht5 runs the ClusteredGetCommand, it will create and invoke a LockControlCommand. Because the LCC is created inline, it will not have its origin set, but it will still create a RemoteTxInvocationContext. When the LCC runs through the interceptor chain, TxInterceptor::invokeNextInterceptorAndVerifyTransaction will check the originator (because of the remote context). When it doesn't find it, it will force a rollback.
> {code}
> 2014-10-21 02:15:01,347 TRACE [org.infinispan.remoting.InboundInvocationHandlerImpl] (OOB-1721,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) Calling perform() on ClusteredGetCommand{key=EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831], flags=[SKIP_REMOTE_LOOKUP, DELTA_WRITE]}
> 2014-10-21 02:15:01,347 TRACE [org.infinispan.transaction.TransactionTable] (OOB-1721,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) Created and registered remote transaction RemoteTransaction{modifications=[], lookedUpEntries={}, lockedKeys=null, backupKeyLocks=null, missingLookedUpEntries=false, isMarkedForRollback=false} globalTx=GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local, topologyId=33, age(ms)=0
> 2014-10-21 02:15:01,347 TRACE [org.infinispan.statetransfer.StateTransferInterceptor] (OOB-1721,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) handleTopologyAffectedCommand for command LockControlCommand{cache=qamAllocation, keys=[EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831]], flags=[SKIP_REMOTE_LOOKUP, DELTA_WRITE], unlock=false}
> 2014-10-21 02:15:01,347 TRACE [org.infinispan.interceptors.locking.PessimisticLockingInterceptor] (OOB-1721,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) Locking key EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831], no need to check for pending locks.
> 2014-10-21 02:15:01,347 TRACE [org.infinispan.interceptors.TxInterceptor] (OOB-1721,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) invokeNextInterceptorAndVerifyTransaction :: originatorMissing=true, alreadyCompleted=false
> 2014-10-21 02:15:01,347 TRACE [org.infinispan.interceptors.TxInterceptor] (OOB-1721,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) Rolling back remote transaction GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local because either already completed(false) or originator(null) no longer in the cluster(true).
>
> // ROLLBACK HAPPENS HERE
> {code}
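The decision in the trace above can be sketched in plain Java (all names here are illustrative, not the actual TxInterceptor code). Mirroring the log line, the transaction is rolled back when it is already completed or when its originator is not a cluster member, and an origin that was never set (null) fails the membership test exactly like a crashed node would:

```java
import java.util.Arrays;
import java.util.List;

public class OriginCheckDemo {
    // Illustrative version of the check performed in
    // TxInterceptor::invokeNextInterceptorAndVerifyTransaction.
    static boolean shouldRollback(String origin, List<String> members,
                                  boolean alreadyCompleted) {
        // An inline-created LockControlCommand carries origin == null,
        // which is never a member, so the originator looks "missing".
        boolean originatorMissing = !members.contains(origin);
        return alreadyCompleted || originatorMissing;
    }

    public static void main(String[] args) {
        List<String> members = Arrays.asList("east-dht5", "west-dht5", "east-dht6");
        // Origin never set on the inline LCC: spurious rollback (true).
        System.out.println(shouldRollback(null, members, false));
        // Origin propagated from the ClusteredGetCommand: no rollback (false).
        System.out.println(shouldRollback("east-dht5", members, false));
    }
}
```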
> When the transaction attempts to commit, it fails due to the spurious rollback. For instance:
> {code}
> 2014-10-21 02:15:01,384 WARN [org.infinispan.remoting.rpc.RpcManagerImpl] (OOB-1713,session-resource-cluster,240-east-dht5.comcast.net-26106(CMC-Denver-CO)) ISPN000071: Caught exception when handling command PrepareCommand {modifications=[PutKeyValueCommand{key=EdgeResourceCacheKey[edgeDeviceId=4211,resourceId=431831], value=QamResource [qamId=431831, currentBandwidth=3750000, maxBandwidth=38810000, maxPrograms=255, currentPrograms=1], flags=[SKIP_REMOTE_LOOKUP, DELTA_WRITE], putIfAbsent=false, lifespanMillis=-1, maxIdleTimeMillis=-1, successful=true, ignorePreviousValue=false}], onePhaseCommit=true, gtx=GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local, cacheName='qamAllocation', topologyId=33}
> org.infinispan.remoting.RemoteException: ISPN000217: Received exception from 240-west-dht5.comcast.net-35898(CH2-Chicago-IL), see cause for remote stack trace
> ...
> at org.infinispan.commands.tx.PrepareCommand.acceptVisitor(PrepareCommand.java:124)
> at org.infinispan.interceptors.InterceptorChain.invoke(InterceptorChain.java:343)
> at org.infinispan.transaction.TransactionCoordinator.commit(TransactionCoordinator.java:175)
> at org.infinispan.transaction.xa.TransactionXaAdapter.commit(TransactionXaAdapter.java:122)
> ...
> com.arjuna.ats.internal.jta.resources.arjunacore.XAResourceRecord.topLevelOnePhaseCommit(XAResourceRecord.java:636)
> at com.arjuna.ats.arjuna.coordinator.BasicAction.onePhaseCommit(BasicAction.java:2619)
> at com.arjuna.ats.arjuna.coordinator.BasicAction.End(BasicAction.java:1779)
> at com.arjuna.ats.arjuna.coordinator.TwoPhaseCoordinator.end(TwoPhaseCoordinator.java:88)
> at com.arjuna.ats.arjuna.AtomicAction.commit(AtomicAction.java:177)
> Caused by: org.infinispan.transaction.xa.InvalidTransactionException: This remote transaction GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local is already rolled back
> at org.infinispan.transaction.RemoteTransaction.checkIfRolledBack(RemoteTransaction.java:137)
> at org.infinispan.transaction.RemoteTransaction.putLookedUpEntry(RemoteTransaction.java:73)
> at org.infinispan.context.impl.RemoteTxInvocationContext.putLookedUpEntry(RemoteTxInvocationContext.java:99)
> at org.infinispan.container.EntryFactoryImpl.wrapInternalCacheEntryForPut(EntryFactoryImpl.java:242)
> at org.infinispan.container.EntryFactoryImpl.wrapEntryForPut(EntryFactoryImpl.java:166)
> ...
> {code}
> So it looks like there are a couple of ways to handle this. One would be to acquire a remote lock in TxDistributionInterceptor only if the current node is the originator. Another, possibly less intrusive, option would be to set the origin of the invoked LCC to that of the ClusteredGetCommand. The former seems preferable, but this code tends to be a bit labyrinthine with the various pessimistic use cases out there.
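The first option could look something like the sketch below (plain Java with illustrative boolean parameters, not actual Infinispan code): extend the acquireRemoteLock condition quoted earlier with an originator guard, so a primary serving a backup's remote get no longer manufactures a LockControlCommand against its own transaction:

```java
public class AcquireLockGuardDemo {
    // Illustrative version of the condition from TxDistributionInterceptor,
    // with one extra operand: only the transaction originator may trigger
    // remote lock acquisition.
    static boolean acquireRemoteLock(boolean inTxScope, boolean isWrite,
                                     boolean isPessimisticCache,
                                     boolean keyAlreadyAffected,
                                     boolean isOriginLocal) {
        return inTxScope && isWrite && isPessimisticCache
                && !keyAlreadyAffected && isOriginLocal;
    }

    public static void main(String[] args) {
        // east-dht5 handling the backup's ClusteredGetCommand: not the
        // originator's local context, so no lock command is created (false).
        System.out.println(acquireRemoteLock(true, true, true, false, false));
        // The originator's own local context still acquires the remote
        // lock as before (true).
        System.out.println(acquireRemoteLock(true, true, true, false, true));
    }
}
```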
--
[JBoss JIRA] (ISPN-4872) During state transfer, a pessimistic transaction can fail due to a ClusteredGetCommand forcing a lock command
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-4872?page=com.atlassian.jira.plugin.... ]
Dan Berindei reassigned ISPN-4872:
----------------------------------
Assignee: Dan Berindei
> During state transfer, a pessimistic transaction can fail due to a ClusteredGetCommand forcing a lock command
> -------------------------------------------------------------------------------------------------------------
>
> Key: ISPN-4872
> URL: https://issues.jboss.org/browse/ISPN-4872
> Project: Infinispan
> Issue Type: Bug
> Components: Core
> Affects Versions: 5.2.7.Final, 6.0.2.Final, 7.0.0.CR2
> Reporter: Erik Salter
> Assignee: Dan Berindei
>
> org.infinispan.remoting.RemoteException: ISPN000217: Received exception from 240-west-dht5.comcast.net-35898(CH2-Chicago-IL), see cause for remote stack trace
> ...
> at org.infinispan.commands.tx.PrepareCommand.acceptVisitor(PrepareCommand.java:124)
> at org.infinispan.interceptors.InterceptorChain.invoke(InterceptorChain.java:343)
> at org.infinispan.transaction.TransactionCoordinator.commit(TransactionCoordinator.java:175)
> at org.infinispan.transaction.xa.TransactionXaAdapter.commit(TransactionXaAdapter.java:122)
> ...
> com.arjuna.ats.internal.jta.resources.arjunacore.XAResourceRecord.topLevelOnePhaseCommit(XAResourceRecord.java:636)
> at com.arjuna.ats.arjuna.coordinator.BasicAction.onePhaseCommit(BasicAction.java:2619)
> at com.arjuna.ats.arjuna.coordinator.BasicAction.End(BasicAction.java:1779)
> at com.arjuna.ats.arjuna.coordinator.TwoPhaseCoordinator.end(TwoPhaseCoordinator.java:88)
> at com.arjuna.ats.arjuna.AtomicAction.commit(AtomicAction.java:177)
> Caused by: org.infinispan.transaction.xa.InvalidTransactionException: This remote transaction GlobalTransaction:<240-east-dht5.comcast.net-26106(CMC-Denver-CO)>:1575124:local is already rolled back
> at org.infinispan.transaction.RemoteTransaction.checkIfRolledBack(RemoteTransaction.java:137)
> at org.infinispan.transaction.RemoteTransaction.putLookedUpEntry(RemoteTransaction.java:73)
> at org.infinispan.context.impl.RemoteTxInvocationContext.putLookedUpEntry(RemoteTxInvocationContext.java:99)
> at org.infinispan.container.EntryFactoryImpl.wrapInternalCacheEntryForPut(EntryFactoryImpl.java:242)
> at org.infinispan.container.EntryFactoryImpl.wrapEntryForPut(EntryFactoryImpl.java:166)
> ...
> {code}
> So it looks like there are a couple of ways to handle this. One would be to acquire a remote lock in TxDistributionInterceptor only if the current node is the originator. Another -- possibly less intrusive -- would be to set the origin of the invoked LCC to that of the ClusteredGetCommand. The former seems preferable, but this code tends to be a bit labyrinthine given the various pessimistic use cases out there.
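The spurious rollback can be modeled in isolation. The sketch below uses hypothetical, stripped-down classes (not the real Infinispan ones) to show why an inline-created LCC with a null origin trips the originator-liveness check, and why copying the origin from the ClusteredGetCommand -- the second option above -- would make the check pass:

```java
import java.util.Set;

public class OriginCheckDemo {
    // Hypothetical stand-in for the real command class.
    static class LockControlCommand {
        String origin; // stays null when the command is created inline instead of deserialized
    }

    /** Mirrors the "originator still in the cluster" check done by TxInterceptor. */
    static boolean originatorMissing(LockControlCommand cmd, Set<String> clusterMembers) {
        return cmd.origin == null || !clusterMembers.contains(cmd.origin);
    }

    public static void main(String[] args) {
        Set<String> members = Set.of("east-dht5", "east-dht6", "west-dht5");

        LockControlCommand inline = new LockControlCommand(); // bug: origin never set
        System.out.println(originatorMissing(inline, members)); // true -> spurious rollback

        LockControlCommand fixed = new LockControlCommand();
        fixed.origin = "east-dht6"; // proposed fix: copy origin from the ClusteredGetCommand
        System.out.println(originatorMissing(fixed, members)); // false -> check passes
    }
}
```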
--
This message was sent by Atlassian JIRA
(v6.3.1#6329)
11 years, 5 months
[JBoss JIRA] (ISPN-4776) The topology id for the merged cache topology is not always bigger than all the partition topology ids
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-4776?page=com.atlassian.jira.plugin.... ]
Dan Berindei updated ISPN-4776:
-------------------------------
Status: Pull Request Sent (was: Coding In Progress)
Git Pull Request: https://github.com/infinispan/infinispan/pull/2908, https://github.com/infinispan/infinispan/pull/2964, https://github.com/infinispan/infinispan/pull/2977 (was: https://github.com/infinispan/infinispan/pull/2908, https://github.com/infinispan/infinispan/pull/2964)
> The topology id for the merged cache topology is not always bigger than all the partition topology ids
> ------------------------------------------------------------------------------------------------------
>
> Key: ISPN-4776
> URL: https://issues.jboss.org/browse/ISPN-4776
> Project: Infinispan
> Issue Type: Bug
> Components: Core
> Affects Versions: 7.0.0.Beta2
> Reporter: Dan Berindei
> Assignee: Dan Berindei
> Priority: Blocker
> Labels: testsuite_stability
> Fix For: 7.0.0.CR2
>
>
> With the ISPN-4574 fix, I changed the merge algorithm to pick the partition with the most members (both in the _stable_ topology and in the _current_ topology) instead of the partition with the highest topology id.
> However, the partition with the most members does not necessarily have the highest topology id, so it's possible that some nodes will ignore the merged topology because they already have a higher topology installed. This happened once in ClusterTopologyManagerTest.testClusterRecoveryAfterThreeWaySplit:
> {noformat}
> 00:24:59,286 DEBUG (transport-thread-NodeL-p33097-t6:) [ClusterCacheStatus] Recovered 3 partition(s) for cache cache: [CacheTopology{id=8, rebalanceId=3, currentCH=DefaultConsistentHash{ns = 60, owners = (1)[, NodeL-25322: 60+0]}, pendingCH=null, unionCH=null}, CacheTopology{id=6, rebalanceId=3, currentCH=DefaultConsistentHash{ns = 60, owners = (2)[, NodeL-25322: 30+10, NodeN-6727: 30+10]}, pendingCH=DefaultConsistentHash{ns = 60, owners = (2)[, NodeL-25322: 30+30, NodeN-6727: 30+30]}, unionCH=null}, CacheTopology{id=5, rebalanceId=2, currentCH=DefaultConsistentHash{ns = 60, owners = (1)[, NodeM-12972: 60+0]}, pendingCH=null, unionCH=null}]
> 00:24:59,287 DEBUG (transport-thread-NodeL-p33097-t6:) [ClusterCacheStatus] Updating topologies after merge for cache cache, current topology = CacheTopology{id=5, rebalanceId=2, currentCH=DefaultConsistentHash{ns = 60, owners = (1)[, NodeM-12972: 60+0]}, pendingCH=null, unionCH=null}, stable topology = CacheTopology{id=4, rebalanceId=2, currentCH=DefaultConsistentHash{ns = 60, owners = (3)[, NodeL-25322: 20+20, NodeM-12972: 20+20, NodeN-6727: 20+20]}, pendingCH=null, unionCH=null}, availability mode = null
> 00:24:59,287 DEBUG (transport-thread-NodeL-p33097-t6:) [ClusterTopologyManagerImpl] Updating cluster-wide current topology for cache cache, topology = CacheTopology{id=5, rebalanceId=2, currentCH=DefaultConsistentHash{ns = 60, owners = (1)[, NodeM-12972: 60+0]}, pendingCH=null, unionCH=null}, availability mode = null
> 00:24:59,288 TRACE (transport-thread-NodeL-p33097-t3:) [LocalTopologyManagerImpl] Ignoring consistent hash update for cache cache, current topology is 8: CacheTopology{id=5, rebalanceId=2, currentCH=DefaultConsistentHash{ns = 60, owners = (1)[, NodeM-12972: 60+0]}, pendingCH=null, unionCH=null}
> {noformat}
> Failure logs here: http://ci.infinispan.org/viewLog.html?buildId=12364&buildTypeId=Infinispa...
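A minimal sketch (a hypothetical helper, not the actual ClusterCacheStatus code) of the invariant the merge needs: whichever partition is chosen as the merge base, the merged topology id must be strictly greater than every partition's id, otherwise nodes that already installed a higher topology (id=8 in the log above) will ignore the merged one (id=5):

```java
import java.util.Arrays;

public class MergeTopologyId {
    // Take the maximum partition topology id and go one past it, so the merged
    // topology is never rejected as stale by any node in any partition.
    static int mergedTopologyId(int[] partitionTopologyIds) {
        return Arrays.stream(partitionTopologyIds).max().orElse(0) + 1;
    }

    public static void main(String[] args) {
        // Partitions from the log had ids 8, 6 and 5; the chosen base had id 5.
        System.out.println(mergedTopologyId(new int[]{8, 6, 5})); // prints 9
    }
}
```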
--
[JBoss JIRA] (ISPN-4866) HotRodRemoteCacheIT.testPutAsync random failures
by Vojtech Juranek (JIRA)
[ https://issues.jboss.org/browse/ISPN-4866?page=com.atlassian.jira.plugin.... ]
Vojtech Juranek closed ISPN-4866.
---------------------------------
Resolution: Duplicate Issue
Duplicate of ISPN-4813; it fails regularly and IMHO it's a bug
> HotRodRemoteCacheIT.testPutAsync random failures
> ------------------------------------------------
>
> Key: ISPN-4866
> URL: https://issues.jboss.org/browse/ISPN-4866
> Project: Infinispan
> Issue Type: Bug
> Components: Server, Test Suite - Server
> Affects Versions: 7.0.0.CR1
> Reporter: Dan Berindei
> Priority: Blocker
> Labels: testsuite_stability
> Fix For: 7.0.0.Final
>
>
> {noformat}
> java.lang.AssertionError: expected:<100> but was:<103>
> at org.junit.Assert.fail(Assert.java:88)
> at org.junit.Assert.failNotEquals(Assert.java:743)
> at org.junit.Assert.assertEquals(Assert.java:118)
> at org.junit.Assert.assertEquals(Assert.java:555)
> at org.junit.Assert.assertEquals(Assert.java:542)
> at org.infinispan.server.test.client.hotrod.AbstractRemoteCacheIT.testPutAsync(AbstractRemoteCacheIT.java:574)
> {noformat}
> The test has been failing in CI almost constantly since October 2nd:
> http://ci.infinispan.org/project.html?projectId=Infinispan&testNameId=-32...
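For context, one classic source of off-by-a-few counts in async put tests is asserting the entry count before every async operation has completed (or leftover entries from earlier tests, as with the suspected ISPN-4813 duplicate). A self-contained sketch of the wait-then-assert pattern, using a ConcurrentHashMap and an executor as stand-ins for the remote cache and its async API:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class PutAsyncDemo {
    /** Issues n async puts and reads the size only after all of them have completed. */
    static int putAllAndCount(int n) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        ConcurrentHashMap<Integer, Integer> cache = new ConcurrentHashMap<>(); // stand-in for RemoteCache
        List<Future<?>> puts = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            final int key = i;
            puts.add(pool.submit(() -> cache.put(key, key)));
        }
        for (Future<?> f : puts) {
            f.get(); // block until this particular put has finished
        }
        pool.shutdown();
        return cache.size();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(putAllAndCount(100)); // prints 100 once all puts are done
    }
}
```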
--
[JBoss JIRA] (ISPN-4873) Statetransfer thread pool deadlock
by Takayoshi Kimura (JIRA)
[ https://issues.jboss.org/browse/ISPN-4873?page=com.atlassian.jira.plugin.... ]
Takayoshi Kimura updated ISPN-4873:
-----------------------------------
Summary: Statetransfer thread pool deadlock (was: Statetransfer thread deadlock)
> Statetransfer thread pool deadlock
> ----------------------------------
>
> Key: ISPN-4873
> URL: https://issues.jboss.org/browse/ISPN-4873
> Project: Infinispan
> Issue Type: Bug
> Components: Core
> Affects Versions: 6.0.2.Final
> Reporter: Takayoshi Kimura
>
> During massive state transfer with 300 nodes and 3000 caches, the OOB and/or infinispan thread pool gets deadlocked, similar to ISPN-2808.
> The thread pool is configured with a maximum of 1400 threads, and increasing that limit is not a realistic workaround, as the user is planning to add more caches.
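The ISPN-2808-style deadlock can be reproduced in miniature: a bounded pool in which every worker blocks on a nested task that is queued behind it in the same pool, so no thread ever frees up. This is a standalone illustration under that assumption, not Infinispan's state-transfer code; it detects the wedge with a timeout instead of hanging forever:

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class PoolDeadlockDemo {
    /** Returns true if the pool wedges: all threads busy, all waiting on queued work. */
    static boolean deadlocks(int poolSize) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        CountDownLatch started = new CountDownLatch(poolSize);
        Future<?>[] outer = new Future<?>[poolSize];
        for (int i = 0; i < poolSize; i++) {
            outer[i] = pool.submit(() -> {
                started.countDown();
                try {
                    // Each occupied thread waits on a nested task queued behind it;
                    // with every thread busy, the nested task can never start.
                    pool.submit(() -> { }).get();
                } catch (Exception ignored) { }
                return null;
            });
        }
        started.await();
        boolean stuck;
        try {
            outer[0].get(500, TimeUnit.MILLISECONDS);
            stuck = false;
        } catch (TimeoutException e) {
            stuck = true; // the timeout fired: the pool is wedged
        }
        pool.shutdownNow(); // interrupt the blocked workers so the JVM can exit
        return stuck;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(deadlocks(2)); // prints true: the pool is wedged
    }
}
```

Growing the pool only raises the number of caches needed to wedge it, which is why bumping the 1400-thread limit is not a real fix here.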
--
[JBoss JIRA] (ISPN-4873) Statetransfer thread pool deadlock
by Takayoshi Kimura (JIRA)
[ https://issues.jboss.org/browse/ISPN-4873?page=com.atlassian.jira.plugin.... ]
Takayoshi Kimura updated ISPN-4873:
-----------------------------------
Description:
During massive state transfer with 300 nodes and 3000 caches, the OOB and/or infinispan thread pool gets deadlocked, similar to ISPN-2808.
The thread pool is now configured with a maximum of 1400 threads, and increasing that limit is not a realistic workaround, as the user is planning to add more caches.
was:
During massive state transfer with 300 nodes and 3000 caches, the OOB and/or infinispan thread pool gets deadlock, similar to ISPN-2808.
The thread pool is configured with max 1400 threads and increasing them is not a realistic workaround as the user is planning to add more caches.
> Statetransfer thread pool deadlock
> ----------------------------------
>
> Key: ISPN-4873
> URL: https://issues.jboss.org/browse/ISPN-4873
> Project: Infinispan
> Issue Type: Bug
> Components: Core
> Affects Versions: 6.0.2.Final
> Reporter: Takayoshi Kimura
>
> During massive state transfer with 300 nodes and 3000 caches, the OOB and/or infinispan thread pool gets deadlocked, similar to ISPN-2808.
> The thread pool is now configured with a maximum of 1400 threads, and increasing that limit is not a realistic workaround, as the user is planning to add more caches.
--