[infinispan-issues] [JBoss JIRA] (ISPN-6183) Initial state transfer fails with unexpected timeout

Vladimir Dzhuvinov (JIRA) issues at jboss.org
Tue Feb 9 05:30:00 EST 2016


    [ https://issues.jboss.org/browse/ISPN-6183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13160120#comment-13160120 ] 

Vladimir Dzhuvinov commented on ISPN-6183:
------------------------------------------

Additional JGroups settings passed in as properties:

-Djava.net.preferIPv4Stack=true
-Djgroups.fd_sock.start_port=7900
-Djgroups.tcp.port=7800

> Initial state transfer fails with unexpected timeout
> ----------------------------------------------------
>
>                 Key: ISPN-6183
>                 URL: https://issues.jboss.org/browse/ISPN-6183
>             Project: Infinispan
>          Issue Type: Bug
>          Components: State Transfer
>    Affects Versions: 7.2.5.Final
>         Environment: Java 7 on AWS EC2
>            Reporter: Vladimir Dzhuvinov
>         Attachments: default-jgroups-s3ping.xml, state-transfer-timeout-stack-trace.txt
>
>
> Hi guys,
> I would like to report a somewhat odd issue with initial state transfer. It was observed in two instances - an Infinispan 7.2.5 cluster with 2 nodes and an Infinispan 7.2.5 cluster with 6 nodes. The two clusters had been running for 2 weeks, the smaller for dev purposes with very light load - about a dozen cached objects. Upon adding an extra node an initial state transfer exception was encountered with both clusters, after about 4 minutes which is the default timeout setting for such situations. Several attempts were made to add a new node, incl. one with increased timeout (10 mins), but state transfer would still not complete, and throw an exception:
> {code:java}
> "message": "Unable to invoke method public void org.infinispan.statetransfer.StateTransferManagerImpl.waitForInitialStateTransferToComplete() throws java.lang.Exception on object of type StateTransferManagerImpl",
>       "name": "org.infinispan.commons.CacheException",
>       "cause": {
>         "commonElementCount": 25,
>         "localizedMessage": "Initial state transfer timed out for cache authzStore.codeMap on ip-10-180-242-223-40643",
>         "message": "Initial state transfer timed out for cache authzStore.codeMap on ip-10-180-242-223-40643",
>         "name": "org.infinispan.commons.CacheException",
>         "extendedStackTrace": [
>           {
>             "class": "org.infinispan.statetransfer.StateTransferManagerImpl",
>             "method": "waitForInitialStateTransferToComplete",
>             "file": "StateTransferManagerImpl.java",
>             "line": 222,
>             "exact": false,
>             "location": "StateTransferManagerImpl.class",
>             "version": "?"
>           },
> {code}
> The JMX console reported "stateTransferInProgress=true" and "joinComplete=true".
> The original clusters where then shut down and started again together with the new node, after which the clusters were successfully formed.
> Attached is the exception stack trace and the JGroups config (based on the stock S3 ping).



--
This message was sent by Atlassian JIRA
(v6.4.11#64026)


More information about the infinispan-issues mailing list