[infinispan-issues] [JBoss JIRA] (ISPN-9908) Cache startup failure with server hinting and insufficient segments

Dan Berindei (Jira) issues at jboss.org
Tue Feb 19 04:52:15 EST 2019


     [ https://issues.jboss.org/browse/ISPN-9908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dan Berindei updated ISPN-9908:
-------------------------------
              Status: Pull Request Sent  (was: Open)
    Git Pull Request: https://github.com/infinispan/infinispan/pull/6703


> Cache startup failure with server hinting and insufficient segments
> -------------------------------------------------------------------
>
>                 Key: ISPN-9908
>                 URL: https://issues.jboss.org/browse/ISPN-9908
>             Project: Infinispan
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 9.4.6.Final
>            Reporter: Hiroki Daicho
>            Assignee: Dan Berindei
>            Priority: Major
>         Attachments: logs.zip
>
>
> When a cache is configured with a small number of segments and server hinting is enabled, a node fails to start with the following error [1].
> It can be reproduced with RHDG 7.2.3 and 7.3 ER2.
> [1]
> {code}
> ERROR [org.jboss.msc.service.fail] (MSC service thread 1-4) MSC000001: Failed to start service jboss.datagrid-infinispan.clustered.test: org.jboss.msc.service.StartException in service jboss.datagrid-infinispan.clustered.test: Failed to start service
> ...
> Caused by: org.infinispan.commons.CacheException: Unable to invoke method public void org.infinispan.statetransfer.StateTransferManagerImpl.start() throws java.lang.Exception on object of type StateTransferManagerImpl
> ...
> Caused by: org.infinispan.util.concurrent.TimeoutException: Replication timeout for svr01 (flags=0), site-id=site1, rack-id=rack1, machine-id=machine1)
> at org.infinispan.remoting.transport.jgroups.JGroupsTransport.checkRsp(JGroupsTransport.java:916)
> ...
> {code}
> For example, in a 3-node cluster with the following settings, the 3rd node fails to start.
> When the number of segments is set to 20 (the 6.6.2 default), the 6th node fails to start with the above timeout.
> Nodes seem unable to finish the initial state transfer, and startup fails, when the number of segments is too small relative to the number of nodes.
> {code}
> <distributed-cache name="default" segments="1" />
> ...
> <stack name="udp">
>     <transport type="UDP" socket-binding="jgroups-udp" machine="${jboss.jgroups.transport.machine:machine1}" rack="${jboss.jgroups.transport.rack:rack1}" site="${jboss.jgroups.transport.site:site1}" />
> </stack>
> {code}



--
This message was sent by Atlassian Jira
(v7.12.1#712002)

