[JBoss JIRA] (ISPN-8074) DistSyncFuncTest failing random in computeIfPresent test
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-8074?page=com.atlassian.jira.plugin.... ]
Dan Berindei commented on ISPN-8074:
------------------------------------
The test runs with L1 enabled, and it looks like the {{computeIfPresent()}} operation can finish before the entry is invalidated on all the non-owners. It becomes much easier to reproduce with an artificial delay in {{InvalidateL1Command.perform()}}, but I'm thinking {{DistSyncFuncTest}} should work with L1 disabled and the L1-enabled test should be in {{DistSyncL1FuncTest}}.
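To make the suspected race concrete, here is a minimal, self-contained sketch. It does not use Infinispan: the map {{nonOwnerL1}}, the method {{removeOnOwners()}} and the latch standing in for the artificial delay are all hypothetical names invented for illustration. It only shows that a check made right after the operation returns can still see the stale L1 value, while the same check succeeds once the invalidation has run. The polling helper is a generic way to observe that convergence, not the fix proposed above, which is to run the test with L1 disabled.

```java
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class L1InvalidationRaceSketch {
    // Hypothetical stand-in for the L1 copy held by a non-owner node.
    static final Map<String, String> nonOwnerL1 = new ConcurrentHashMap<>();

    // Simulates a write whose L1 invalidation runs asynchronously.
    // The latch plays the role of an artificial delay in the invalidation.
    static CompletableFuture<Void> removeOnOwners(CountDownLatch delay) {
        return CompletableFuture.runAsync(() -> {
            try {
                delay.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return;
            }
            nonOwnerL1.remove("k1");
        });
    }

    // Polls until the key is gone from the L1 stand-in or the timeout expires.
    static boolean eventuallyAbsent(String key, long timeoutMillis) {
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
        while (deadline - System.nanoTime() > 0) {
            if (!nonOwnerL1.containsKey(key)) {
                return true;
            }
            try {
                Thread.sleep(10);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                break;
            }
        }
        return !nonOwnerL1.containsKey(key);
    }

    public static void main(String[] args) {
        nonOwnerL1.put("k1", "computed_k1_value");
        CountDownLatch delay = new CountDownLatch(1);
        CompletableFuture<Void> invalidation = removeOnOwners(delay);

        // The operation has "finished" from the caller's point of view,
        // but the invalidation is still pending, so the entry is stale.
        System.out.println("stale right after the operation: " + nonOwnerL1.containsKey("k1"));

        delay.countDown(); // let the delayed invalidation proceed
        System.out.println("absent eventually: " + eventuallyAbsent("k1", 5000));
        invalidation.join();
    }
}
```

In this sketch the immediate check corresponds to the failing assertion in the quoted report: the non-owner still holds {{computed_k1_value}} because nothing forces the caller to wait for the invalidation.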
> DistSyncFuncTest failing random in computeIfPresent test
> --------------------------------------------------------
>
> Key: ISPN-8074
> URL: https://issues.jboss.org/browse/ISPN-8074
> Project: Infinispan
> Issue Type: Bug
> Environment: Jenkins
> http://ci.infinispan.org/job/Infinispan/job/master/41/testReport/junit/or...
> Reporter: Katia Aresti
> Assignee: Katia Aresti
>
> http://ci.infinispan.org/job/Infinispan/job/master/41/testReport/junit/or...
> org.infinispan.distribution.DistSyncFuncTest.testComputeIfPresentFromNonOwner[DIST_SYNC, tx=false] (from infinispan-core)
> Failing for the past 1 build (since Unstable #41)
> Took 0.28 s.
> Error message
> Expecting [k1] to equal [null] on cache [DistSyncFuncTest[DIST_SYNC, tx=false]-NodeB-50117] but was [computed_k1_value]. Owners are [Cache 'dist'@DistSyncFuncTest[DIST_SYNC, tx=false]-NodeC-19559, Cache 'dist'@DistSyncFuncTest[DIST_SYNC, tx=false]-NodeD-7512]
> Stack trace
> java.lang.AssertionError: Expecting [k1] to equal [null] on cache [DistSyncFuncTest[DIST_SYNC, tx=false]-NodeB-50117] but was [computed_k1_value]. Owners are [Cache 'dist'@DistSyncFuncTest[DIST_SYNC, tx=false]-NodeC-19559, Cache 'dist'@DistSyncFuncTest[DIST_SYNC, tx=false]-NodeD-7512]
> at org.infinispan.distribution.BaseDistFunctionalTest.assertOnAllCaches(BaseDistFunctionalTest.java:176)
> at org.infinispan.distribution.BaseDistFunctionalTest.assertRemovedOnAllCaches(BaseDistFunctionalTest.java:168)
> at org.infinispan.distribution.DistSyncFuncTest.testComputeIfPresentFromNonOwner(DistSyncFuncTest.java:357)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:748)
> ... Removed 16 stack frames
>
--
This message was sent by Atlassian JIRA
(v7.2.3#72005)
7 years, 4 months
[JBoss JIRA] (ISPN-5947) Infinispan directory provider is a lot slower when lucene caches are distributed compared to replicated
by Sanne Grinovero (JIRA)
[ https://issues.jboss.org/browse/ISPN-5947?page=com.atlassian.jira.plugin.... ]
Sanne Grinovero commented on ISPN-5947:
---------------------------------------
I'm afraid this is expected. Thanks for measuring though!
Shall we close this? Not really a bug.
> Infinispan directory provider is a lot slower when lucene caches are distributed compared to replicated
> -------------------------------------------------------------------------------------------------------
>
> Key: ISPN-5947
> URL: https://issues.jboss.org/browse/ISPN-5947
> Project: Infinispan
> Issue Type: Bug
> Components: Embedded Querying
> Reporter: Jakub Markos
> Assignee: Gustavo Fernandes
>
> I noticed that the difference in performance when using Infinispan Directory Provider with lucene data cache in distributed mode compared to replicated mode is quite big. In numbers, on my computer, running a 4 node cluster with a distributed cache with indexing enabled:
> {code}
> <distributed-cache name="dist_lucene" owners="2" statistics="true">
>    <indexing index="LOCAL">
>       <property name="default.indexmanager">org.infinispan.query.indexmanager.InfinispanIndexManager</property>
>       <property name="default.exclusive_index_use">true</property>
>       <property name="default.metadata_cachename">lucene_metadata</property>
>       <property name="default.data_cachename">lucene_data</property>
>       <property name="default.locking_cachename">lucene_locking</property>
>    </indexing>
> </distributed-cache>
> <replicated-cache name="lucene_metadata" mode="SYNC" remote-timeout="25000">
>    <indexing index="NONE"/>
> </replicated-cache>
> <replicated-cache name="lucene_data" mode="SYNC" remote-timeout="25000">
>    <indexing index="NONE"/>
> </replicated-cache>
> <replicated-cache name="lucene_locking" mode="SYNC" remote-timeout="25000">
>    <indexing index="NONE"/>
> </replicated-cache>
> {code}
> Using 10 threads on each node, loading 100 000 entries takes ~2.5 minutes, and using 100 threads takes ~1 minute. Changing the configuration to use a distributed cache for the index data:
> {code}
> <distributed-cache name="lucene_data" mode="SYNC" remote-timeout="25000">
>    <indexing index="NONE"/>
> </distributed-cache>
> {code}
> leads to loading times of 3+ hours (10 threads, I stopped it at around 80000 entries) and 22 minutes (100 threads), which is around a 20x slowdown.
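As a rough cross-check of the reported figures, the slowdown can be expressed as a ratio of throughputs (entries per minute). The sketch below uses only the approximate numbers quoted in the report; the class and method names are made up for illustration, and the 10-thread distributed run is treated as exactly 180 minutes for 80000 entries, which can only underestimate the slowdown because that run was stopped after "3+ hours".

```java
public class SlowdownEstimate {
    // Throughput in entries per minute.
    static double throughput(double entries, double minutes) {
        return entries / minutes;
    }

    // How many times slower the other configuration is than the baseline.
    static double slowdown(double baselineThroughput, double otherThroughput) {
        return baselineThroughput / otherThroughput;
    }

    public static void main(String[] args) {
        // 100 threads: ~1 minute (replicated) vs 22 minutes (distributed), 100000 entries each.
        double replicated100 = throughput(100_000, 1.0);
        double distributed100 = throughput(100_000, 22.0);

        // 10 threads: ~2.5 minutes for 100000 entries (replicated) vs
        // 3+ hours for ~80000 entries (distributed, run stopped early).
        double replicated10 = throughput(100_000, 2.5);
        double distributed10 = throughput(80_000, 180.0);

        System.out.printf("100 threads: about %.0fx slower%n",
                slowdown(replicated100, distributed100));
        System.out.printf("10 threads: at least about %.0fx slower%n",
                slowdown(replicated10, distributed10));
    }
}
```

With these inputs the 100-thread case comes out at about 22x, in line with the "around 20x" above, while the 10-thread case comes out at roughly 90x or more. Both are only as precise as the approximate timings in the report.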
7 years, 4 months
[JBoss JIRA] (ISPN-8158) ClusteredCacheWithAffinityIndexManagerTxTest random failures with trace logging enabled
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-8158?page=com.atlassian.jira.plugin.... ]
Dan Berindei closed ISPN-8158.
------------------------------
Resolution: Duplicate Issue
Duplicate of ISPN-8160
> ClusteredCacheWithAffinityIndexManagerTxTest random failures with trace logging enabled
> ---------------------------------------------------------------------------------------
>
> Key: ISPN-8158
> URL: https://issues.jboss.org/browse/ISPN-8158
> Project: Infinispan
> Issue Type: Bug
> Components: Test Suite - Query
> Affects Versions: 9.1.0.Final
> Reporter: Dan Berindei
> Assignee: Dan Berindei
> Priority: Critical
> Labels: testsuite_stability
>
> The test usually passes when run by itself, but when run in parallel with other tests it often has one or more failures. The failing test method is not always the same, e.g.
> {noformat}
> 13:10:28,264 ERROR (testng-Test:[]) [TestSuiteProgress] Test failed: org.infinispan.query.blackbox.ClusteredCacheWithAffinityIndexManagerTxTest.testCompute
> org.hibernate.search.exception.SearchException: Unable to reopen IndexReader
> at org.hibernate.search.indexes.impl.SharingBufferReaderProvider$PerDirectoryLatestReader.refreshAndGet(SharingBufferReaderProvider.java:242) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.hibernate.search.indexes.impl.SharingBufferReaderProvider.openIndexReader(SharingBufferReaderProvider.java:73) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.hibernate.search.indexes.impl.SharingBufferReaderProvider.openIndexReader(SharingBufferReaderProvider.java:35) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.hibernate.search.reader.impl.ManagedMultiReader.createInstance(ManagedMultiReader.java:69) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.hibernate.search.reader.impl.MultiReaderFactory.openReader(MultiReaderFactory.java:48) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.hibernate.search.query.engine.impl.LuceneHSQuery.buildSearcher(LuceneHSQuery.java:475) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.hibernate.search.query.engine.impl.LuceneHSQuery.buildSearcher(LuceneHSQuery.java:399) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.hibernate.search.query.engine.impl.LuceneHSQuery.queryEntityInfos(LuceneHSQuery.java:142) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> at org.infinispan.query.impl.CacheQueryImpl.list(CacheQueryImpl.java:160) ~[classes/:?]
> at org.infinispan.query.blackbox.ClusteredCacheTest.testCompute(ClusteredCacheTest.java:687) ~[test-classes/:?]
> Caused by: org.apache.lucene.index.IndexNotFoundException: no segments* file found in InfinispanDirectory{indexName='person.194'}: files: []
> at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:726) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:683) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.apache.lucene.index.SegmentInfos.readLatestCommit(SegmentInfos.java:490) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.apache.lucene.index.StandardDirectoryReader.isCurrent(StandardDirectoryReader.java:344) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.apache.lucene.index.StandardDirectoryReader.doOpenNoWriter(StandardDirectoryReader.java:300) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.apache.lucene.index.StandardDirectoryReader.doOpenIfChanged(StandardDirectoryReader.java:263) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.apache.lucene.index.StandardDirectoryReader.doOpenIfChanged(StandardDirectoryReader.java:251) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.apache.lucene.index.DirectoryReader.openIfChanged(DirectoryReader.java:137) ~[lucene-core-5.5.4.jar:5.5.4 31012120ebbd93744753eb37f1dbc5e654628291 - jpountz - 2017-02-08 19:08:03]
> at org.hibernate.search.indexes.impl.SharingBufferReaderProvider$PerDirectoryLatestReader.refreshAndGet(SharingBufferReaderProvider.java:239) ~[hibernate-search-engine-5.8.0.Beta4.jar:5.8.0.Beta4]
> ... 35 more
> {noformat}
> I suspect the root problem is that trace logging makes everything slower.
7 years, 5 months
[JBoss JIRA] (ISPN-8139) OptimisticPrimaryOwnerCrashDuringPrepareTest.testPrimaryOwnerCrash random failures
by Dan Berindei (JIRA)
[ https://issues.jboss.org/browse/ISPN-8139?page=com.atlassian.jira.plugin.... ]
Dan Berindei reassigned ISPN-8139:
----------------------------------
Assignee: Dan Berindei
> OptimisticPrimaryOwnerCrashDuringPrepareTest.testPrimaryOwnerCrash random failures
> ----------------------------------------------------------------------------------
>
> Key: ISPN-8139
> URL: https://issues.jboss.org/browse/ISPN-8139
> Project: Infinispan
> Issue Type: Bug
> Components: Test Suite - Core
> Reporter: Tristan Tarrant
> Assignee: Dan Berindei
> Labels: testsuite_stability
>
> Stacktrace
> java.util.concurrent.TimeoutException
> at java.util.concurrent.FutureTask.get(FutureTask.java:205)
> at org.infinispan.distribution.rehash.OptimisticPrimaryOwnerCrashDuringPrepareTest.testPrimaryOwnerCrash(OptimisticPrimaryOwnerCrashDuringPrepareTest.java:58)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> at java.lang.Thread.run(Thread.java:748)
> ... Removed 16 stack frames
>
7 years, 5 months