Lucene 5 is coming: pitfalls to consider
by Sanne Grinovero
Hi all,
the Hibernate Search branch upgrading to Apache Lucene 5.2.x is almost
ready. Alongside the many nice efficiency improvements, though, there
are some drawbacks to consider.
# API changes
The API changes are not too bad, and definitely an improvement. I'll
provide a detailed list as usual in the Hibernate Search migration
guide - for now, suffice it to say that it's an easy upgrade for
end users, as long as they were just creating Query instances and not
using the more powerful and complex APIs.
# Sorting
Sorting on a field will require an UninvertingReader to wrap the
cached IndexReaders, and the uninverting process is very inefficient.
On top of that, the result of the uninverting process is not
cacheable, so it will need to be repeated on each index, for each
query that is executed.
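For those unfamiliar with the new API, this is roughly what the
wrapping looks like at the Lucene level - a minimal sketch with a
made-up "title" field, not our actual integration code:

    import java.io.IOException;
    import java.util.Collections;
    import org.apache.lucene.index.DirectoryReader;
    import org.apache.lucene.search.IndexSearcher;
    import org.apache.lucene.search.MatchAllDocsQuery;
    import org.apache.lucene.search.Sort;
    import org.apache.lucene.search.SortField;
    import org.apache.lucene.search.TopDocs;
    import org.apache.lucene.store.Directory;
    import org.apache.lucene.uninverting.UninvertingReader;

    public class SortedSearchExample {
        // Sorts on "title" even though the index has no DocValues for
        // that field, paying the uninverting cost at query time.
        static TopDocs searchSortedByTitle(Directory dir) throws IOException {
            DirectoryReader reader = DirectoryReader.open(dir);
            // This wrap is the expensive, non-cacheable step described above:
            DirectoryReader uninverted = UninvertingReader.wrap(reader,
                    Collections.singletonMap("title", UninvertingReader.Type.SORTED));
            IndexSearcher searcher = new IndexSearcher(uninverted);
            return searcher.search(new MatchAllDocsQuery(), 10,
                    new Sort(new SortField("title", SortField.Type.STRING)));
        }
    }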
In short, I expect performance of sorted queries to be quite degraded
in our first milestone using Lucene 5, and we'll have to discuss how
to fix this.
Needless to say, fixing this is a blocking requirement before we can
consider the migration complete.
Sorting will not need an UninvertingReader if the target field has
been indexed as DocValues, but that implies:
- we'll need an explicit, upfront (indexing time) flag to be set
- we'll need to detect if the matching indexing options are
compatible with the runtime query to skip the uninverting process
This is mostly a job for Hibernate Search, but in terms of user
experience it means you have to mark fields for "sortability"
explicitly; will we need to extend the protobuf schema?
Please make sure we'll just have to hook into existing metadata; we
can't fix this after the API freeze.
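For context, this is what marking a field as "sortable" boils down to
at the Lucene level: the field is indexed twice, once inverted for
matching and once as DocValues for sorting. A sketch with made-up
field names, using plain Lucene APIs rather than our annotations:

    import java.io.IOException;
    import org.apache.lucene.analysis.standard.StandardAnalyzer;
    import org.apache.lucene.document.Document;
    import org.apache.lucene.document.Field;
    import org.apache.lucene.document.SortedDocValuesField;
    import org.apache.lucene.document.StringField;
    import org.apache.lucene.index.IndexWriter;
    import org.apache.lucene.index.IndexWriterConfig;
    import org.apache.lucene.store.Directory;
    import org.apache.lucene.util.BytesRef;

    public class SortableIndexingExample {
        static void indexTitle(Directory dir, String title) throws IOException {
            try (IndexWriter writer = new IndexWriter(dir,
                    new IndexWriterConfig(new StandardAnalyzer()))) {
                Document doc = new Document();
                // Inverted representation, used for matching:
                doc.add(new StringField("title", title, Field.Store.NO));
                // Columnar DocValues representation, used for sorting
                // without any UninvertingReader:
                doc.add(new SortedDocValuesField("title", new BytesRef(title)));
                writer.addDocument(doc);
            }
        }
    }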
# Filters
We had some clever bitset-level optimisations to merge multiple
Filter instances and save memory when caching them. I had to drop that
code: the new design no longer deals with in-heap structures but
iterates over off-heap chunks of data, so we now fall back on the more
traditional Lucene stack for filtering.
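To illustrate the "traditional Lucene stack" route: since Lucene 5.1 a
filter can be expressed as a non-scoring boolean clause, leaving
caching to Lucene's own query cache. A sketch - the field and value
are invented:

    import org.apache.lucene.index.Term;
    import org.apache.lucene.search.BooleanClause.Occur;
    import org.apache.lucene.search.BooleanQuery;
    import org.apache.lucene.search.Query;
    import org.apache.lucene.search.TermQuery;

    public class FilteringExample {
        static Query withTenantFilter(Query userQuery) {
            BooleanQuery bq = new BooleanQuery();
            bq.add(userQuery, Occur.MUST);
            // FILTER restricts matches like MUST, but does not
            // contribute to scoring:
            bq.add(new TermQuery(new Term("tenant", "acme")), Occur.FILTER);
            return bq;
        }
    }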
I couldn't measure the performance impact yet; it's a significantly
different approach and while it sounds promising on paper, we'll need
some help testing this. The Lucene team can generally be trusted to go
in the better direction, but we'll have to verify if we're using it in
the best way.
# Analyzers
It is no longer possible to override the field->analyzer mapping at
runtime. We did expose this feature as a public API and I found a way
to still do it, but it comes with a performance price tag.
We'll soon deprecate this feature; if you can, start making sure
Infinispan doesn't need it, as at some point in the near future we'll
have to drop it, with no replacement.
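For clarity, the feature in question is the query-time override
exposed through the query builder DSL, along these lines (entity,
field and analyzer names are made up):

    import org.hibernate.search.FullTextSession;
    import org.hibernate.search.query.dsl.QueryBuilder;

    public class AnalyzerOverrideExample {
        static class Book { }

        static QueryBuilder builderWithOverride(FullTextSession session) {
            return session.getSearchFactory().buildQueryBuilder()
                    .forEntity(Book.class)
                    // Replaces the analyzer configured for "title" at
                    // query time; this is the capability that now
                    // carries a performance price tag:
                    .overridesForField("title", "my-custom-analyzer")
                    .get();
        }
    }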
# Index encoding
As usual, the index encoding evolves, and the easy solution is to
rebuild the index. Lucene 5 no longer ships with backwards-compatible
decoders, but these are available as separate dependencies. If you
feel the need to be able to read existing indexes, we should include
them.
(I'm including these as private dependencies in the Hibernate Search modules).
Thanks,
Sanne
Hidden failures in the testsuite
by Sanne Grinovero
Hi all,
I just updated my local master fork and started the testsuite, as I
sometimes do.
It's great to see that the build was successful, and no tests
*appeared* to have failed.
But! Lazily scrolling up in the console, I see lots of exceptions
which don't look intentional (I'm aware that some tests
intentionally create error conditions). Also, some tests are extremely
verbose, which might be the reason nobody noticed these.
Some examples:
- org.infinispan.it.compatibility.EmbeddedRestHotRodTest seems to log
TRACE to the console (and probably the whole module)
- CDI tests such as org.infinispan.cdi.InfinispanExtensionRemote seem
to fail in great number because of some ClassNotFoundException(s)
and/or ResourceLoadingException(s)
- OSGi integration tests seem to be all broken by some invalid
integration with Aries / Geronimo
- OSGi integration tests dump a lot of unnecessary information to the
build console
- the Infinispan Query tests also log lots of WARNs, around missing
configuration properties and, in some cases, concerning exceptions;
I'm pretty sure I had resolved those in the past, so it seems some
refactorings were done without considering the log output.
Please don't ignore the output; if it's too verbose to watch, that
needs to be resolved too.
I also monitor the "expected execution time" of some modules I'm
interested in; that's been useful in some cases to figure out that
there was a regression.
One big question: why is it that so many tests "appear to be good" but
are actually broken? I would like to understand that.
Thanks,
Sanne
Redis infinispan cache store
by Simon Paulger
Hi,
I'm interested in developing Infinispan integration with Redis for use in
JBoss. Before working on JBoss, I first need to add the capability to
Infinispan itself.
Is this an enhancement that the Infinispan community would be interested in?
Regards,
Simon
Blue-Green deployment scenario
by Christian Beikov
Hello,
I have been reading the rolling upgrade chapter[1] from the
documentation and I have some questions.
1. The documentation states that in the target cluster, every cache
that should be migrated should use a CLI cache loader pointing to
the source cluster (a sketch of such a configuration follows after
these questions). I suppose this can only be configured via XML,
but not via the CLI or JMX? That would be bad, because after a
node restart the cache loader would be enabled again.
2. What would the JMX URL look like if I wanted to connect to a secured
Wildfly over HTTP? I was thinking of
jmx:http-remoting-jmx://USER:PASSWORD@HOST:PORT/CACHEMANAGER/CACHE
3. What do I need to do to roll back to the source cluster after
switching a few nodes to the target cluster?
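Regarding question 1, this is how I understand the programmatic
equivalent would look, assuming the CLInterfaceLoaderConfigurationBuilder
from the infinispan-persistence-cli module (host, port and cache names
are placeholders):

    import org.infinispan.configuration.cache.Configuration;
    import org.infinispan.configuration.cache.ConfigurationBuilder;
    import org.infinispan.persistence.cli.configuration.CLInterfaceLoaderConfigurationBuilder;

    public class CliLoaderConfigExample {
        static Configuration targetCacheConfig() {
            ConfigurationBuilder builder = new ConfigurationBuilder();
            builder.persistence()
                   // Points the target cluster's cache at the source cluster:
                   .addStore(CLInterfaceLoaderConfigurationBuilder.class)
                   .connectionString("jmx://source-host:9999/SourceCacheManager/myCache");
            return builder.build();
        }
    }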
Thanks in advance!
Regards,
Christian
[1]
http://infinispan.org/docs/7.2.x/user_guide/user_guide.html#_rolling_upgr...
Early Access builds for JDK 8u66 b02 and JDK 9 b78 are available on java.net
by Rory O'Donnell
Hi Galder,
Early Access build for JDK 8u66 b02 <http://jdk8.java.net/download.html>
is available on java.net; a summary of changes is listed here:
<http://download.java.net/jdk8u66/changes/jdk8u66-b02.html?q=download/jdk8...>
Early Access build for JDK 9 b78 <https://jdk9.java.net/download/> is
available on java.net; a summary of changes is listed here:
<http://download.java.net/jdk9/changes/jdk9-b78.html?q=download/jdk9/chang...>
With respect to ongoing JDK 9 development, I'd like to draw your
attention to the following requests to provide
feedback on the relevant mailing lists.
*OpenJDK JarSigner API*
JDK 9 is more restrictive about calling sun.* public methods, but we know
there are users calling
sun.security.tools.jarsigner.Main to sign jar files. A new API has been
proposed
<http://mail.openjdk.java.net/pipermail/security-dev/2015-August/012636.html>
for this very purpose in OpenJDK.
Feedback on this API should be provided on the security-dev
<http://mail.openjdk.java.net/mailman/listinfo/security-dev> mailing list.
*RFC JEP: NIST SP 800-90A SecureRandom implementations*
Feedback on this draft JEP
<http://mail.openjdk.java.net/pipermail/security-dev/2015-August/012667.html>
should be provided on the security-dev
<http://mail.openjdk.java.net/mailman/listinfo/security-dev> mailing list.
*Public API for internal Swing classes*
According to JEP 200: The Modular JDK
<http://openjdk.java.net/jeps/200>, we expect that classes from internal
packages (like sun.swing) will not be
accessible. If you are using the internal Swing API and it is not
possible to replace it with a public API, please provide
feedback on the swing-dev
<http://mail.openjdk.java.net/mailman/listinfo/swing-dev> mailing list.
If you haven’t already subscribed to a list then please do so first,
otherwise your message will be discarded as spam.
Finally, videos of presentations from the JVM Language Summit have been
published at:
http://www.oracle.com/technetwork/java/javase/community/jlssessions-2015-...
Rgds, Rory
--
Rgds, Rory O'Donnell
Quality Engineering Manager
Oracle EMEA , Dublin, Ireland
Shared vs Non-Shared CacheStores
by Sanne Grinovero
I would like to propose a clear-cut separation between our shared and
non-shared CacheStores,
in all respects, such as:
- Configuration options
- Integration contracts (Split the CacheStore SPI)
- Implementations
- Terminology, to avoid any further confusion around valid
configurations and sensible architectures
We have loads of examples of users who got into trouble by configuring
one incorrectly, but there are also plenty of efficiency improvements
we could take advantage of by clearly splitting the integration points
and the implementations into two categories.
Not least, it's a very common and dangerous pitfall to assume that
Infinispan is able to restore a consistent state after a DIST cluster
which passivated into non-shared CacheStore instances has been stopped,
or even a REPL cluster whose nodes don't shut down all at the exact
same time (and "exact same time" is a strange concept, at the least..).
We need to clarify the different options, tradeoffs and their
consequences.. to users and to ourselves, as a clearly defined use case
will avoid bugs and simplify implementations.
# The purpose of each
I think people should use a non-shared (local?) CacheStore for
the sole purpose of expanding the storage capacity of each single
node.. be it because you don't have enough memory at all, or because
you prefer some extra safety margin, either because your estimates are
complex or because we live in a real world where the hashing function
might not be perfect in practice. I hope we all agree that Infinispan
should be able to take such situations with, at worst, a graceful
performance degradation, rather than complaining by sending OOMs to
the admin and setting the service on strike.
A Shared CacheStore is useful for very different purposes; primarily
to implement a Cache on some other service - for example your (single,
shared) RDBMS, a slow (or expensive) web service your organization has
to call frequently, etc.. It's also useful as a write-through cache
on a similar service, maybe internal but not able to handle the high
variation of load spikes which Infinispan can handle better.
Finally, a great use case is to have a consistent backup of all your
data-grid content, possibly in some "reference" form such as JPA-mapped
entities.
# Benefits of a Non-Shared
A non-shared CacheStore implementor should be able to take advantage
of *its purpose*; among the big benefits I see:
- Exclusive usage -> locking of a specific entry can be handled at
the data container level, which can simplify quite some internal code.
- Reliability -> since a clustered node needs to wipe its state at
reboot (after a crash), it's much simpler to code any such CacheStore
to avoid any form of disk sync or persistence guarantees.
- Encoding format -> this can be controlled entirely by Infinispan,
with no need to take factors like rolling-upgrade-compatible encodings
into account. JBoss Marshalling would be good enough, or some
implementations might not need to serialize at all.
Our non-shared CacheStore implementation(s) could take advantage of
lower-level, more complex code optimisations and interfaces, as users
would rarely want to customize one of these, while the use case of
mapping data to a shared service needs a more user-friendly SPI, so as
to keep it simple to plug in custom stores: custom data formats, custom
connectors, and some help in implementing concurrency correctly.
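To make the proposed split more tangible, the two contracts could look
something like this (purely illustrative, not actual Infinispan
interfaces):

    // Purely illustrative sketch, not the real SPI.
    interface LocalCacheStore<K> {
        // Exclusive to one node: may assume single-writer semantics
        // and an Infinispan-controlled binary encoding.
        void write(K key, byte[] encodedValue);
        byte[] load(K key);
        void clear(); // wiped at (re)boot after a crash
    }

    interface SharedCacheStore<K, V> {
        // Backed by an external shared service (RDBMS, web service, ...):
        // must use a stable, user-visible data format and tolerate
        // concurrent access from multiple nodes.
        void write(K key, V value);
        V load(K key);
    }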
Proper transaction integration for the CacheStore has been on our
wishlist for some time too. I suspect that accepting that we have been
mixing up two different things under the same name would make it
simpler to implement further improvements such as transactions: the
way to do such a thing is very different in each of these use cases,
so it would help to implement it on one subset first - or perhaps only
there, if it turns out there's no need for such things in the context
of the local-only, dedicated "swapfile".
# Mixed types should be killed
I'm aware that some of our current implementations _could_ work both
as shared and non-shared, for example the JDBC CacheStore, the
JPACacheStore or the Remote CacheStore.. but in most cases it doesn't
make much sense. Why would you ever want to use the JPACacheStore if
not to share data with a _shared_ database?
We should take such options away, and by doing so focus on the use
cases which actually matter and simplify the implementations and
improve the configuration validations.
If ever a compelling storage technology is identified which we'd like to
offer as an option for both shared and non-shared use, I would still
recommend making two different implementations, as there certainly are
different requirements and assumptions when coding such a thing.
Not least, I would very much like to see a default local CacheStore:
picking one for local "emergency swapping" should be a no-brainer for
users; we could set one up by default and not bother newcomers with
complex choices.
If we simplify its requirements, it should be easy to write one on
standard Java NIO2 APIs and get rid of the complexities of maintaining
the native integration with things like LevelDB, not least the
inefficiency of making such native calls from Java.
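To show how little such a local "swapfile" store would need, here's a
minimal sketch on plain NIO2 - not the actual SPI, with hash
collisions and error handling ignored for brevity:

    import java.io.IOException;
    import java.nio.file.DirectoryStream;
    import java.nio.file.Files;
    import java.nio.file.Path;

    public class LocalSwapStoreSketch {
        private final Path dir;

        public LocalSwapStoreSketch(Path dir) throws IOException {
            this.dir = dir;
            Files.createDirectories(dir);
            // Exclusive, non-shared usage: state from a previous run is
            // invalid after a restart, so simply wipe the directory.
            try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir)) {
                for (Path p : stream) {
                    Files.delete(p);
                }
            }
        }

        private Path fileFor(String key) {
            // Hex-encoded hash keeps file names filesystem-safe;
            // a real store would have to handle collisions.
            return dir.resolve(Integer.toHexString(key.hashCode()) + ".bin");
        }

        public void write(String key, byte[] value) throws IOException {
            // No fsync: losing entries on a crash is acceptable here.
            Files.write(fileFor(key), value);
        }

        public byte[] read(String key) throws IOException {
            Path f = fileFor(key);
            return Files.exists(f) ? Files.readAllBytes(f) : null;
        }
    }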
Then, as a second step, we should attack the other use case: backups.
From a *purpose-driven perspective* I'd then see us revive the Cassandra
integration, obviously as a shared-only option.
Cheers,
Sanne
JCache integration with Wildfly provided configuration
by Christian Beikov
Hello,
I am using Infinispan 7.2.3.Final within Wildfly 9.0.1 and I would like
to use the JCache integration, but I'm struggling a bit.
I configured the JGroups subsystem in the standalone.xml of my Wildfly
installation to enable clustering of Infinispan caches. That works as
expected, but I wasn't sure how to have my caches clustered too. I
thought of some possible solutions, but neither is really what I am
looking for.
1. Put the cache container configuration into standalone.xml
2. Copy the JGroups configuration and create a new transport in a
custom infinispan configuration
When doing 1. I can't really use the JCache integration because there is
no way to tell the caching provider that I want a CacheManager for a
specific cache container. If you would recommend doing 1. then it would
be nice if the caching provider would not only accept file URIs, but
also something like JNDI names. That way, I could reference
existing cache containers, which at least solves the problem with the
JCache integration. Still, I would prefer option 2. because I wouldn't
have to change the standalone.xml every time I add a cache.
When doing 2. I can use the Infinispan configuration file as the URI
when creating the cache manager, so the JCache integration works without
a problem (see the sketch below). The only thing bothering me is that I
have to copy the JGroups configuration to have a separate transport for
my application's cache container. I can't seem to reference the
transport that I configured in the standalone.xml, nor does it default
to that. I would really like to reuse the JGroups channel that is
already established.
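For reference, this is the bootstrap style I mean for option 2 (file
and cache names are just examples):

    import java.net.URI;
    import javax.cache.Cache;
    import javax.cache.CacheManager;
    import javax.cache.Caching;

    public class JCacheBootstrapExample {
        public static void main(String[] args) {
            // The URI points at an Infinispan XML configuration file,
            // which today must duplicate the JGroups transport settings:
            CacheManager cacheManager = Caching.getCachingProvider().getCacheManager(
                    URI.create("infinispan-jcache.xml"),
                    JCacheBootstrapExample.class.getClassLoader());
            Cache<String, String> cache = cacheManager.getCache("myCache");
            cache.put("hello", "world");
        }
    }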
What I would like to know is whether there is a way to make use
of the JGroups configuration I did in the standalone.xml. If there
isn't, what should I do when I want to cluster my caches? Just go with
option 1?
Regards,
Christian Beikov