[infinispan-dev] Infinispan client/server architecture based on gRPC

Emmanuel Bernard emmanuel at hibernate.org
Wed May 30 08:17:20 EDT 2018


On Wed 18-05-30 11:16, Galder Zamarreno wrote:
>On Tue, May 29, 2018 at 8:57 PM Adrian Nistor <anistor at redhat.com> wrote:
>
>> Vittorio, a few remarks regarding your statement "...The alternative to
>> this is to develop a protostream equivalent for each supported language and
>> it doesn't seem really feasible to me."
>>
>> No way! That's a big misunderstanding. We do not need to re-implement the
>> protostream library in C/C++/C# or any new supported language.
>> Protostream is just for Java and it is compatible with Google's protobuf
>> lib we already use in the other clients. We can continue using Google's
>> protobuf lib for these clients, with or without gRPC.
>> Protostream does not handle protobuf services as gRPC does, but we can add
>> support for that with little effort.
>>
>> The real problem here is whether we want to replace our hot rod invocation
>> protocol with gRPC to save on the effort of implementing and maintaining
>> hot rod in all those clients. I wonder why the obvious question is being
>> avoided in this thread.
>>
>
>^ It is not being avoided. I stated it quite clearly when I replied but
>maybe not with enough detail. So, I said:
>
>> The biggest problem I see in our client/server architecture is the
>> ability to quickly deliver features/APIs across multiple language clients.
>> Both Vittorio and I have seen how long it takes to implement all the
>> different features available in the Java client and port them to Node.js,
>> C/C++/C#...etc. This effort led by Vittorio is trying to improve on that
>> by having some of that work done for us. Granted, not all of it will be
>> done, but it should give us some good foundations on which to build.
>
>To expand on it a bit further: the reason it takes us longer to get
>different features in is that each client implements its own network
>layer, parses the protocol and does type transformations (between byte[]
>and whatever the client expects).
>
>IMO, the most costly things there are getting the network layer right (from
>experience with Node.js, it has taken a while to do so) and parsing work
>(not only parsing itself, but doing it in an efficient way). The network
>layer also includes load balancing, failover, cluster failover...etc.
>
>From past experience, transforming from byte[] to what the client expects
>has never really been very problematic for me. What's been difficult here
>is coming up with the encoding architecture that Gustavo led, whose aim was
>to improve on the initial compatibility mode. But, with that now clear,
>understood and proven to solve our issues, the rest in this area should be
>fairly straightforward IMO.
>
>Type transformation, once done, is a constant. As we add more Hot Rod
>operations, it's mostly the parsing that starts to become more work. The
>network layer can also become more work if, instead of RPC commands, you
>start supporting stream-based commands.
>
>gRPC solves the network (FYI: with the key as an HTTP header and a
>SubchannelPicker you can do hash-aware routing) and the parsing for us. I
>don't see the need for it to solve our type transformations for us. If it
>does, great, but does it support our compatibility requirements? (I had
>already told Vittorio to check with Gustavo on this.) Type transformation
>is a lower priority for me; network and parsing are more important.
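
[Aside: a minimal grpc-java sketch of what such a hash-aware
SubchannelPicker could look like. The "x-cache-key" header name and the
modulo hash are placeholders for illustration, not an existing Infinispan
or gRPC convention; a real client would plug in the consistent-hash
segment mapping it receives from the server.]

import io.grpc.LoadBalancer.PickResult;
import io.grpc.LoadBalancer.PickSubchannelArgs;
import io.grpc.LoadBalancer.Subchannel;
import io.grpc.LoadBalancer.SubchannelPicker;
import io.grpc.Metadata;

import java.util.List;

// Routes each call to the subchannel that "owns" the key carried in a
// request header; calls without a key fall back to the first subchannel.
final class HashAwarePicker extends SubchannelPicker {

  // Hypothetical header carrying the cache key for this call.
  private static final Metadata.Key<String> KEY_HEADER =
      Metadata.Key.of("x-cache-key", Metadata.ASCII_STRING_MARSHALLER);

  private final List<Subchannel> servers; // one READY subchannel per server

  HashAwarePicker(List<Subchannel> servers) {
    this.servers = servers;
  }

  @Override
  public PickResult pickSubchannel(PickSubchannelArgs args) {
    if (servers.isEmpty()) {
      return PickResult.withNoResult(); // buffer the RPC until a connection is ready
    }
    String key = args.getHeaders().get(KEY_HEADER);
    if (key == null) {
      return PickResult.withSubchannel(servers.get(0)); // keyless op: any server
    }
    // Placeholder hash; real code would use the server's segment ownership map.
    int owner = Math.floorMod(key.hashCode(), servers.size());
    return PickResult.withSubchannel(servers.get(owner));
  }
}

[A custom LoadBalancer would publish this picker via
helper.updateBalancingState(READY, picker), and a ClientInterceptor would
set the key header on each outgoing call.]
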
>
>Hope this clarifies my POV better.

I think my internal view of the project goal was a bit different, so let me
clarify. Who will do the hash-aware connection to the grid?
Each client (i.e. manual coding for each platform)?
Or a generic C/Rust/Assembly client that acts as a gRPC server?

Which of the following architectures is closest to what each of you is
saying:

A
|--------------- Client ---------------------|             |-- server --|
Clt RT <-- inter process --> Generic C client <--- HR ------> Data Grid

B
|--------------- Client --------------------|              |-- server --|
Clt RT <-- gRPC call    --> Generic C client <--- HR ------> Data Grid

C
|--------------- Client ------------------|               |-- server --|
Clt RT <-- inter process --> Generic C clt <--- gRPC -----> Data Grid

D
|--------------- Client -----------------|              |-- server --|
Clt RT <-- gRPC call    --> Generic C clt <--- gRPC ----> Data Grid


E
|--------------- Client ---------------------|             |-- server --|
Clt RT gRPC + manual coding (hash-aware etc)  <-- gRPC -->  Data Grid

F
Yet another alien

My understanding is that you guys are talking about E, which still leaves a
lot of room for polyglot bugs if you ask me.

Emmanuel


