On 01/04/2010 10:44 PM, Alex Kluge wrote:
>> - No events. I used events, that is asynchronous messages
originating
>> from the server when certain conditions are met, to notify clients
>> that data was written to the server. This was mostly driven by the
>> need to notify an L1 cache of a write to the cache server. It would
>> be good to allow for this usage in this protocol. Note that this is
>> another case where the op code is useful to have as part of the
>> message.
>
> Isn't this expensive? Doesn't this mean the server
> has to fire off messages to each and every connected client,
> whether the clients are interested in these messages or
> not?
Not necessarily. There are a number of options. It wouldn't be
difficult to allow the clients to register for the events, and
then only send them to the interested clients. The events can
be sent asynchronously (a separate thread), thus they won't
delay the response to a write. Piratically speaking, they aren't
that expensive.
I'm still not convinced by this either. What you're talking about sounds
like JMS to me :). Right now, the only situation where I can see this
being useful is for sending back cluster formation changes but we found
simpler way to deal with it and that doesn't require the added complexity.
My other source of doubt has already being highlighted in a previous
thread and relates to firewalls, which generally block persistent
connections.
It's unclear here whether in your case you had control over the clients
and the environment linking clients to your server and whether you had
any issues with the topics just mentioned.
I do find an asynchronous event model easier to extend in future if we
want to send other information back to the clients, and possibly easier
to filter client capabilities. You could assume that all clients must be
able to handle a response as it is, and keep things like cluster
formation changes as asynchronous events that more complex clients will
only be able to handle.
>> - What happens if the key or the value is not text? I have a way of
>> representing the data to allow for a wide variety of data types,
>> even allowing for arrays or maps. This will make the protocol more
>> complex, but the assumption that the data is a string is rather
>> limiting. This is already sketched out in the wiki.
Hmmmmm, I don't think I've made any assumptions in the wiki that keys or
values are Strings unless I've made a mistake somewhere (maybe in the
example where I've used a particular encoding for Strings?). My thoughts
around this was that I was gonna treat them both as byte[]...
>
> There is some work around non-text data in the RESTful
> cache server. Adding MIME types to any opaque data
> allows us to attach handlers to this data, such as XML or
> JSON data. What do you have in mind?
The idea is to prefix each data block with a data type. This is a
lightweight binary protocol, so a full fledged mime type would be
overkill. There is a discussion and example in the Encoding Data
section of this page:
http://community.jboss.org/wiki/RemoteCacheInteractions
Data types are limited to things like integer, byte, string, boolean, etc.
Or, if it isn't a recognised type, the native platform serialisation is
used. There can be arrays of these types, or maps as well.
Each data type is represented by a bit, and they can be used in
combinations. An array of bytes would have the array, byte and
primitive bits set. The set of recognised data types can of course be
expanded.
... but that looks rather useful (at least based on the cached version
of that wiki that Google shows ;) - no wiki access right now :( ),
particularly the way you can combine different bytes to create composed
types. I don't see a problem with including this in Hot Rod.
I suppose you included type length into length field so that in the
future you can support other types and possibly longer type fields?
>> - Is the full message size still there as a header field? Is this
>> necessary? It precludes you from generating messages where you
>> don't know the full size of the message beforehand. For example,
>> in the future you might want to write a map of the data in the
>> cache. I do this, and can chunk the data, thus allowing me to send
>> an arbitrarily large map. This becomes, to put it mildly, difficult
>> if you need the size of the request as a header field.
>
> +1. If there is a way to work without this size
> header that would be better.
In my implementation I have a size attached to each field, and this
allows the messages to be handled easily. I retrieve more data from
the network if there is not enough data to complete the processing of
a field. There is no need to know the size of the full message.
I borrowed the idea from the memcached binary protocol, but you have a a
good point. I'll revisit the model to include size together with each field.
Finally, I've noted that you add cache name in the requests. This is
interesting because until now, I had thought of a Hot Rod server to map
1 to 1 to a cache, but if you add the cache name, you can map 1 Hot Rod
server to a Cache manager and the cache allows you to direct requests to
different caches under the same cache manager.
In the case of memcached, since it's a well defined protocol, the
mapping is between 1 to 1 between a Memcached server and a cache.
Thanks,
Alex
--- On Mon, 1/4/10, Manik Surtani<manik(a)jboss.org> wrote:
> From: Manik Surtani<manik(a)jboss.org>
> Subject: Re: [infinispan-dev] Hot Rod - pt3
> To: "infinispan -Dev List"<infinispan-dev(a)lists.jboss.org>
> Date: Monday, January 4, 2010, 11:17 AM
>
> On 29 Dec 2009, at 18:56, Alex Kluge wrote:
>
>> Hi,
>>
>> I hope you have all enjoyed the holiday season.
>>
>> The protocol is an interesting read. There are a
> few things that stand
>> out.
>>
>> - No events. I used events, that is asynchronous
> messages originating
>> from the server when certain conditions
> are met, to notify clients
>> that data was written to the
> server. This was mostly driven by the
>> need to notify an L1 cache of a write to
> the cache server. It would
>> be good to allow for this usage in this
> protocol. Note that this is
>> another case where the op code is useful
> to have as part of the
>> message.
>
> Isn't this expensive? Doesn't this mean the server
> has to fire off messages to each and every connected client,
> whether the clients are interested in these messages or
> not?
>
>> - What happens if the key or the value is not
> text? I have a way of
>> representing the data to allow for a wide
> variety of data types,
>> even allowing for arrays or maps. This
> will make the protocol more
>> complex, but the assumption that the data
> is a string is rather
>> limiting. This is already sketched out in
> the wiki.
>
> There is some work around non-text data in the RESTful
> cache server. Adding MIME types to any opaque data
> allows us to attach handlers to this data, such as XML or
> JSON data. What do you have in mind?
>
>> - Is the full message size still there as a
> header field? Is this
>> necessary? It precludes you from
> generating messages where you
>> don't know the full size of the message
> beforehand. For example,
>> in the future you might want to write a
> map of the data in the
>> cache. I do this, and can chunk the data,
> thus allowing me to send
>> an arbitrarily large map. This becomes,
> to put it mildly, difficult
>> if you need the size of the request as a
> header field.
>
> +1. If there is a way to work without this size
> header that would be better.
>
> Cheers
> Manik
>
>>
>>
>
> Thanks,
>>
>
> Alex
>>
>> --- On Mon, 12/21/09, Manik Surtani<manik(a)jboss.org>
> wrote:
>>
>>> From: Manik Surtani<manik(a)jboss.org>
>>> Subject: Re: [infinispan-dev] Hot Rod - pt3
>>> To: "infinispan -Dev List"<infinispan-dev(a)lists.jboss.org>
>>> Date: Monday, December 21, 2009, 5:36 AM
>>>
>>> On 21 Dec 2009, at 11:08, Galder Zamarreno wrote:
>>>
>>>> Hi all,
>>>>
>>>> Re:
http://community.jboss.org/wiki/HotRodProtocol
>>>>
>>>> First, I've made the corresponding changes
> based on
>>> the feedback I got
>>>> from pt2. This included reducing the response
> header
>>> since clients are
>>>> already aware of what they sent, addition of
> topology
>>> view id to to non
>>>> dumb requests and further specification in
> responses
>>> when topology
>>>> changes have happened...etc.
>>>>
>>>> I've also added flags to the request header
> that allow
>>> sending
>>>> Infinispan flags like: skip cache store, zero
> lock
>>> acquisition
>>>> timeout...etc.
>>>
>>>> Note that I've noted this as being N * 1 byte
> where
>>> each byte represents
>>>> a flag. However, I think this could maybe be
> sent more
>>> efficiently by
>>>> using XOR, i.e.
>>>>
>>>> 0x00 -> no flag
>>>> 0x01 (0000 0001) -> zero lock acquisition
>>>> 0x02 (0000 0010) -> cache mode local
>>>> 0x03 (0000 0011) -> zero lock acquisition +
> cache
>>> mode local
>>>> 0x04 (0000 0100) -> skip locking
>>>> ...etc.
>>>>
>>>> With 2 bytes, we could implement 16 Flags, we
>>> currently 11. However, we
>>>> could use vint as well, making sure that the
> most
>>> significant bit does
>>>> not mean anything flag wise. Iow, with vint,
> in 1 byte
>>> we'd be able to
>>>> define 7 diff flags. Thoughts?
>>>
>>> +1
>>>
>>>> I've also added the quit command that
> disconnects
>>> clients.
>>>
>>> Does this have any effect on the servers?
>>>
>>>> Finally, as far as I'm concerned, the
> specification is
>>> complete. I'm
>>>> leaving the quiet commands out of this initial
> scope
>>> (see
>>>>
http://code.google.com/p/memcached/wiki/MemcacheBinaryProtocol).
>>>
>>>> Remember that quiet commands could be used so
> that the
>>> server buffers
>>>> responses and only when you send a non-quiet
> command,
>>> the server replies
>>>> with all the pending answers.
>>>>
>>>> As you can see at the bottom of the wiki, I've
> added a
>>> local only put
>>>> request/response example so that readers get
> an idea
>>> of what a full
>>>> command looks like. I had received some
> feedback from
>>> readers saying
>>>> that it was difficult to understand how it all
> fit
>>> together.
>>>
>>> The examples look good, however I would not use
> the
>>> LOCAL_ONLY flag since that flag really only has
> meaning in a
>>> p2p context. In a client-server context,
> this flag is
>>> useless, and probably meaningless. I would
> suggest a
>>> different flag for the example, e.g.,
> ZERO_LOCK_TIMEOUT.
>>>
>>>> I'll probably add a couple more examples for
>>> non-so-dumb and clever
>>>> request/responses but I'll held them until we
> have a
>>> final round of
>>>> feedback and people can indicate whether they
> want any
>>> other examples
>>>> appearing in the wiki.
>>>>
>>>> Cheers,
>>>> --
>>>> Galder ZamarreƱo
>>>> Sr. Software Engineer
>>>> Infinispan, JBoss Cache
>>>>
> _______________________________________________
>>>> infinispan-dev mailing list
>>>> infinispan-dev(a)lists.jboss.org
>>>>
https://lists.jboss.org/mailman/listinfo/infinispan-dev
>>>
>>> --
>>> Manik Surtani
>>> manik(a)jboss.org
>>> Lead, Infinispan
>>> Lead, JBoss Cache
>>>
http://www.infinispan.org
>>>
http://www.jbosscache.org
>>>
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> infinispan-dev mailing list
>>> infinispan-dev(a)lists.jboss.org
>>>
https://lists.jboss.org/mailman/listinfo/infinispan-dev
>>>
>>
>>
>>
>>
>> _______________________________________________
>> infinispan-dev mailing list
>> infinispan-dev(a)lists.jboss.org
>>
https://lists.jboss.org/mailman/listinfo/infinispan-dev
>
> --
> Manik Surtani
> manik(a)jboss.org
> Lead, Infinispan
> Lead, JBoss Cache
>
http://www.infinispan.org
>
http://www.jbosscache.org
>
>
>
>
>
> _______________________________________________
> infinispan-dev mailing list
> infinispan-dev(a)lists.jboss.org
>
https://lists.jboss.org/mailman/listinfo/infinispan-dev
>
_______________________________________________
infinispan-dev mailing list
infinispan-dev(a)lists.jboss.org
https://lists.jboss.org/mailman/listinfo/infinispan-dev
--
Galder ZamarreƱo
Sr. Software Engineer
Infinispan, JBoss Cache