On 10 Dec 2009, at 12:23, Galder Zamarreno wrote:
>> Also, it might be wised to implement operations like
replaceIfEquals
>> using with a cas long, like memcached does
>> (
http://code.google.com/p/memcached/wiki/MemcacheBinaryProtocol), rather
>> than sending the value cos that would be more network efficient. The old
>> value could be much more longer.
>
> Not sure I understand. So we have 2 replace methods (thanks to ConcurrentMap) -
replace(k, v) which succeeds if k is mapped to something, and replace(k, v1, v2) which
succeeds if k is mapped to v1. From what I see, only the former is supported by
memcached's binary protocol and not the latter. Are you suggesting supporting the
latter in HotRod?
I'm suggesting supporting both actually.
replace(k, v) which succeeds if k is mapped to something - that's the
same as memcached's replace command. I plan to support that.
replace(k, v1, v2) is what memcached does with 'cas' but instead of
passing the old value, it passes the cas id, which is 8 bytes at the
most. The old value could be a lot longer. In ConcurrentMap, replace
works by passing the old value.
Sure, and that's in the same VM so you pass by reference. Across the wire, this may
involve serializing the entire (old) value which may be quite large.
I think, to save network space, I think we should implement it the
way
memcached does. IOW, whenever something gets stored, it's stored with a
cas id and this can later be used to replace a value, only of the cas is
the one as we've passed.
You get this cas id back as a return value? Which you can then store and use in future
'replace' commands? Is this it?
In the txt protocol, I implemented the cas id using
System.nanoTime().
Is this cas id meant to be unique? If so, System.nanoTime() may not be ...
>> Another thing to note here is that for expiry and maxIdle, I
followed
>> the Java TimeUnit based definition. To make it easier for client
>> implementers, we could follow the memcached style, where seconds are
>> used always and if the number of seconds is bigger than 30 days, those
>> seconds represent UNIX time (number of seconds since 1st January 1970).
>> WDYT?
>
> This is done to reduce bytes on the wire, since expressing something in millis often
means extra bytes and often meaningless since network latency can be worth several hundred
millis already. So it makes sense for the protocol to express this in seconds. Not sure
I understand your comment re: seconds> 30 days. Are you referring to the time being
treated as relative to the UNIX epoch [1] as opposed to relative to the current time
snapshot?
Indeed, when you pass a number of seconds that is bigger than 30 days,
memcached understands it as the offset from 1/1/1970. Otherwise, if
below 30 days, it thinks it's relative to the current time.
Good point about sending anything below seconds, it's wasteful and
netwrok latency probably invalidates. Assuming we only allowed, seconds,
minutes, hours, and days as time units, we'll be saving space compared
to memcached.
I'm not sure of the frequency of hours/days as time units, so I'd
inclined to only support seconds and use the same algorithm as memcached
simply, as this reduces complexity of existing clients that might be
used to memcached protocol.
Yes, the protocol should just use seconds. Client libs on top could add syntactic sugar
like TimeUnit.*, etc.
Cheers
--
Manik Surtani
manik(a)jboss.org
Lead, Infinispan
Lead, JBoss Cache
http://www.infinispan.org
http://www.jbosscache.org