[jboss-cvs] JBossCache/docs/JBossCache-UserGuide/en/modules ...
Vladimir Blagojevic
vladimir.blagojevic at jboss.com
Thu Jan 25 11:57:30 EST 2007
User: vblagojevic
Date: 07/01/25 11:57:30
Modified: docs/JBossCache-UserGuide/en/modules replication.xml
Log:
updated state transfer section
Revision Changes Path
1.6 +558 -514 JBossCache/docs/JBossCache-UserGuide/en/modules/replication.xml
(In the diff below, changes in quantity of whitespace are not shown.)
Index: replication.xml
===================================================================
RCS file: /cvsroot/jboss/JBossCache/docs/JBossCache-UserGuide/en/modules/replication.xml,v
retrieving revision 1.5
retrieving revision 1.6
diff -u -b -r1.5 -r1.6
--- replication.xml 24 Jan 2007 18:05:25 -0000 1.5
+++ replication.xml 25 Jan 2007 16:57:30 -0000 1.6
@@ -1,503 +1,481 @@
+<?xml version="1.0" encoding="UTF-8"?>
<chapter id="clustering">
<title>Clustering</title>
+
<para>This chapter talks about aspects around clustering JBoss Cache.</para>
+
<section>
<title>Cache Replication Modes</title>
- <para>
- JBoss Cache can be configured to be either local (standalone) or clustered. If
- in a cluster, the cache can be configured to replicate changes, or to
- invalidate changes. A detailed discussion on this follows.
- </para>
+
+ <para>JBoss Cache can be configured to be either local (standalone) or
+ clustered. If in a cluster, the cache can be configured to replicate
+ changes, or to invalidate changes. A detailed discussion on this
+ follows.</para>
+
<section>
<title>Local Mode</title>
+
<para>Local caches don't join a cluster and don't communicate with other
caches in a cluster. Therefore their elements don't need to be
- serializable - however, we recommend making them serializable, enabling a
- user to change the cache mode at any time. The dependency on the JGroups
- library is still there, although a JGroups channel is not started.</para>
+ serializable - however, we recommend making them serializable, enabling
+ a user to change the cache mode at any time. The dependency on the
+ JGroups library is still there, although a JGroups channel is not
+ started.</para>
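+
+      <para>As a minimal sketch, local mode is selected through the cache's
+      <literal>CacheMode</literal> configuration attribute (shown here in
+      the style of the configuration example later in this chapter;
+      surrounding service descriptor elements are omitted):</para>
+
+      <programlisting>
+   <!-- A standalone cache: no cluster is joined, no changes are replicated -->
+   <attribute name="CacheMode">LOCAL</attribute>
+      </programlisting>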
</section>
+
<section>
<title>Replicated Caches</title>
- <para>Replicated caches replicate all changes to the
- other cache instances in the cluster. Replication can either happen
- after each modification (no transactions), or at the end of a
- transaction (commit time).
- </para>
+
+ <para>Replicated caches replicate all changes to the other cache
+ instances in the cluster. Replication can either happen after each
+ modification (no transactions), or at the end of a transaction (commit
+ time).</para>
      <para>Replication can be synchronous or asynchronous. Use of either one
of the options is application dependent. Synchronous replication blocks
- the caller (e.g. on a <literal>put()</literal>) until the modifications have been
- replicated successfully to all nodes in a cluster. Asynchronous
- replication performs replication in the background (the <literal>put()</literal> returns
- immediately).
- JBoss Cache
- also offers a replication queue, where
- modifications are replicated periodically (i.e. interval-based), or when
- the queue size exceeds a number of elements, or a combination
- thereof.
- </para>
+ the caller (e.g. on a <literal>put()</literal>) until the modifications
+ have been replicated successfully to all nodes in a cluster.
+ Asynchronous replication performs replication in the background (the
+ <literal>put()</literal> returns immediately). JBoss Cache also offers a
+ replication queue, where modifications are replicated periodically (i.e.
+ interval-based), or when the queue size exceeds a number of elements, or
+ a combination thereof.</para>
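+
+      <para>For illustration, an asynchronously replicated cache with a
+      replication queue might be configured along these lines (a sketch;
+      attribute names as per the standard JBoss Cache configuration, and
+      the interval and queue size values are arbitrary):</para>
+
+      <programlisting>
+   <!-- Replicate changes asynchronously -->
+   <attribute name="CacheMode">REPL_ASYNC</attribute>
+
+   <!-- Queue up modifications rather than replicating each one immediately -->
+   <attribute name="UseReplQueue">true</attribute>
+
+   <!-- Flush the queue every 100 milliseconds... -->
+   <attribute name="ReplQueueInterval">100</attribute>
+
+   <!-- ...or as soon as it holds 10 modifications, whichever comes first -->
+   <attribute name="ReplQueueMaxElements">10</attribute>
+      </programlisting>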
- <para>
- Asynchronous replication is faster (no caller blocking), because
+ <para>Asynchronous replication is faster (no caller blocking), because
synchronous replication requires acknowledgments from all nodes in a
cluster that they received and applied the modification successfully
(round-trip time). However, when a synchronous replication returns
successfully, the caller knows for sure that all modifications have been
- applied at all nodes, whereas this is not be the case with
- asynchronous replication. With asynchronous replication, errors are
- simply written to a log. Even when using transactions, a transaction may
- succeed but replication may not succeed on all cache instances.
- </para>
+      applied at all nodes, whereas this is not the case with asynchronous
+ replication. With asynchronous replication, errors are simply written to
+ a log. Even when using transactions, a transaction may succeed but
+ replication may not succeed on all cache instances.</para>
<section id="replication.tx">
<title>Replicated Caches and Transactions</title>
- <para>
- When using transactions, replication only occurs at the transaction boundary - i.e., when a transaction
- commits. This results in minimising replication traffic since a single modification os broadcast rather
- than a series of individual modifications, and can be a lot more efficient than not using transactions.
- Another effect of this is that if a transaction were to roll back, nothing is broadcast across a cluster.
- </para>
- <para>
- Depending on whether you are running your cluster in asynchronous or synchronous mode, JBoss Cache will use
- either a single phase or <ulink url="http://en.wikipedia.org/wiki/Two-phase_commit_protocol">two phase commit</ulink>
- protocol, respectively.
- </para>
+
+ <para>When using transactions, replication only occurs at the
+ transaction boundary - i.e., when a transaction commits. This results
+        in minimising replication traffic, since a single modification is
+ broadcast rather than a series of individual modifications, and can be
+ a lot more efficient than not using transactions. Another effect of
+ this is that if a transaction were to roll back, nothing is broadcast
+ across a cluster.</para>
+
+ <para>Depending on whether you are running your cluster in
+ asynchronous or synchronous mode, JBoss Cache will use either a single
+ phase or <ulink
+ url="http://en.wikipedia.org/wiki/Two-phase_commit_protocol">two phase
+ commit</ulink> protocol, respectively.</para>
+
<section>
<title>One Phase Commits</title>
- <para>
- Used when your cache mode is REPL_ASYNC. All modifications are replicated in a single call, which
- instructs remote caches to apply the changes to their local in-memory state and commit locally. Remote
- errors/rollbacks are never fed back to the originator of the transaction since the communication is
- asynchronous.
- </para>
+
+ <para>Used when your cache mode is REPL_ASYNC. All modifications are
+ replicated in a single call, which instructs remote caches to apply
+ the changes to their local in-memory state and commit locally.
+ Remote errors/rollbacks are never fed back to the originator of the
+ transaction since the communication is asynchronous.</para>
</section>
+
<section>
<title>Two Phase Commits</title>
- <para>
- Used when your cache mode is REPL_SYNC. Upon committing your transaction, JBoss Cache broadcasts a
- prepare call, which carries all modifications relevant to the transaction. Remote caches then acquire
- local locks on their in-memory state and apply the modifications. Once all remote caches respond to the
- prepare call, the originator of the transaction broadcasts a commit. This instructs all remote caches to
- commit their data. If any of the caches fail to respond to the prepare phase, the originator broadcasts
- a rollback.
- </para>
- <para>
- Note that although the prepare phase is synchronous, the commit and rollback phases are asynchronous.
- This is because
- <ulink url="http://java.sun.com/products/jta/">Sun's JTA specification</ulink>
- does not specify how transactional resources should deal with failures
- at this stage of a transaction; and other resources participating in the transaction may have
- indeterminate state anyway. As such, we do away with the overhead of synchronous communication
- for this phase of the transaction. That said, they can be forced to be synchronous using the
- <literal>SyncCommitPhase</literal> and <literal>SyncRollbackPhase</literal> configuration attributes.
- </para>
+
+ <para>Used when your cache mode is REPL_SYNC. Upon committing your
+ transaction, JBoss Cache broadcasts a prepare call, which carries
+ all modifications relevant to the transaction. Remote caches then
+ acquire local locks on their in-memory state and apply the
+ modifications. Once all remote caches respond to the prepare call,
+ the originator of the transaction broadcasts a commit. This
+ instructs all remote caches to commit their data. If any of the
+ caches fail to respond to the prepare phase, the originator
+ broadcasts a rollback.</para>
+
+ <para>Note that although the prepare phase is synchronous, the
+ commit and rollback phases are asynchronous. This is because <ulink
+ url="http://java.sun.com/products/jta/">Sun's JTA
+ specification</ulink> does not specify how transactional resources
+ should deal with failures at this stage of a transaction; and other
+ resources participating in the transaction may have indeterminate
+ state anyway. As such, we do away with the overhead of synchronous
+ communication for this phase of the transaction. That said, they can
+ be forced to be synchronous using the
+ <literal>SyncCommitPhase</literal> and
+ <literal>SyncRollbackPhase</literal> configuration
+ attributes.</para>
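+
+          <para>A minimal sketch of forcing synchronous commit and
+          rollback phases (both attributes default to false):</para>
+
+          <programlisting>
+   <!-- Block until remote caches acknowledge the commit and rollback calls -->
+   <attribute name="SyncCommitPhase">true</attribute>
+   <attribute name="SyncRollbackPhase">true</attribute>
+          </programlisting>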
</section>
</section>
+
<section id="br">
<title>Buddy Replication</title>
- <para>
- Buddy Replication allows you to suppress replicating your data to all instances in a cluster. Instead,
- each instance picks one or more 'buddies' in the cluster, and only replicates to these specific buddies.
- This greatly helps scalability as there is no longer a memory and network traffic impact every time another
- instance is added to a cluster.
- </para>
- <para>
- One of the most common use cases of Buddy Replication is when a replicated cache is used by a servlet
- container to store HTTP session data. One of the pre-requisites to buddy replication working well and being
- a real benefit is the use of <emphasis>session affinity</emphasis>, more casually known as
- <emphasis>sticky sessions</emphasis> in HTTP session replication speak. What this means is that if
- certain data is frequently accessed, it is desirable that this is always accessed on one instance rather
- than in a round-robin fashion as this helps the cache cluster optimise how it chooses buddies, where it
- stores data, and minimises replication traffic.
- </para>
- <para>
- If this is not possible, Buddy Replication may prove to be more of an overhead than a benefit.
- </para>
+
+ <para>Buddy Replication allows you to suppress replicating your data
+ to all instances in a cluster. Instead, each instance picks one or
+ more 'buddies' in the cluster, and only replicates to these specific
+ buddies. This greatly helps scalability as there is no longer a memory
+ and network traffic impact every time another instance is added to a
+ cluster.</para>
+
+ <para>One of the most common use cases of Buddy Replication is when a
+ replicated cache is used by a servlet container to store HTTP session
+ data. One of the pre-requisites to buddy replication working well and
+ being a real benefit is the use of <emphasis>session
+ affinity</emphasis>, more casually known as <emphasis>sticky
+      sessions</emphasis> in HTTP session replication speak. What this means
+      is that if certain data is frequently accessed, it is desirable that it
+      is always accessed on one instance rather than in a round-robin
+      fashion, as this helps the cache cluster optimise how it chooses
+      buddies and where it stores data, and minimises replication
+      traffic.</para>
+
+ <para>If this is not possible, Buddy Replication may prove to be more
+ of an overhead than a benefit.</para>
+
<section>
<title>Selecting Buddies</title>
+
<figure>
<title>BuddyLocator</title>
<mediaobject>
<imageobject>
- <imagedata fileref="images/BuddyReplication.png"/>
+ <imagedata fileref="images/BuddyReplication.png" />
</imageobject>
</mediaobject>
</figure>
- <para>
- Buddy Replication uses an instance of a <literal>BuddyLocator</literal>
- which contains the logic used to select buddies in a network. JBoss Cache currently ships with a single
- implementation, <literal>NextMemberBuddyLocator</literal>, which is used as a default if no implementation
- is provided. The <literal>NextMemberBuddyLocator</literal> selects the next member in the cluster, as
- the name suggests, and guarantees an even spread of buddies for each instance.
- </para>
- <para>
- The
- <literal>NextMemberBuddyLocator</literal>
- takes in 2 parameters, both optional.
- <itemizedlist>
+
+ <para>Buddy Replication uses an instance of a
+ <literal>BuddyLocator</literal> which contains the logic used to
+ select buddies in a network. JBoss Cache currently ships with a
+ single implementation, <literal>NextMemberBuddyLocator</literal>,
+ which is used as a default if no implementation is provided. The
+ <literal>NextMemberBuddyLocator</literal> selects the next member in
+ the cluster, as the name suggests, and guarantees an even spread of
+ buddies for each instance.</para>
+
+ <para>The <literal>NextMemberBuddyLocator</literal> takes in 2
+ parameters, both optional. <itemizedlist>
+          <listitem>
+            <literal>numBuddies</literal> - specifies how many buddies each
+            instance should pick to back its data onto. This defaults to 1.
+          </listitem>
+
+          <listitem>
+            <literal>ignoreColocatedBuddies</literal> - means that each
+            instance will <emphasis>try</emphasis> to select a buddy on a
+            different physical host. If not able to do so though, it will
+            fall back to colocated instances. This defaults to
+            <literal>true</literal>.
+          </listitem>
+        </itemizedlist></para>
</section>
+
<section>
<title>BuddyPools</title>
- <para>
- Also known as <emphasis>replication groups</emphasis>, a buddy pool is an optional construct where each instance in a cluster
- may be configured with a buddy pool name. Think of this as an 'exclusive club membership' where when
- selecting buddies,
- <literal>BuddyLocator</literal>s that support buddy pools would try and select buddies sharing the same
- buddy pool name. This allows system administrators a degree of flexibility and control over how buddies
- are selected. For example, a sysadmin may put two instances on two separate physical servers that
- may be on two separate physical racks in the same buddy pool. So rather than picking an
+
+ <para>Also known as <emphasis>replication groups</emphasis>, a buddy
+ pool is an optional construct where each instance in a cluster may
+        be configured with a buddy pool name. Think of this as an 'exclusive
+        club membership': when selecting buddies,
+ <literal>BuddyLocator</literal>s that support buddy pools would try
+ and select buddies sharing the same buddy pool name. This allows
+ system administrators a degree of flexibility and control over how
+ buddies are selected. For example, a sysadmin may put two instances
+ on two separate physical servers that may be on two separate
+ physical racks in the same buddy pool. So rather than picking an
instance on a different host on the same rack,
- <literal>BuddyLocator</literal>s would rather pick
- the instance in the same buddy pool, on a separate rack which may add a degree of redundancy.
- </para>
+        <literal>BuddyLocator</literal>s would pick the instance in
+        the same buddy pool, on a separate rack, which may add a degree of
+        redundancy.</para>
</section>
+
<section>
<title>Failover</title>
- <para>
- In the unfortunate event of an instance crashing, it is assumed that the client connecting to the
- cache (directly or indirectly, via some other service such as HTTP session replication) is able to
- redirect the request to any other random cache instance in the cluster. This is where a concept of
- Data Gravitation comes in.
- </para>
- <para>
- Data Gravitation is a concept where if a request is made on a cache in the cluster and the cache does
- not contain this information, it then asks other instances in the cluster for the data. If even this
- fails, it would (optionally) ask other instances to check in the backup data they store for other caches.
- This means that even if a cache containing your session dies, other instances will still be able
- to access this data by asking the cluster to search through their backups for this data.
- </para>
- <para>
- Once located, this data is then transferred to the instance which requested it and is added to this
- instance's data tree. It is then (optionally) removed from all other instances (and backups) so
- that if session affinity is used, the affinity should now be to this new cache instance which has
- just
- <emphasis>taken ownership</emphasis>
- of this data.
- </para>
- <para>
- Data Gravitation is implemented as an interceptor. The following (all optional) configuration properties
- pertain to data gravitation.
- <itemizedlist>
+
+ <para>In the unfortunate event of an instance crashing, it is
+ assumed that the client connecting to the cache (directly or
+ indirectly, via some other service such as HTTP session replication)
+ is able to redirect the request to any other random cache instance
+        in the cluster. This is where the concept of Data Gravitation comes
+ in.</para>
+
+ <para>Data Gravitation is a concept where if a request is made on a
+ cache in the cluster and the cache does not contain this
+ information, it then asks other instances in the cluster for the
+ data. If even this fails, it would (optionally) ask other instances
+ to check in the backup data they store for other caches. This means
+ that even if a cache containing your session dies, other instances
+ will still be able to access this data by asking the cluster to
+ search through their backups for this data.</para>
+
+ <para>Once located, this data is then transferred to the instance
+ which requested it and is added to this instance's data tree. It is
+ then (optionally) removed from all other instances (and backups) so
+ that if session affinity is used, the affinity should now be to this
+ new cache instance which has just <emphasis>taken
+ ownership</emphasis> of this data.</para>
+
+ <para>Data Gravitation is implemented as an interceptor. The
+ following (all optional) configuration properties pertain to data
+ gravitation. <itemizedlist>
+          <listitem>
+            <literal>dataGravitationRemoveOnFind</literal> - forces all
+            remote caches that own the data or hold backups for the data
+            to remove that data, thereby making the requesting cache the
+            new data owner. If set to <literal>false</literal> an evict is
+            broadcast instead of a remove, so any state persisted in cache
+            loaders will remain. This is useful if you have a shared cache
+            loader configured. Defaults to <literal>true</literal>.
+          </listitem>
+
+          <listitem>
+            <literal>dataGravitationSearchBackupTrees</literal> - asks
+            remote instances to search through their backups as well as
+            main data trees. Defaults to <literal>true</literal>. The
+            resulting effect is that if this is <literal>true</literal>
+            then backup nodes can respond to data gravitation requests in
+            addition to data owners.
+          </listitem>
+
+          <listitem>
+            <literal>autoDataGravitation</literal> - whether data
+            gravitation occurs for every cache miss. By default this is
+            set to <literal>false</literal> to prevent unnecessary network
+            calls. Most use cases will know when it may need to gravitate
+            data and will pass in an <literal>Option</literal> to enable
+            data gravitation on a per-invocation basis. If
+            <literal>autoDataGravitation</literal> is
+            <literal>true</literal> this <literal>Option</literal> is
+            unnecessary.
+          </listitem>
+        </itemizedlist></para>
</section>
+
<section>
<title>Configuration</title>
- <para>
- <programlisting>
-<![CDATA[
- <!-- Buddy Replication config -->
- <attribute name="BuddyReplicationConfig">
- <config>
- <!-- Enables buddy replication. This is the ONLY mandatory configuration element here. -->
- <buddyReplicationEnabled>true</buddyReplicationEnabled>
+ <para><programlisting>
+
+ <!-- Buddy Replication config -->
+ <attribute name="BuddyReplicationConfig">
+ <config>
+
+ <!-- Enables buddy replication. This is the ONLY mandatory configuration element here. -->
+ <buddyReplicationEnabled>true</buddyReplicationEnabled>
- <!-- These are the default values anyway -->
- <buddyLocatorClass>org.jboss.cache.buddyreplication.NextMemberBuddyLocator</buddyLocatorClass>
+ <!-- These are the default values anyway -->
+ <buddyLocatorClass>org.jboss.cache.buddyreplication.NextMemberBuddyLocator</buddyLocatorClass>
- <!-- numBuddies is the number of backup nodes each node maintains. ignoreColocatedBuddies means that
+ <!-- numBuddies is the number of backup nodes each node maintains. ignoreColocatedBuddies means that
each node will *try* to select a buddy on a different physical host. If not able to do so though,
- it will fall back to colocated nodes. -->
- <buddyLocatorProperties>
+ it will fall back to colocated nodes. -->
+ <buddyLocatorProperties>
numBuddies = 1
ignoreColocatedBuddies = true
- </buddyLocatorProperties>
+ </buddyLocatorProperties>
- <!-- A way to specify a preferred replication group. If specified, we try and pick a buddy why shares
+      <!-- A way to specify a preferred replication group. If specified, we try and pick a buddy who shares
       the same pool name (falling back to other buddies if not available). This allows the sysadmin to hint at how
       backup buddies are picked, so for example, nodes may be hinted to pick buddies on a different physical rack
- or power supply for added fault tolerance. -->
- <buddyPoolName>myBuddyPoolReplicationGroup</buddyPoolName>
+ or power supply for added fault tolerance. -->
+ <buddyPoolName>myBuddyPoolReplicationGroup</buddyPoolName>
- <!-- Communication timeout for inter-buddy group organisation messages (such as assigning to and removing
- from groups, defaults to 1000. -->
- <buddyCommunicationTimeout>2000</buddyCommunicationTimeout>
-
- <!-- Whether data is removed from old owners when gravitated to a new owner. Defaults to true. -->
- <dataGravitationRemoveOnFind>true</dataGravitationRemoveOnFind>
-
- <!-- Whether backup nodes can respond to data gravitation requests, or only the data owner is supposed to respond.
- defaults to true. -->
- <dataGravitationSearchBackupTrees>true</dataGravitationSearchBackupTrees>
-
- <!-- Whether all cache misses result in a data gravitation request. Defaults to false, requiring callers to
- enable data gravitation on a per-invocation basis using the Options API. -->
- <autoDataGravitation>false</autoDataGravitation>
-
- </config>
- </attribute>
-]]>
- </programlisting>
- </para>
- </section>
- </section>
+ <!-- Communication timeout for inter-buddy group organisation messages (such as assigning to and removing
+      from groups), defaults to 1000. -->
+ <buddyCommunicationTimeout>2000</buddyCommunicationTimeout>
+ <!-- Whether data is removed from old owners when gravitated to a new owner. Defaults to true. -->
+ <dataGravitationRemoveOnFind>true</dataGravitationRemoveOnFind>
+ <!-- Whether backup nodes can respond to data gravitation requests, or only the data owner is supposed to respond.
+      Defaults to true. -->
+ <dataGravitationSearchBackupTrees>true</dataGravitationSearchBackupTrees>
+ <!-- Whether all cache misses result in a data gravitation request. Defaults to false, requiring callers to
+ enable data gravitation on a per-invocation basis using the Options API. -->
+ <autoDataGravitation>false</autoDataGravitation>
+
+ </config>
+ </attribute>
+
+ </programlisting></para>
+ </section>
</section>
</section>
+ </section>
+
<section>
<title>Invalidation</title>
+
<para>If a cache is configured for invalidation rather than replication,
- every time data is changed in a cache other caches in the cluster
- receive a message informing them that their data is now stale and should
- be evicted from memory. Invalidation, when used with a shared cache
- loader (see chapter on Cache Loaders) would cause remote caches to refer
- to the shared cache loader to retrieve modified data. The benefit of
- this is twofold: network traffic is minimised as invalidation messages
- are very small compared to replicating updated data, and also that other
- caches in the cluster look up modified data in a lazy manner, only when
- needed.
- </para>
+    every time data is changed in a cache, other caches in the cluster
+    receive a message informing them that their data is now stale and should
+    be evicted from memory. Invalidation, when used with a shared cache
+    loader (see the chapter on Cache Loaders), would cause remote caches to
+    refer to the shared cache loader to retrieve modified data. The benefit
+    of this is twofold: network traffic is minimised, as invalidation
+    messages are very small compared to replicating updated data; and other
+    caches in the cluster look up modified data in a lazy manner, only when
+    needed.</para>
<para>Invalidation messages are sent after each modification (no
transactions), or at the end of a transaction, upon successful commit.
This is usually more efficient as invalidation messages can be optimised
for the transaction as a whole rather than on a per-modification
- basis.
- </para>
+ basis.</para>
- <para>Invalidation too can be synchronous or asynchronous, and just as
- in the case of replication, synchronous invalidation blocks
- until all caches in the cluster receive invalidation messages and have
- evicted stale data while asynchronous invalidation works in a
- 'fire-and-forget' mode, where invalidation messages are broadcast but
- doesn't block and wait for responses.
- </para>
+    <para>Invalidation too can be synchronous or asynchronous, and just as in
+    the case of replication, synchronous invalidation blocks until all caches
+    in the cluster receive invalidation messages and have evicted stale data,
+    while asynchronous invalidation works in a 'fire-and-forget' mode, where
+    invalidation messages are broadcast without blocking and waiting for
+    responses.</para>
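+
+    <para>Invalidation is selected with the same
+    <literal>CacheMode</literal> attribute used for replication; a
+    sketch:</para>
+
+    <programlisting>
+   <!-- Broadcast invalidation messages synchronously.
+        Use INVALIDATION_ASYNC for fire-and-forget behaviour. -->
+   <attribute name="CacheMode">INVALIDATION_SYNC</attribute>
+    </programlisting>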
</section>
-
-
<section>
<title>State Transfer</title>
- <para><emphasis>State Transfer</emphasis> refers to the process by which a JBoss Cache instance
- prepares itself to begin providing a service by acquiring the current
- state from another cache instance and integrating that state into its
- own state.
- </para>
+ <para><emphasis>State Transfer</emphasis> refers to the process by which a
+ JBoss Cache instance prepares itself to begin providing a service by
+ acquiring the current state from another cache instance and integrating
+ that state into its own state.</para>
+
+ <section>
+ <title>State Transfer Types</title>
+
+      <para>State transfer can be classified in three ways. First, in the
+      context of a particular state transfer implementation - the underlying
+      plumbing - there are two starkly different state transfer types: byte
+      array based and streaming based state transfer. Second, state transfer
+      can be full or partial, depending on the subtree being transferred:
+      transfer of the entire cache tree is a full state transfer, while
+      transfer of a particular subtree is a partial state transfer. And
+      finally, state transfer can be "in-memory" or "persistent", depending
+      on the particular use of the cache.</para>
+ </section>
<section>
- <title>Types of State Transfer</title>
-
- <para>The state that is acquired and integrated can consist of two basic
- types:
- </para>
-
- <orderedlist>
- <listitem>
- <para>"Transient" or "in-memory" state. This consists of the actual
- in-memory state of another cache instance - the contents of the
- various in-memory nodes in the cache that is providing state are
- serialized and transferred; the recipient deserializes the data,
- creates corresponding nodes in its own in-memory tree, and populates
- them with the transferred data.
- </para>
-
- <para>"In-memory" state transfer is enabled by setting
- the cache's
- <literal>FetchInMemoryState</literal>
- configuration attribute to
- <literal>true</literal>
- .
- </para>
- </listitem>
- <listitem>
- <para>"Persistent" state. Only applicable if a non-shared
- cache loader is used. The state stored in the state-provider
- cache's persistent store is deserialized and transferred; the
- recipient passes the data to its own cache loader, which persists
- it to the recipient's persistent store.
- </para>
-
- <para>"Persistent" state transfer is enabled by setting
- a cache loader's
- <literal>fetchPersistentState</literal>
- attribute to
- <literal>true</literal>
- . If multiple cache loaders
- are configured in a chain, only one can have this property
- set to true; otherwise you will get an exception at startup.
- </para>
-
- <para>Persistent state transfer with a shared cache loader does
- not make sense, as the same persistent store that provides the
- data will just end up receiving it. Therefore, if a shared cache
- loader is used, the cache will not allow a persistent state
- transfer even if a cache loader has
- <literal>fetchPersistentState</literal>
- set to
- <literal>true</literal>
- .
- </para>
- </listitem>
- </orderedlist>
-
- <para>Which of these types of state transfer is appropriate depends on the usage
- of the cache.
- </para>
-
- <orderedlist>
- <listitem>
- <para>If a write-through cache loader is used, the current cache
- state is fully represented by the persistent state. Data may
- have been evicted from the in-memory state, but it will still be
- in the persistent store. In this case, if the cache loader is not
- shared, persistent state transfer is used to ensure the new cache
- has the correct state. In-memory state can be transferred as well
- if the desire is to have a "hot" cache -- one that has all
- relevant data in memory when the cache begins providing service.
- (Note that the "CacheLoaderPreload" configuration parameter can
- be used as well to provide a "warm" or "hot" cache without
- requiring an in-memory state transfer. This approach somewhat
- reduces the burden on the cache instance providing state, but
- increases the load on the persistent store on the recipient
- side.)
- </para>
- </listitem>
- <listitem>
- <para>If a cache loader is used with passivation, the full
- representation of the state can only be obtained by combining
- the in-memory (i.e. non-passivated) and persistent (i.e. passivated)
- states. Therefore an in-memory state transfer is necesssary. A
- persistent state transfer is necessary if the cache loader is
- not shared.
- </para>
- </listitem>
- <listitem>
- <para>If no cache loader is used and the cache is solely a
- write-aside cache (i.e. one that is used to cache data that can
- also be found in a persistent store, e.g. a database), whether
- or not in-memory state should be transferred depends on whether
- or not a "hot" cache is desired.
- </para>
- </listitem>
- </orderedlist>
+ <title>Byte array and streaming based state transfer</title>
+
+      <para>Byte array based transfer was the default, and only, transfer
+      methodology for the cache in all releases prior to 2.0. Byte array
+      based transfer loads the entire state to be transferred into a byte
+      array and sends it to the state receiving member. The major limitation
+      of this approach is that a very large state transfer (>1GB) would
+      likely result in an OutOfMemoryException. Streaming state transfer
+      provides an InputStream to a state reader and an OutputStream to a
+      state writer. The OutputStream and InputStream abstractions enable
+      state transfer in byte chunks, thus resulting in smaller memory
+      requirements. For example, if application state is represented as a
+      tree whose aggregate size is 1GB, rather than having to provide a 1GB
+      byte array, streaming state transfer transfers the state in chunks of
+      N bytes, where N is user configurable.</para>
+
+      <para>Byte array and streaming based state transfer are completely API
+      transparent and interchangeable, and are statically configured through
+      a standard cache configuration XML file. Refer to the JGroups
+      documentation on how to change from one type of transfer to the
+      other.</para>
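+
+      <para>As an illustrative sketch, the transfer type follows from which
+      state transfer protocol appears in the JGroups stack defined in the
+      cache's <literal>ClusterConfig</literal>; exact protocol names and
+      availability depend on the JGroups release in use:</para>
+
+      <programlisting>
+   <!-- byte array based state transfer -->
+   <pbcast.STATE_TRANSFER/>
+
+   <!-- or, streaming state transfer (available in recent JGroups releases) -->
+   <pbcast.STREAMING_STATE_TRANSFER/>
+      </programlisting>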
</section>
- <section>
- <title>When State Transfer Occurs</title>
- <para>If either in-memory or persistent state transfer is enabled, a full or
- partial state transfer will be done at various times, depending on how the
- cache is used. "Full" state transfer refers to the transfer of the state
- related to the entire tree -- i.e. the root node and all nodes below it.
- A "partial" state transfer is one where just a portion of the tree is
- transferred -- i.e. a node at a given Fqn and all nodes below it.
- </para>
+ <section>
+ <title>Full and partial state transfer</title>
+
+ <para>If either in-memory or persistent state transfer is enabled, a
+ full or partial state transfer will be done at various times, depending
+ on how the cache is used. "Full" state transfer refers to the transfer
+ of the state related to the entire tree -- i.e. the root node and all
+ nodes below it. A "partial" state transfer is one where just a portion
+ of the tree is transferred -- i.e. a node at a given Fqn and all nodes
+ below it.</para>
<para>If either in-memory or persistent state transfer is enabled, state
- transfer will occur at the following times:
- </para>
+ transfer will occur at the following times:</para>
<orderedlist>
<listitem>
<para>Initial state transfer. This occurs when the cache is first
- started (as part of the processing of the
- <literal>start()</literal>
- method). This is a full state transfer. The state is retrieved
- from the cache instance that has been operational the longest. If
- there is any problem receiving or integrating the state, the cache
- will not start.
- </para>
+ started (as part of the processing of the <literal>start()</literal>
+ method). This is a full state transfer. The state is retrieved from
+ the cache instance that has been operational the longest. If there
+ is any problem receiving or integrating the state, the cache will
+ not start.</para>
<para>Initial state transfer will occur unless:</para>
<orderedlist>
<listitem>
- <para>The cache's
- <literal>InactiveOnStartup</literal>
- property
- is
- <literal>true</literal>
- . This property is used in conjunction
- with region-based marshalling.
- </para>
+            <para>The cache's <literal>InactiveOnStartup</literal> property
+            is <literal>true</literal>. This property is used in
+            conjunction with region-based marshalling.</para>
</listitem>
+
<listitem>
- <para>Buddy replication is used. See below for more on
- state transfer with buddy replication.
- </para>
+ <para>Buddy replication is used. See below for more on state
+ transfer with buddy replication.</para>
</listitem>
</orderedlist>
</listitem>
<listitem>
<para>Partial state transfer following region activation. Only
- relevant when region-based marshalling is used. Here a special
- classloader is needed to unmarshal the state for a portion of
- the tree. State transfer cannot succeed until the application
- registers this classloader with the cache. Once the application
- registers its classloader, it calls
- <literal>cache.getRegion(fqn, true).activate()</literal>
- .
- As part of the region activation process, a partial state transfer
- of the relevant subtree's state is performed. The state is
- requested from the oldest cache instance in the cluster; if that
- instance responds with no state, state is requested from each
- instance one by one until one provides state or all instances have
- been queried.
- </para>
+          relevant when region-based marshalling is used. Here a special
+          classloader is needed to unmarshal the state for a portion of the
+          tree. State transfer cannot succeed until the application registers
+          this classloader with the cache. Once the application registers its
+          classloader, it calls <literal>cache.getRegion(fqn,
+          true).activate()</literal>. As part of the region activation
+ process, a partial state transfer of the relevant subtree's state is
+ performed. The state is requested from the oldest cache instance in
+ the cluster; if that instance responds with no state, state is
+ requested from each instance one by one until one provides state or
+ all instances have been queried.</para>
<para>Typically when region-based marshalling is used, the cache's
- <literal>InactiveOnStartup</literal>
- property is set to
- <literal>true</literal>
- . This suppresses initial state transfer,
- which would fail due to the inability to deserialize the
- transferred state.
- </para>
+          <literal>InactiveOnStartup</literal> property is set to
+          <literal>true</literal>. This suppresses initial state transfer,
+          which would fail due to the inability to deserialize the
+          transferred state (see the configuration sketch after this
+          list).</para>
</listitem>
<listitem>
@@ -506,41 +484,107 @@
cluster, it becomes the buddy of one or more other instances, and
one or more other instances become its buddy. Each time an instance
determines it has a new buddy providing backup for it, it pushes
- it's current state to the new buddy. This "pushing" of state to
- the new buddy is slightly different from other forms of state
- transfer, which are based on a "pull" approach (i.e. recipient
- asks for and receives state). However, the process of preparing
- and integrating the state is the same.
- </para>
-
- <para>This "push" of state upon buddy group formation only occurs
- if the
- <literal>InactiveOnStartup</literal>
- property is set to
- <literal>false</literal>
- . If it is
- <literal>true</literal>
- ,
- state transfer amongst the buddies only occurs when the application
- activates the region
- on the various
- members of the group.
- </para>
-
- <para>Partial state transfer following a region activation
- call is slightly different in the buddy replication case as well.
- Instead of requesting the partial state from one cache instance,
- and trying all instances until one responds, with buddy replication
- the instance that is activating a region will request partial
- state from each instance for which it is serving as a backup.
- </para>
+          its current state to the new buddy. This "pushing" of state to the
+ new buddy is slightly different from other forms of state transfer,
+ which are based on a "pull" approach (i.e. recipient asks for and
+ receives state). However, the process of preparing and integrating
+ the state is the same.</para>
+
+ <para>This "push" of state upon buddy group formation only occurs if
+ the <literal>InactiveOnStartup</literal> property is set to
+          <literal>false</literal>. If it is <literal>true</literal>, state
+ transfer amongst the buddies only occurs when the application
+ activates the region on the various members of the group.</para>
+
+ <para>Partial state transfer following a region activation call is
+ slightly different in the buddy replication case as well. Instead of
+ requesting the partial state from one cache instance, and trying all
+ instances until one responds, with buddy replication the instance
+ that is activating a region will request partial state from each
+ instance for which it is serving as a backup.</para>
</listitem>
</orderedlist>
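+
+      <para>As referenced above, a sketch of the attributes involved when
+      initial state transfer is suppressed in favour of per-region
+      activation (attribute names as per the standard configuration
+      file):</para>
+
+      <programlisting>
+   <!-- Required for region activation/inactivation to be used -->
+   <attribute name="UseRegionBasedMarshalling">true</attribute>
+
+   <!-- Suppress initial state transfer; the application activates
+        regions (triggering partial state transfer) later -->
+   <attribute name="InactiveOnStartup">true</attribute>
+      </programlisting>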
+ </section>
+ <section>
+ <title>Transient ("in-memory") and persistent state transfer</title>
- </section>
+ <para>The state that is acquired and integrated can consist of two basic
+ types:</para>
- </section>
+ <orderedlist>
+ <listitem>
+ <para>"Transient" or "in-memory" state. This consists of the actual
+ in-memory state of another cache instance - the contents of the
+ various in-memory nodes in the cache that is providing state are
+ serialized and transferred; the recipient deserializes the data,
+ creates corresponding nodes in its own in-memory tree, and populates
+ them with the transferred data.</para>
-</chapter>
+ <para>"In-memory" state transfer is enabled by setting the cache's
+ <literal>FetchInMemoryState</literal> configuration attribute to
+ <literal>true</literal> .</para>
+ </listitem>
+
+ <listitem>
+ <para>"Persistent" state. Only applicable if a non-shared cache
+ loader is used. The state stored in the state-provider cache's
+ persistent store is deserialized and transferred; the recipient
+ passes the data to its own cache loader, which persists it to the
+ recipient's persistent store.</para>
+
+ <para>"Persistent" state transfer is enabled by setting a cache
+ loader's <literal>fetchPersistentState</literal> attribute to
+          <literal>true</literal>. If multiple cache loaders are configured
+ in a chain, only one can have this property set to true; otherwise
+ you will get an exception at startup.</para>
+
+ <para>Persistent state transfer with a shared cache loader does not
+ make sense, as the same persistent store that provides the data will
+ just end up receiving it. Therefore, if a shared cache loader is
+ used, the cache will not allow a persistent state transfer even if a
+ cache loader has <literal>fetchPersistentState</literal> set to
+          <literal>true</literal>.</para>
+ </listitem>
+ </orderedlist>
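+
+      <para>A combined sketch of both settings, as referenced in the list
+      above (cache loader details are elided and the loader class shown is
+      purely illustrative):</para>
+
+      <programlisting>
+   <!-- Transfer the "in-memory" state to joining caches -->
+   <attribute name="FetchInMemoryState">true</attribute>
+
+   <attribute name="CacheLoaderConfiguration">
+     <config>
+       <cacheloader>
+         <class>org.jboss.cache.loader.FileCacheLoader</class>
+         <!-- Transfer the "persistent" state; only one loader in a
+              chain may set this to true -->
+         <fetchPersistentState>true</fetchPersistentState>
+       </cacheloader>
+     </config>
+   </attribute>
+      </programlisting>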
+
+ <para>Which of these types of state transfer is appropriate depends on
+ the usage of the cache.</para>
+
+ <orderedlist>
+ <listitem>
+ <para>If a write-through cache loader is used, the current cache
+ state is fully represented by the persistent state. Data may have
+ been evicted from the in-memory state, but it will still be in the
+ persistent store. In this case, if the cache loader is not shared,
+ persistent state transfer is used to ensure the new cache has the
+ correct state. In-memory state can be transferred as well if the
+ desire is to have a "hot" cache -- one that has all relevant data in
+ memory when the cache begins providing service. (Note that the
+ "CacheLoaderPreload" configuration parameter can be used as well to
+ provide a "warm" or "hot" cache without requiring an in-memory state
+ transfer. This approach somewhat reduces the burden on the cache
+ instance providing state, but increases the load on the persistent
+ store on the recipient side.)</para>
+ </listitem>
+ <listitem>
+ <para>If a cache loader is used with passivation, the full
+ representation of the state can only be obtained by combining the
+ in-memory (i.e. non-passivated) and persistent (i.e. passivated)
+ states. Therefore an in-memory state transfer is necessary. A
+ persistent state transfer is necessary if the cache loader is not
+ shared.</para>
+ </listitem>
+
+ <listitem>
+ <para>If no cache loader is used and the cache is solely a
+ write-aside cache (i.e. one that is used to cache data that can also
+ be found in a persistent store, e.g. a database), whether or not
+ in-memory state should be transferred depends on whether or not a
+ "hot" cache is desired.</para>
+ </listitem>
+ </orderedlist>
+ </section>
+ </section>
+</chapter>
\ No newline at end of file