[wildfly-dev] discussion about JSR352 Batch and JPA

Thu Sep 19 05:13:56 EDT 2013

On Wed, Sep 18, 2013 at 2:54 PM, Scott Marlow <smarlow at redhat.com> wrote:

> What are the requirements for supporting Batch section 11.6 [1]?  From
> looking at JSR352, I get that each "chunk" has its own JTA transaction.
>   I previously had heard that we only supported starting the transaction
> from the application processing code (via UserTransaction) but I think
> the Batch container/runtime should start a JTA transaction for each
> "chunk" that is processed.  What are we really doing for managing the
> JTA transactions for batch?
>

The spec says:

8.2.1 Each chunk is processed in a separate transaction. See section 9.7
for more
details on transactionality.

To me that implies the batch implementation starts the transaction itself,
although it does seem very vague. For instance looking at those diagrams it
looks like the transaction is committed even if an exception is thrown, and
it also does not seem to give you any possibility of managing the
transaction yourself, or doing non-transactional work.

>
>
> REGULAR CHUNK PROCESSING & JPA
>
>
> For the JPA support for regular chunk processing [1], the following will
> give a new underlying persistence context per chunk (jta tx):
>
> @PersistenceContext(unitName = "chunkNonpartitionedAZTT4443334")
>   EntityManager em;
>
>
> PARTITIONED CHUNK PROCESSING & JPA
>
>
> For the JPA support for partitioned chunk processing [2], the following
> will give a new underlying persistence context per chunk (jta tx):
>
> @PersistenceContext(unitName = "chunkpartitionedAZTT4443334")
>   EntityManager em;
>
> One concern that I have about partitioning is the performance impact of
> deadlocking and waiting for the JTA transaction to time out.  Depending
> on whether the work is configured to retry or not, hitting several dead
> locks in a batch process could defeat some of the performance gains of
> partitioning.  Avoiding deadlocks by always reading/writing/locking the
> underlying database resources in the same exact order, would help avoid
> deadlocks.
>

IMHO if you are attempting to partition work that results in a deadlock
then you are basically doing something wrong. Also, as each chunk should be
basically running the same code but with difference data, in general it
should acquire locks in the same order anyway.

Stuart

>
> Beyond the basic JPA capability of ensuring that each "chunk", has its
> own persistence context, how else can we help the batch processing
> experts writing JSR-352 applications on WildFly/Hibernate?
>
> Anything else to discuss about JPA & Batch?
>
> [1] Batch spec section section 11.6 Regular Chunk Processing
> https://gist.github.com/scottmarlow/6603746
>
> [2] Batch spec section Batch 11.7 Partitioned Chunk Processing
> https://gist.github.com/scottmarlow/6607667
>
> [3] persistence.xml https://gist.github.com/scottmarlow/6608533
>
> _______________________________________________
> wildfly-dev mailing list
> wildfly-dev at lists.jboss.org
> https://lists.jboss.org/mailman/listinfo/wildfly-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.jboss.org/pipermail/wildfly-dev/attachments/20130919/e621c03b/attachment.html