[jboss-dev-forums] [Design of Messaging on JBoss (Messaging/JBoss)] - Re: Optimisations: A couple of low hanging fruits going for

Wednesday, 14 May 2008

UTF-8 is a little more complex than that.  Code points 0-0x7F are represented by a single
byte with the same value.  Beyond that, characters are represented with 2 to 4 bytes.  This
means that every character costs at least one range comparison, and every non-ASCII
character also costs shifts and bitmasks to spread its bits across a lead byte and one or
more continuation bytes.

UTF-16 on the other hand, being the native in-memory encoding for Java strings, is written
one char at a time without transcoding - no range comparisons, no bitmasks, no
variable-length output.  It's just a straight write of each 16-bit char.  In terms of
per-character processing, you can't really do better than that.
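
To make the difference concrete, here is a minimal sketch of the two write paths.  It is
not the JBoss Messaging code: the class and method names (EncodingSketch, encodeUtf8,
encodeUtf16) are made up for illustration, big-endian byte order is an assumption, and
the UTF-8 path assumes well-formed input (no unpaired surrogates).

```java
import java.io.ByteArrayOutputStream;

// Hypothetical illustration only; names are not taken from any JBoss API.
class EncodingSketch {

    // UTF-8: each code point needs range checks to pick a length,
    // then shifts and masks to build the lead and continuation bytes.
    static byte[] encodeUtf8(String s) {
        ByteArrayOutputStream out = new ByteArrayOutputStream(s.length());
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);          // joins surrogate pairs
            i += Character.charCount(cp);
            if (cp < 0x80) {                    // 1 byte: 0xxxxxxx
                out.write(cp);
            } else if (cp < 0x800) {            // 2 bytes: 110xxxxx 10xxxxxx
                out.write(0xC0 | (cp >> 6));
                out.write(0x80 | (cp & 0x3F));
            } else if (cp < 0x10000) {          // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
                out.write(0xE0 | (cp >> 12));
                out.write(0x80 | ((cp >> 6) & 0x3F));
                out.write(0x80 | (cp & 0x3F));
            } else {                            // 4 bytes: 11110xxx + 3 continuation bytes
                out.write(0xF0 | (cp >> 18));
                out.write(0x80 | ((cp >> 12) & 0x3F));
                out.write(0x80 | ((cp >> 6) & 0x3F));
                out.write(0x80 | (cp & 0x3F));
            }
        }
        return out.toByteArray();
    }

    // UTF-16 (big-endian assumed, no byte order mark): every char becomes
    // exactly two bytes. No branching on the value and a fixed output size;
    // the single shift only splits the 16-bit char into its two bytes.
    static byte[] encodeUtf16(String s) {
        byte[] out = new byte[s.length() * 2];
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            out[2 * i] = (byte) (c >> 8);       // high byte
            out[2 * i + 1] = (byte) c;          // low byte
        }
        return out;
    }
}
```

Note that the sketch only compares per-character work.  UTF-16 output is always two bytes
per char, so for mostly-ASCII text it is roughly twice the size of the UTF-8 output.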

View the original post :
http://www.jboss.com/index.html?module=bb&op=viewtopic&p=4150822#...

Reply to the post :
http://www.jboss.com/index.html?module=bb&op=posting&mode=reply&a...
