[jboss-dev-forums] [JBoss AS Development] - JBoss and the "em dash" character

Thez do-not-reply at jboss.com
Tue Nov 17 08:30:43 EST 2009


Hope this is the correct forum to post this.

I have a JSP that prints out the contents of text files. With some files, JBoss truncates the output of the JSP page, even though the page executes to the end. I debugged this and found the following.

The files are saved as UTF-8 and contain the "em dash" character, Unicode U+2014 (decimal 8212), or & mdash; in HTML (without the space; it seems these forums can't handle the character either). The UTF-8 byte sequence is "E2 80 94".
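
For reference, the code point and byte sequence quoted above can be checked with a few lines of Java. This is only a sketch: the class name EmDashBytes and the helper toUtf8Hex are made up for illustration, and it assumes Java 7 or later for StandardCharsets.

```java
import java.nio.charset.StandardCharsets;

public class EmDashBytes {
    // Hypothetical helper: returns the UTF-8 encoding of s as
    // space-separated, uppercase hex bytes.
    static String toUtf8Hex(String s) {
        StringBuilder sb = new StringBuilder();
        for (byte b : s.getBytes(StandardCharsets.UTF_8)) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            // Mask to 0..255 so negative bytes format as two hex digits.
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String emDash = "\u2014";
        System.out.println((int) emDash.charAt(0)); // decimal code point
        System.out.println(toUtf8Hex(emDash));      // UTF-8 bytes in hex
    }
}
```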

Now, when I read the file contents line by line with a BufferedReader (wrapping an InputStreamReader opened with UTF-8) and print the lines out, the output is truncated at the first line containing the character, and nothing after that is printed.
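
The reading step described above can be sketched as a standalone program, which may help to separate the reading side from whatever the container does on output. This is not the code from the JSP: the class name Utf8LineReader, the method readLines and the in-memory sample data are assumptions for illustration, and it assumes Java 8 or later.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

public class Utf8LineReader {
    // Hypothetical helper: reads all lines from the stream, decoding as UTF-8.
    static List<String> readLines(InputStream in) {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        } catch (IOException e) {
            // Rethrow unchecked to keep the sketch short.
            throw new UncheckedIOException(e);
        }
        return lines;
    }

    public static void main(String[] args) {
        // Sample data standing in for a file; the second line has an em dash.
        byte[] data = "first line\nbefore \u2014 after\nlast line\n"
                .getBytes(StandardCharsets.UTF_8);
        for (String line : readLines(new ByteArrayInputStream(data))) {
            System.out.println(line);
        }
    }
}
```

If all three lines come back intact here, the reader decodes the character correctly, and the truncation would have to happen later, when the lines are written to the response.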

Is this a bug or a 'feature'? It seems like very strange behavior.



View the original post : http://www.jboss.org/index.html?module=bb&op=viewtopic&p=4266081#4266081



