[jboss-jira] [JBoss JIRA] Commented: (JBPORTAL-1727) CLONE -CMS saves pages in native encoding (ISO-8859-1 or Cp1251) but retrieve it in UTF-8. then national leters are damaged

Mariusz Smykula (JIRA) jira-events at lists.jboss.org
Wed Oct 3 03:44:03 EDT 2007


    [ http://jira.jboss.com/jira/browse/JBPORTAL-1727?page=comments#action_12380076 ] 
            
Mariusz Smykula commented on JBPORTAL-1727:
-------------------------------------------

Patch for issue JBPORTAL-832 is resolving this problem too.

> CLONE -CMS saves pages in native encoding (ISO-8859-1 or Cp1251) but retrieve it in UTF-8. then national leters are damaged
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: JBPORTAL-1727
>                 URL: http://jira.jboss.com/jira/browse/JBPORTAL-1727
>             Project: JBoss Portal
>          Issue Type: Bug
>      Security Level: Public(Everyone can see) 
>          Components: Portal CMS
>    Affects Versions: 2.6.1 Final
>         Environment: jboss4.2 (bundled version) and 4.0.5.GA
>            Reporter: Mariusz Smykula
>         Assigned To: Sohil Shah
>             Fix For: 2.6.3 Final
>
>
> When I input ru-characters in the text area it has been puted into table in native encoding (cp1251) - !!! not UTF-8. But pages are sends in UTF-8, and it damaged.
> I think that I have to encode the request in UTF-8 ??? 
> ---------------------------------------------
>  create file.html in Ru (in cms admin) with this text
> abc???
> then look at the last record of JBP_CMS_VERSION_BINVAL
> (six bytes in BINVAL_DATA field):
> 61 62 63 e0 e1 e2
> it is cp1251 encoding !!!!
> why this text non unicoded ???
> then when i retrive it i have - 'abc???'
> I think that problem isn't in ContentTypeInterceptor because it succefully
> sets UTF8, because localized resources seems right (in russian).
> may be it is need to store text in CLobs insdead blobs ??? (a use Derby) or customize Jackrabbit ???
> -----------------------------------------------------------
> but when I upload file on UTF-8 with russian characters - all correct.
> I think that cms saves the content in one byte encoding (cp1250 or same). 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.jboss.com/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        



More information about the jboss-jira mailing list