[teiid-issues] [JBoss JIRA] (TEIID-3489) Can a function be added to filter certain ASCII chars below 31 and above 127 out of an XML Document

Steven Hawkins (JIRA) issues at jboss.org
Wed Oct 14 15:49:00 EDT 2015


    [ https://issues.jboss.org/browse/TEIID-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13118417#comment-13118417 ] 

Steven Hawkins commented on TEIID-3489:
---------------------------------------

If the document is small and can be cast to a string, then the 8.12 regex replace function could be used:

regexpReplace(doc, '[^\x09\x0A\x0D\x20-\x7E]', '', 'g')

That will remove all invalid ascii characters.

> Can a function be added to filter certain ASCII chars below 31 and above 127 out of an XML Document
> ---------------------------------------------------------------------------------------------------
>
>                 Key: TEIID-3489
>                 URL: https://issues.jboss.org/browse/TEIID-3489
>             Project: Teiid
>          Issue Type: Enhancement
>          Components: Query Engine
>            Reporter: Debbie Steigner
>            Priority: Minor
>             Fix For: 9.0
>
>
> Add a function in Teiid to filter off all ASCII chars below 31 and above 127.
> web service data source that contains control characters due to data entered by copy and paste the text from another system.
> The characters in this instance were the print control chars ASCII 28 and 29, File Separator and Group Separator respectively. These are used for formatting in older systems. Unfortunately, the XML standard prohibits the processing of all ASCII characters below 31 apart from Tab, Line Feed and Carriage Return which are permissible. Also, ASCII chars above 127 to 255 will also cause XML parsers to crash.



--
This message was sent by Atlassian JIRA
(v6.4.11#64026)


More information about the teiid-issues mailing list