[
https://issues.jboss.org/browse/TEIID-3489?page=com.atlassian.jira.plugin...
]
Steven Hawkins commented on TEIID-3489:
---------------------------------------
If the document is small and can be cast to a string, then the 8.12 regex replace function
could be used:
regexpReplace(doc, '[^\x09\x0A\x0D\x20-\x7E]', '', 'g')
That will remove all invalid ascii characters.
Can a function be added to filter certain ASCII chars below 31 and
above 127 out of an XML Document
---------------------------------------------------------------------------------------------------
Key: TEIID-3489
URL:
https://issues.jboss.org/browse/TEIID-3489
Project: Teiid
Issue Type: Enhancement
Components: Query Engine
Reporter: Debbie Steigner
Priority: Minor
Fix For: 9.0
Add a function in Teiid to filter off all ASCII chars below 31 and above 127.
web service data source that contains control characters due to data entered by copy and
paste the text from another system.
The characters in this instance were the print control chars ASCII 28 and 29, File
Separator and Group Separator respectively. These are used for formatting in older
systems. Unfortunately, the XML standard prohibits the processing of all ASCII characters
below 31 apart from Tab, Line Feed and Carriage Return which are permissible. Also, ASCII
chars above 127 to 255 will also cause XML parsers to crash.
--
This message was sent by Atlassian JIRA
(v6.4.11#64026)