[JBoss JIRA] (TEIID-3748) Impala translator - SELECT and HAVING statements are translating differently for Case statements - teiid-issues

Wednesday, 3 February 2016

     [ https://issues.jboss.org/browse/TEIID-3748?page=com.atlassian.jira.plugin... ]

Steven Hawkins updated TEIID-3748:
----------------------------------
    Issue Type: Bug  (was: Quality Risk)

I was able to reproduce this locally.  With the initial query:

SELECT stringkey, sum(intnum),count(distinct case when floatnum >= 0 then 1 end) FROM
smalla WHERE intkey=6 GROUP BY stringkey HAVING sum(intnum)>100 AND count(distinct case
when floatnum >= 0 then 1 end)=0

The pushdown is:

SELECT g_0.stringkey, SUM(g_0.intnum),
COUNT(DISTINCT CASE WHEN g_0.floatnum >= 0.0 THEN 1 END)
FROM `smalla` AS g_0 WHERE g_0.intkey = 6
GROUP BY g_0.stringkey HAVING SUM(g_0.intnum) > 100 AND
COUNT(DISTINCT CASE WHEN g_0.floatnum >= convert(0, float) THEN 1 END) = 0

So the non-evaluated cast in the HAVING clause (convert(0, float), where the SELECT has the literal 0.0) is what causes the issue for Impala.
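
The mismatch in the pushdown above can be checked mechanically. The following is only a minimal sketch, not part of Teiid: extract_calls is a hypothetical helper that pulls out each COUNT(...) call by balanced parentheses (it ignores parentheses inside string literals), so the SELECT and HAVING copies can be compared as text.

```python
def extract_calls(sql, name="COUNT("):
    """Return every call starting with `name`, matched by balanced parentheses."""
    calls = []
    start = sql.find(name)
    while start != -1:
        depth = 0
        for i in range(start, len(sql)):
            if sql[i] == "(":
                depth += 1
            elif sql[i] == ")":
                depth -= 1
                if depth == 0:
                    calls.append(sql[start:i + 1])
                    break
        start = sql.find(name, start + len(name))
    return calls


# The pushdown query quoted in this comment, joined into one string.
PUSHDOWN = (
    "SELECT g_0.stringkey, SUM(g_0.intnum), "
    "COUNT(DISTINCT CASE WHEN g_0.floatnum >= 0.0 THEN 1 END) "
    "FROM `smalla` AS g_0 WHERE g_0.intkey = 6 "
    "GROUP BY g_0.stringkey HAVING SUM(g_0.intnum) > 100 AND "
    "COUNT(DISTINCT CASE WHEN g_0.floatnum >= convert(0, float) THEN 1 END) = 0"
)

count_calls = extract_calls(PUSHDOWN)
select_count, having_count = count_calls
mismatch = select_count != having_count

print("SELECT:", select_count)
print("HAVING:", having_count)
print("mismatch:", mismatch)
```

The two aggregates are meant to be the same expression, but a purely textual comparison flags them as different because only the HAVING copy carries the unevaluated convert call.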

...
 Impala translator - SELECT and HAVING statements are translating differently for Case statements

------------------------------------------------------------------------------------------------

                 Key: TEIID-3748
                 URL: https://issues.jboss.org/browse/TEIID-3748
             Project: Teiid
          Issue Type: Bug
          Components: JDBC Connector
    Affects Versions: 8.11.4
         Environment: Ubuntu Trusty
            Reporter: Don Krapohl
            Assignee: Steven Hawkins
              Labels: Impala_Translator, Translators
         Attachments: server.log

 Error from Impala-
 all DISTINCT aggregate functions need to have the same set of parameters as
count(DISTINCT (CASE WHEN (secondcol >= 0) THEN 1 ELSE CAST(NULL AS STRING) END))
 deviating function: count(DISTINCT (CASE WHEN (secondcol >= 0) THEN 1 ELSE NULL END))
 Query:
 SELECT user_key, sum(firstcol),count(distinct case when secondcol >= 0 then 1 end) 
 FROM sometable 
 WHERE customer_key=6
 GROUP BY user_key 
 HAVING sum(firstcol)>100 
 	AND count(distinct case when secondcol >= 0 then 1 end)=0

 Query explanation:
 For all users:
 Add up the values in the firstcol column (an integer column).
 Count distinct values where the secondcol value is zero or more;
 	otherwise return null (output is a string).
 Translated Teiid query:
 SELECT user_key, SUM(firstcol) as `EXPR_0`, COUNT(DISTINCT (CASE WHEN (secondcol >= 0)
THEN '1' ELSE CAST(NULL AS STRING) END)) as `EXPR_1`
 FROM sometable 
 WHERE `customer_key` = 6
 HAVING (EXPR_0 > 100) AND (COUNT(DISTINCT (CASE WHEN (secondcol >= 0) THEN
'1' ELSE NULL END)) = 0)
 Note the difference between the select and having for EXPR_1:
 Select - THEN '1' ELSE CAST(NULL AS STRING) END
 Having - THEN '1' ELSE NULL END
 Impala doesn't accept that these are the same aggregate function.  Aliases aren't
accepted in the HAVING.
 One further observation: if we swap the translation and write the statement in the select as
 COUNT(DISTINCT (CASE WHEN (secondcol >= 0) THEN '1' *ELSE NULL END*))
 Teiid translates the SELECT to
 COUNT(DISTINCT (CASE WHEN (secondcol >= 0) THEN '1' *ELSE CAST(NULL AS STRING) END*))
 So the SELECT and HAVING forms always end up mismatched.
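
 The two CASE forms differ only in how the null is typed, so they should produce identical results. The following is a minimal sketch using Python's sqlite3 module as a stand-in engine (an assumption: SQLite is not Impala, it uses TEXT where Impala uses STRING, and it does not enforce Impala's same-parameters rule for DISTINCT aggregates). The table contents are made-up sample rows, and the GROUP BY from the original query is included.

```python
import sqlite3

# The two CASE forms from the report (TEXT stands in for Impala's STRING).
PLAIN = "CASE WHEN secondcol >= 0 THEN '1' ELSE NULL END"
CAST = "CASE WHEN secondcol >= 0 THEN '1' ELSE CAST(NULL AS TEXT) END"

QUERY = """
SELECT user_key, SUM(firstcol), COUNT(DISTINCT {select_case})
FROM sometable
WHERE customer_key = 6
GROUP BY user_key
HAVING SUM(firstcol) > 100 AND COUNT(DISTINCT {having_case}) = 0
"""


def run(conn, select_case, having_case):
    """Run the report's query with the given CASE text in SELECT and HAVING."""
    sql = QUERY.format(select_case=select_case, having_case=having_case)
    return conn.execute(sql).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sometable "
    "(user_key TEXT, customer_key INTEGER, firstcol INTEGER, secondcol REAL)"
)
# Hypothetical sample rows, chosen so that only user 'a' passes both filters.
conn.executemany(
    "INSERT INTO sometable VALUES (?, ?, ?, ?)",
    [
        ("a", 6, 60, -1.0),
        ("a", 6, 70, -2.0),
        ("b", 6, 200, 5.0),
        ("c", 6, 10, -1.0),
        ("d", 7, 500, -1.0),
    ],
)

matched = run(conn, PLAIN, PLAIN)  # same form in SELECT and HAVING
mixed = run(conn, CAST, PLAIN)     # the mismatched form Teiid generated

print(matched)
print(mixed)
```

 Both runs return the same rows in SQLite, which supports the point that the mismatch is textual rather than semantic; Impala rejects the mixed form only because it requires the DISTINCT aggregates to have the same parameters.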

--
This message was sent by Atlassian JIRA
(v6.4.11#64026)
