[jboss-jira] [JBoss JIRA] (SWSQE-324) Routing issue with zalenium

Filip Brychta (JIRA) issues at jboss.org
Thu Jul 26 08:02:00 EDT 2018


    [ https://issues.jboss.org/browse/SWSQE-324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13610951#comment-13610951 ] 

Filip Brychta commented on SWSQE-324:
-------------------------------------

This issue is not related to zalenium. Here is what is happening:
OCP nodes use dnsmasq for domain name resolution. dnsmasq keeps a list of upstream DNS servers and, based on some algorithm, chooses the fastest one for a given query.
To get debug logging of queries, uncomment log-queries in /etc/dnsmasq.conf.

For a reason not yet known, from time to time none of the queried upstream servers responds to a query, which results in the domain name resolution failure seen in this ticket:
Jul 26 03:50:27 b20 dnsmasq[7295]: query[A] kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com from 10.129.1.167
Jul 26 03:50:27 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.16.23.54
There is no response, so the query is retried after 5 s and forwarded to all upstream DNS servers:
Jul 26 03:50:32 b20 dnsmasq[7295]: query[A] kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com from 10.129.1.167
Jul 26 03:50:32 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.16.23.35
Jul 26 03:50:32 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.38.5.26
Jul 26 03:50:32 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.5.30.160
Jul 26 03:50:32 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.11.5.19
Jul 26 03:50:32 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.16.23.54
There is still no response, so it is retried again after another 5 s:
Jul 26 03:50:37 b20 dnsmasq[7295]: query[A] kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com from 10.129.1.167
Jul 26 03:50:37 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.16.23.35
Jul 26 03:50:37 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.38.5.26
Jul 26 03:50:37 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.5.30.160
Jul 26 03:50:37 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.11.5.19
Jul 26 03:50:37 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.16.23.54
There is still no response, so the resolution fails.
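
To catch the failure from the client side and match it against the dnsmasq log, a small probe along these lines could be run from the affected pod. This is only a sketch: the function name probe, the three attempts, and the 5 s delay are my assumptions chosen to mirror the timing in the log above, not something taken from the actual resolver configuration.

```python
import socket
import time


def probe(host, attempts=3, delay=5.0, resolver=socket.getaddrinfo, sleep=time.sleep):
    """Try to resolve `host` up to `attempts` times.

    Returns a list of (attempt_number, outcome) tuples, where outcome is
    either a sorted list of resolved addresses or the socket.gaierror raised.
    `resolver` and `sleep` are injectable so the probe can be tested offline.
    """
    results = []
    for attempt in range(1, attempts + 1):
        try:
            infos = resolver(host, None)
            # Each entry is (family, type, proto, canonname, sockaddr);
            # sockaddr[0] is the address string.
            addrs = sorted({info[4][0] for info in infos})
            results.append((attempt, addrs))
            break  # stop at the first successful resolution
        except socket.gaierror as err:
            results.append((attempt, err))
            if attempt < attempts:
                sleep(delay)
    return results


if __name__ == "__main__":
    # Hostname taken from the log lines in this comment.
    host = "kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com"
    for attempt, outcome in probe(host):
        print(time.strftime("%b %d %H:%M:%S"), attempt, outcome)
```

The printed timestamps use the same format as the dnsmasq log, so a failed attempt can be lined up with the corresponding query and forwarded lines.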

What is really weird is that other queries during this time period work fine, e.g.:
Jul 26 03:50:28 b20 dnsmasq[7295]: query[A] kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com from 10.129.1.169
Jul 26 03:50:28 b20 dnsmasq[7295]: forwarded kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com to 10.16.23.54
Jul 26 03:50:28 b20 dnsmasq[7295]: reply kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com is 10.16.23.38

I have no idea why it fails 3 times for 10.129.1.167 when it's working fine for other clients.
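
To find such cases without reading the log by hand, the query/forwarded/reply lines can be grouped per query and the ones without a reply listed per client. A minimal sketch, assuming the log format shown above; the function name unanswered_queries is hypothetical. Because forwarded and reply lines in this format do not name the client, the sketch attributes them to the most recent query line for the same name, which is only a heuristic.

```python
import re

# Matches the dnsmasq log lines quoted in this comment, e.g.
# "Jul 26 03:50:27 b20 dnsmasq[7295]: query[A] <name> from 10.129.1.167"
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+[\d:]{8})\s+\S+\s+dnsmasq\[\d+\]:\s+"
    r"(?P<kind>query\[\w+\]|forwarded|reply)\s+(?P<name>\S+)\s+"
    r"(?:from|to|is)\s+(?P<value>\S+)$"
)


def unanswered_queries(lines):
    """Return (timestamp, client, name, forward_count) for queries with no reply.

    Heuristic: forwarded/reply lines are attributed to the most recent
    query line for the same name, since they do not identify the client.
    """
    queries = []  # one dict per query line, in log order
    latest = {}   # name -> most recent query dict for that name
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # ignore lines in other formats
        kind, name = m["kind"], m["name"]
        if kind.startswith("query"):
            q = {"ts": m["ts"], "client": m["value"], "name": name,
                 "forwards": 0, "answered": False}
            queries.append(q)
            latest[name] = q
        elif name in latest:
            if kind == "forwarded":
                latest[name]["forwards"] += 1
            else:  # reply
                latest[name]["answered"] = True
    return [(q["ts"], q["client"], q["name"], q["forwards"])
            for q in queries if not q["answered"]]


# A subset of the log lines from this comment, in chronological order.
NAME = "kiali-istio-system.openshift.jonqe.lab.eng.bos.redhat.com"
P = "b20 dnsmasq[7295]:"
SAMPLE = [
    f"Jul 26 03:50:27 {P} query[A] {NAME} from 10.129.1.167",
    f"Jul 26 03:50:27 {P} forwarded {NAME} to 10.16.23.54",
    f"Jul 26 03:50:28 {P} query[A] {NAME} from 10.129.1.169",
    f"Jul 26 03:50:28 {P} forwarded {NAME} to 10.16.23.54",
    f"Jul 26 03:50:28 {P} reply {NAME} is 10.16.23.38",
    f"Jul 26 03:50:32 {P} query[A] {NAME} from 10.129.1.167",
    f"Jul 26 03:50:32 {P} forwarded {NAME} to 10.16.23.35",
    f"Jul 26 03:50:32 {P} forwarded {NAME} to 10.38.5.26",
    f"Jul 26 03:50:32 {P} forwarded {NAME} to 10.5.30.160",
    f"Jul 26 03:50:32 {P} forwarded {NAME} to 10.11.5.19",
    f"Jul 26 03:50:32 {P} forwarded {NAME} to 10.16.23.54",
]

if __name__ == "__main__":
    for ts, client, name, forwards in unanswered_queries(SAMPLE):
        print(ts, client, name, f"forwarded {forwards}x, no reply")
```

When two clients ask for the same name at nearly the same moment, this attribution can be wrong. If the installed dnsmasq version supports it, I believe setting log-queries=extra adds a serial number and the requester's address to every line, which would make the grouping exact.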







> Routing issue with zalenium
> ---------------------------
>
>                 Key: SWSQE-324
>                 URL: https://issues.jboss.org/browse/SWSQE-324
>             Project: Kiali QE
>          Issue Type: QE Task
>            Reporter: Jeeva Kandasamy
>            Assignee: Filip Brychta
>
> I see the following error (I guess it is a routing error) about once a week:
> ------
> http_conn_args = {}, host = 'zalenium.prod.os.jonqe.lab.eng.bos.redhat.com'
> h = <httplib.HTTPConnection instance at 0x7f45e47c6c68>
> err = gaierror(-2, 'Name or service not known')
> ------
> Detailed report: http://jenkins2.bc.jonqe.lab.eng.bos.redhat.com:8080/job/run-python-ui-tests/437/display/redirect



--
This message was sent by Atlassian JIRA
(v7.5.0#75005)

