New Hawkular Blog Post: Hawkular Alerts with OpenTracing

Wednesday, 6 September 2017

New Hawkular blog post from noreply(a)hawkular.org (John Mazzitelli): http://ift.tt/2xc8MUQ

Two recent blogs discuss how OpenTracing instrumentation can be used to collect
application metrics:

http://ift.tt/2rX1NbW

http://ift.tt/2vWrA6Y

A further interesting integration can be the addition of Hawkular Alerts to the
environment.

As the previous blog and demo discuss, Hawkular Alerts is a generic, federated alerts
system that can trigger events, alerts, and notifications from different, independent
systems such as Prometheus, ElasticSearch, and Kafka.

Here we can combine the two. Let’s follow the directions for the OpenTracing demo (using
the Jaeger implementation) and add Hawkular Alerts.

This shows OpenTracing application metrics triggering alerts when, as in this example,
OpenTracing spans encounter a larger-than-expected error rate.

(Note: these instructions assume you are using Kubernetes / Minikube. See the Hawkular
OpenTracing blogs linked above for more details on these instructions.)

START KUBERNETES

Here we start minikube giving it enough resources to run all of the pods necessary for
this demo. We also start up a browser pointing to the Kubernetes dashboard, so you can
follow the progress of the remaining instructions.

minikube start --cpus 4 --memory 8192

minikube dashboard

DEPLOY PROMETHEUS

kubectl create -f http://ift.tt/2wHvySK

kubectl create -f http://ift.tt/2tgfO8q

(Note: the last command might not work depending on your version - if you get errors,
download a copy of prometheus-kubernetes.yml and edit it, changing “v1alpha1” to “v1”)
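
The edit described in the note can be scripted rather than done by hand. The following is a minimal sketch, assuming a blanket replacement of “v1alpha1” with “v1” is all the manifest needs; the function name patch_api_version and the stand-in manifest content are hypothetical and are not taken from the real prometheus-kubernetes.yml.

```shell
# Sketch: rewrite every "v1alpha1" in a manifest to "v1" and print the result.
# Usage: patch_api_version <manifest-file>
patch_api_version() {
  sed 's/v1alpha1/v1/g' "$1"
}

# Demonstration on a stand-in manifest (hypothetical content, not the real file).
sample=$(mktemp)
printf 'apiVersion: example.io/v1alpha1\nkind: Example\n' > "$sample"
patch_api_version "$sample"
rm -f "$sample"
```

With a downloaded copy of the manifest, the output could be redirected to a new file (for example, patch_api_version prometheus-kubernetes.yml > patched.yml, where patched.yml is an arbitrary name) and that file passed to kubectl create -f.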

DEPLOY JAEGER

kubectl create -f http://ift.tt/2tfYiRY

The following will build and deploy the Jaeger example code that will produce the
OpenTracing data for the demo:

mkdir -p ${HOME}/opentracing ; cd ${HOME}/opentracing

git clone git@github.com:objectiser/opentracing-prometheus-example.git

cd opentracing-prometheus-example/simple

eval $(minikube docker-env)

mvn clean install docker:build

kubectl create -f services-kubernetes.yml

(Note: The last command might not work depending on your version - if you get errors, edit
services-kubernetes.yml, changing “v1alpha1” to “v1”)

DEPLOY HAWKULAR-ALERTS AND CREATE ALERT TRIGGER

The following will deploy Hawkular Alerts. The trigger definition created afterwards will
fire an alert when the Jaeger OpenTracing data indicates an error rate that is over 30%.

kubectl create -f http://ift.tt/2w86alD

Next use minikube service hawkular-alerts --url to determine the Hawkular Alerts URL and
point your browser to the path “/hawkular/alerts/ui” at that URL (i.e.
http://host:port/hawkular/alerts/ui).
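
As a small convenience, the UI address can be assembled from the base URL that minikube reports. This is only a sketch: the helper name alerts_ui_url is hypothetical, and the host and port in the demonstration are placeholders, not values from a real cluster.

```shell
# Sketch: append the Hawkular Alerts UI path to a base URL.
# Usage: alerts_ui_url <base-url>
alerts_ui_url() {
  # "${1%/}" drops a single trailing slash, if there is one.
  printf '%s/hawkular/alerts/ui\n' "${1%/}"
}

# Placeholder base URL (hypothetical host and port) so the sketch runs without a cluster.
alerts_ui_url 'http://192.168.99.100:30080'

# Against a running cluster this would be:
#   alerts_ui_url "$(minikube service hawkular-alerts --url)"
```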

...
From the browser page running the Hawkular Alerts UI, enter a tenant name in the top right
text field (“my-organization” for example) and click the “Change” button.

Navigate to the “Triggers” page (found in the left-hand nav menu).

Click the kebab menu icon at the top and select “New Trigger”.

In the text area, enter the following to define a new trigger that will fire alerts when
the Prometheus query shows an error rate above 30% in the accountmgr or ordermgr
services:

{
   "trigger":{
      "id":"jaeger-prom-trigger",
      "name":"High Error Rate",
      "description":"Data indicates high error rate",
      "severity":"HIGH",
      "enabled":true,
      "autoDisable":false,
      "tags":{
         "prometheus":"Test"
      },
      "context":{
         "prometheus.url":"http://prometheus:9090"
      }
   },
   "conditions":[
      {
         "type":"EXTERNAL",
         "alerterId":"prometheus",
         "dataId":"prometheus-test",
         "expression":"(sum(increase(span_count{error=\"true\",span_kind=\"server\"}[1m])) without (pod,instance,job,namespace,endpoint,transaction,error,operation,span_kind) / sum(increase(span_count{span_kind=\"server\"}[1m])) without (pod,instance,job,namespace,endpoint,transaction,error,operation,span_kind)) > 0.3"
      }
   ]
}

Figure 1: Create New Alert Trigger

Figure 2: Alert Trigger
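
The trigger expression divides the increase in error spans over the last minute by the increase in all server spans over the same minute, and fires when that ratio is strictly greater than 0.3. The arithmetic can be illustrated outside Prometheus with a rough sketch; the function name error_rate_exceeds and the span counts are made up for illustration, and treating an empty window as “no alert” is an assumption of this sketch.

```shell
# Sketch: succeed (exit status 0) when errors/total is strictly above the threshold.
# Usage: error_rate_exceeds <error-spans> <total-spans> <threshold>
error_rate_exceeds() {
  awk -v e="$1" -v t="$2" -v th="$3" 'BEGIN {
    if (t == 0) exit 1        # no spans in the window: assumed to mean "no alert"
    exit !((e / t) > th)      # exit status 0 means the rate is above the threshold
  }'
}

# Hypothetical one-minute span counts.
if error_rate_exceeds 1 3 0.3; then echo "1 of 3 failed: alert"; fi
if ! error_rate_exceeds 2 10 0.3; then echo "2 of 10 failed: no alert"; fi
```

One failed span out of three gives a ratio of about 0.33, which is above the threshold, while two out of ten gives 0.2, which is not.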

Now navigate back to the “Dashboard” page (again via the left-hand nav menu). From this
Dashboard page, look for alerts when they are triggered. We’ll next start generating the
data that will trigger these alerts.

GENERATE SOME SAMPLE OPENTRACING APPLICATION DATA

export ORDERMGR=$(minikube service ordermgr --url)

${HOME}/opentracing/opentracing-prometheus-example/simple/genorders.sh

Once the data starts to be collected, you will see alerts in the Hawkular Alerts UI
whenever the error rate over the past minute exceeds 30% (as per the Prometheus query).

Figure 3: Alerts Dashboard

Figure 4: Alert

If you look at the alerts information in the Hawkular Alerts UI, you’ll see the conditions
that triggered the alerts. For example, one such alert could look like this:

Time: 2017-09-01 17:41:17 -0400
External[prometheus]: prometheus-test[Event [tenantId=my-organization,
id=1a81471d-340d-4dba-abe9-5b991326dc80, ctime=1504302077288, category=prometheus,
dataId=prometheus-test, dataSource=none, text=[1.504302077286E9, 0.3333333333333333],
context={service=ordermgr, version=0.0.1}, tags={}, trigger=null]] matches
[(sum(increase(span_count{error="true",span_kind="server"}[1m])) without
(pod,instance,job,namespace,endpoint,transaction,error,operation,span_kind) /
sum(increase(span_count{span_kind="server"}[1m])) without
(pod,instance,job,namespace,endpoint,transaction,error,operation,span_kind)) >
0.3]

Notice the “ordermgr” service (version “0.0.1”) had an error rate of 0.3333 (33%), which
caused the alert since it is above the allowed 30% threshold.
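
The text field of the event above appears to hold a timestamp and value pair, with the error rate as the second element. As a hedged sketch of how that value could be pulled out and shown as a percentage (the function name alert_value_percent is hypothetical, and the pair layout is inferred from the example alert rather than from Hawkular documentation):

```shell
# Sketch: extract the second element of a "[timestamp, value]" pair
# and print it as a percentage with one decimal place.
# Usage: alert_value_percent '<pair>'
alert_value_percent() {
  value=$(printf '%s\n' "$1" | tr -d '[] ' | cut -d, -f2)
  awk -v v="$value" 'BEGIN { printf "%.1f%%\n", v * 100 }'
}

# The pair copied from the example alert above.
alert_value_percent '[1.504302077286E9, 0.3333333333333333]'   # prints 33.3%
```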

At this point, the Hawkular Alerts UI provides the ability for system admins to log notes
about the issue, acknowledge the alert and mark the alert resolved if the underlying issue
has been fixed. These lifecycle functions (also available as REST operations) are just
part of the value add of Hawkular-Alerts.

You could do more complex things such as only trigger this alert if this Prometheus query
generated results AND some other condition was true (say, ElasticSearch logs match a
particular pattern, or if a Kafka topic had certain data). This demo merely scratches the
surface, but does show how Hawkular Alerts can be used to work with OpenTracing to provide
additional capabilities that may be found useful by system administrators and IT support
personnel.

from Hawkular Blog
