Steadybit logoResilience Hub
Try SteadybitGitHub icon
Steadybit logoResilience Hub

Grafana

Extension

Extension

A Steadybit check implementation for data exposed through Grafana.
Install now

Grafana

A Steadybit check implementation for data exposed through Grafana.
Extension

Extension

Install now

Grafana

Extension

Extension

A Steadybit check implementation for data exposed through Grafana.
Install now

Grafana

A Steadybit check implementation for data exposed through Grafana.
Extension

Extension

Install now
Go back to list
YouTube content is not loaded by default for privacy reasons.

Introduction to the Grafana Extension

The Steadybit Grafana Extension bridges the world of Steadybit and Grafana. The extension adds checks to your Chaos Engineering experiments to validate Grafana Alert rules state and reports events of your experiments to Grafana as annotations to ease correlation.

Integration and Functionality

Integration of Grafana into Steadybit works via the Grafana API. Thus, all you need is a Grafana's Service Account Token.

Integration of Grafana in Steadybit

With the Alert Rule Check you can integrate your Grafana alert rules into your experiments. Verify that Grafana alert rules notice a fault injected by Steadybit to check that your observability strategy is working as expected.

Integration of Steadybit in Grafana

The extension automatically reports experiment executions to Grafana, which helps you to correlate experiments with your dashboards.

Installation and Setup

To integrate the Grafana extension with your environment, follow our setup guide.

Statistics
-Stars
Tags
Grafana
Kubernetes
Check
Observability
Monitoring
Homepage
hub.steadybit.com/extension/com.steadybit.extension_grafana
License
MIT
MaintainerSteadybit
Install now

Provided Target Discovery

See all
Grafana alert rules

Provided Actions

See all
alert-rule check

Collects information about the alert-rule state and optionally verifies that the monitor has an expected state.

Check

Check

Grafana alert rules

Useful Templates

See all
Grafana alert rule fires when a Kubernetes pod is in crash loop

Verify that a Grafana alert rule alerts you when pods are not ready to accept traffic for a certain time.

Motivation

Kubernetes features a readiness probe to determine whether your pod is ready to accept traffic. If it isn't becoming ready, Kubernetes tries to solve it by restarting the underlying container and hoping to achieve its readiness eventually. If this isn't working, Kubernetes will eventually back off to restart the container, and the Kubernetes resource remains non-functional.

Structure

First, check that the Grafana alert rule responsible for tracking non-ready containers is in an 'okay' state. As soon as one of the containers is crash looping, caused by the crash loop attack, the Grafana alert rule should fire and escalate it to your on-call team.

Solution Sketch

  • Kubernetes liveness, readiness, and startup probes
Crash loop
Harden Observability
Restart
Grafana
Kubernetes

Grafana alert rules

Kubernetes cluster

Kubernetes pods

Start Using Steadybit Today

Get started with Steadybit, and you’ll get access to all of our features to discover the full power of Steadybit. Available for SaaS and on-prem!

Are you unsure where to begin?

No worries, our reliability experts are here to help: book a demo with them!

Steadybit logoResilience Hub
Try Steadybit
© 2024 Steadybit GmbH. All rights reserved.
Twitter iconLinkedIn iconGitHub icon