alerting

Modern Observability - Scalable Observability with the LGTM Stack: Harnessing the Power of Loki, Grafana, Tempo, and Mimir

In today's rapidly evolving technology landscape, robust and scalable observability is crucial for maintaining reliable, high-performance systems. This talk delves into an advanced observability stack, known as the LGTM stack—Loki, Grafana, Tempo, and Mimir—that surpasses the limitations of traditional Prometheus-based solutions while maintaining compatibility with existing query languages and alert configurations. We will explore the core principles of OpenTelemetry, its seamless integration into the LGTM stack, and the significant benefits it brings to monitoring and tracing capabilities.

Understanding Alerting - How to come up with a good enough alerting strategy

Have you ever considered that your incident from last night might actually be something very positive? No? Then you should watch this talk! I'm going to introduce you to some concepts in the domain of resiliency engineering and then have a look into how you can build an alerting strategy that doesn't page you unnecessarily at 3am.