Lightning Talk: Managing Silences - Provisioning Silences into Alertmanager - Théo Brigitte

Presenters Théo Brigitte Source PromCon EU 2025 Silencing the Noise: Automating Alert Management with a Git-Based Operator 🚀 Let’s be honest – managing alerts across a sprawling Kubernetes infrastructure can feel like herding cats. 😼 One rogue deployment, a misconfigured metric, and suddenly everyone is getting paged. 🚨 At Giants Forum, we faced this challenge head-on, and the solution involved a surprisingly elegant blend of Git, Kubernetes, and a little bit of operator magic. ✨ ...

December 19, 2025 · 3 min

Lightning Talk: Alert Quorum Universal Aggregator - AQUA - Mirek Chocholous

Presenters Mirek Chocholous Source PromCon EU 2025 Taming the Alert Beast: Scaling Alert Management in the Modern World 🚀 Hey tech enthusiasts! 👋 Today, we’re diving into a surprisingly complex challenge faced by many monitoring teams: Alert Manager chaos. We’re going to explore how to streamline notifications, reduce alert fatigue, and ultimately, keep your team – and your boss – happy. Let’s unpack this with Merrick from CDN77, who highlighted some critical issues and a potential solution. ...

December 19, 2025 · 3 min

Lightning Talk: Schema Inference - Nicolas Takashi & Arthur Sens

Presenters Nicolas Takashi Arthur Sens Source PromCon EU 2025 Decoding Telemetry: Building Schemas for a Data-Driven World 🚀 Let’s be honest, the world of observability and telemetry can feel… overwhelming. A deluge of metrics, logs, and spans – where do you even start when trying to understand what’s happening in your systems? This presentation tackled a surprisingly elegant solution: engineering telemetry like you engineer a pipeline, using schemas to unlock clarity and drive adoption. 💡 ...

December 19, 2025 · 3 min

Lightning Talk: Prometheus Rules management and validation - Hervé Nicol

Presenters Hervé Nicol Source PromCon EU 2025 🚀 Mastering Prometheus Rules: A Deep Dive into Rule Management 💡 Let’s be honest, Prometheus rules are essential for any serious monitoring operation. But let’s also be real – managing them can feel like navigating a complex maze. How do you keep them consistent? How do you ensure they’re actually working as intended? At James Swam, we’ve been wrestling with these questions, and we’ve found some surprisingly effective strategies. Let’s break down our approach and explore some best practices. ...

December 19, 2025 · 3 min

Lightning Talk: Scrape Trolley Dilemma - Bartek Protka

Presenters Bartek Protka Source PromCon EU 2025 🤖 The Trolley Problem of Metrics: Navigating Collection Resiliency 🚀 Hey everyone! 👋 Let’s talk about a surprisingly relevant thought experiment that’s impacting how we manage our data pipelines – the Trolley Problem. It might sound a bit heavy, but it perfectly illustrates a critical challenge in collection resiliency and how we’re tackling it. 🤯 The Trolley Problem in Data Collection The classic Trolley Problem asks: if you can divert a runaway trolley to save five people, but doing so will kill one, what do you do? In the world of data collection, we’re facing a similar dilemma. Imagine your data collection system – think Prometheus, OpenTelemetry collectors, or any other metric pipeline – is nearing its memory limit. You’re staring down a massive influx of data, potentially high-cardinality data (meaning lots of unique values), and you know that processing it could lead to a crash. ...

December 19, 2025 · 3 min