Correlation Over Collection: A Layered Observability Framework | Khushboo Nigam | Conf42 SRE 2026

Presenters Khushboo Nigam Source Conf42 SRE 2026 Unraveling the Chaos: Why Correlating Telemetry is the New Superpower for Cloud-Native Observability ✨ Hey tech enthusiasts! Khushboo Nigam, a Cloud Architect specializing in observability for cloud-native systems, recently shed light on a pervasive challenge facing SRE teams today. It’s not about collecting more data; it’s about making sense of the massive amounts of telemetry modern distributed systems generate. The core message is clear: Correlation over Collection. ...

March 19, 2026 · 5 min

FLEX2 Analytics in AMBR250 Upstream Biologics Workflows | Amogha Tenneti | Conf42 SRE 2026

Presenters Amogha Tenneti Source Conf42 SRE 2026 From Lab Bench to High-Throughput: How SRE Principles Revolutionized Biologics Automation 🚀🔬 Good morning, everyone! Amogha Tenneti here, and I’m thrilled to share a fascinating journey with you. We often associate Site Reliability Engineering (SRE) with the world of software, protecting our digital systems from outages and ensuring seamless user experiences. But what if I told you that the very same SRE principles can unlock incredible reliability, compliance, and scalability in a highly regulated, hands-on environment like a biological process development lab? ...

March 19, 2026 · 5 min

Human-Governed Automation Loops for AI at Planet Scale | Suganya Nagarajan | Conf42 SRE 2026

Presenters Suganya Nagarajan Source Conf42 SRE 2026 🚀 Beyond Blind Automation: The Power of Human-Governed AI Loops In the high-stakes world of Site Reliability Engineering (SRE), we face a recurring dilemma: as systems scale, manual interventions become a bottleneck, yet blind automation remains a dangerous liability. I am Suganya Nagarajan, an engineering manager with a decade of experience in large-scale distributed systems. Today, I want to share a framework to bridge this gap: Human-Governed Automation Loops (HAL). This approach ensures that our AI systems remain reliable, accountable, and safe, even as they operate at breakneck speeds. ...

March 19, 2026 · 4 min

Reducing On-Call Pain in Hybrid Platforms | Shruthi Rajashekar | Conf42 SRE 2026

Presenters Shruthi Rajashekar Source Conf42 SRE 2026 Unifying the Hybrid Cloud: How VM Service is Revolutionizing VM and Container Management 🚀 Hey tech enthusiasts! Shruthi Rajashekar, an engineering manager at Broadcom, is here to shed some light on a game-changer for hybrid cloud environments. For the past decade at VMware Broadcom, Shruthi has been instrumental in developing foundational technologies like vMotion and VM service, bridging the gap between traditional virtualization and modern cloud-native infrastructure. Today, she’s diving deep into how a unified control plane for virtual machines (VMs) and container-based workloads can be achieved using VM service, a VCF offering, and why this approach is absolutely critical for platforms demanding high availability and operational excellence. ...

March 19, 2026 · 6 min

The Failures You Don’t See on Dashboards | Abhimanyu Narwal | Conf42 SRE 2026

Presenters Abhimanyu Narwal Source Conf42 SRE 2026 The Silent Killers: Unmasking Failures Beyond Your Dashboards 🕵️‍♀️ We’ve all been there. Alarms blaring, graphs spiking, the adrenaline rush of an incident. As engineers, we excel at fighting outages, diving into the chaos, and emerging victorious. But what if the most expensive reliability failures aren’t the ones that make noise? What if they’re the quiet, insidious ones that slow us down, all while our dashboards gleam with a reassuring green? ...

March 19, 2026 · 5 min