Correlation Over Collection: A Layered Observability Framework | Khushboo Nigam | Conf42 SRE 2026

Presenters: Khushboo Nigam · Source: Conf42 SRE 2026
Unraveling the Chaos: Why Correlating Telemetry is the New Superpower for Cloud-Native Observability ✨
Hey tech enthusiasts! Khushboo Nigam, a Cloud Architect specializing in observability for cloud-native systems, recently shed light on a pervasive challenge facing SRE teams today. It’s not about collecting more data; it’s about making sense of the massive amounts of telemetry modern distributed systems generate. The core message is clear: Correlation over Collection. ...

March 19, 2026 · 5 min

FLEX2 Analytics in AMBR250 Upstream Biologics Workflows | Amogha Tenneti | Conf42 SRE 2026

Presenters: Amogha Tenneti · Source: Conf42 SRE 2026
From Lab Bench to High-Throughput: How SRE Principles Revolutionized Biologics Automation 🚀🔬
Good morning, everyone! Amogha Tenneti here, and I’m thrilled to share a fascinating journey with you. We often associate Site Reliability Engineering (SRE) with the world of software, protecting our digital systems from outages and ensuring seamless user experiences. But what if I told you that the very same SRE principles can unlock incredible reliability, compliance, and scalability in a highly regulated, hands-on environment like a biological process development lab? ...

March 19, 2026 · 5 min

Human-Governed Automation Loops for AI at Planet Scale | Suganya Nagarajan | Conf42 SRE 2026

Presenters: Suganya Nagarajan · Source: Conf42 SRE 2026
🚀 Beyond Blind Automation: The Power of Human-Governed AI Loops
In the high-stakes world of Site Reliability Engineering (SRE), we face a recurring dilemma: as systems scale, manual interventions become a bottleneck, yet blind automation remains a dangerous liability. I am Suganya Nagarajan, an engineering manager with a decade of experience in large-scale distributed systems. Today, I want to share a framework to bridge this gap: Human-Governed Automation Loops (HAL). This approach ensures that our AI systems remain reliable, accountable, and safe, even as they operate at breakneck speeds. ...

March 19, 2026 · 4 min

Program Leadership in AI-Enabled Platform Systems | Sonali Galhotra | Conf42 SRE 2026

Presenters: Sonali Galhotra · Source: Conf42 SRE 2026
🌐 Beyond the Dashboard: Why Reliability is an Organizational System Problem
In the world of Site Reliability Engineering (SRE), we often obsess over uptime, latency, error budgets, and system telemetry. While these metrics are vital, they don’t tell the whole story. According to Sonali Galhotra, a leader at the intersection of technical program leadership and platform engineering, the most critical reliability signals don’t always appear on a monitoring dashboard. Instead, they emerge from how an organization structures itself and where it chooses to invest. ...

March 19, 2026 · 4 min

Reducing On-Call Pain in Hybrid Platforms | Shruthi Rajashekar | Conf42 SRE 2026

Presenters: Shruthi Rajashekar · Source: Conf42 SRE 2026
Unifying the Hybrid Cloud: How VM Service is Revolutionizing VM and Container Management 🚀
Hey tech enthusiasts! Shruthi Rajashekar, an engineering manager at Broadcom, is here to shed some light on a game-changer for hybrid cloud environments. For the past decade at VMware Broadcom, Shruthi has been instrumental in developing foundational technologies like vMotion and VM Service, bridging the gap between traditional virtualization and modern cloud-native infrastructure. Today, she’s diving deep into how VM Service, a VCF offering, can provide a unified control plane for virtual machines (VMs) and container-based workloads, and why this approach is critical for platforms demanding high availability and operational excellence. ...

March 19, 2026 · 6 min