Unlocking the Power of Proxyless gRPC in Service Meshes! 🚀
Hey tech enthusiasts! Ever felt like your application code is drowning in complex networking logic? We’ve all been there, right? The good news is, the world of service meshes is evolving, and today, we’re diving deep into a game-changer: proxyless gRPC within a service mesh architecture. Join us as we explore this exciting evolution, the crucial role of XDS, and how gRPC is leveraging these technologies for blazing-fast, performant systems.
What Exactly is a Service Mesh? 🤔
At its core, a service mesh is a dedicated infrastructure layer that handles managed connectivity between your microservices. Think of it as a sophisticated network of horizontally scalable microservices where the network infrastructure intelligently manages how these services talk to each other. The key here is that the application code stays blissfully unaware of this complex connection management – it all happens seamlessly under the hood.
The Challenges of Traditional Service Communication 🚧
Building and maintaining such a system isn’t without its hurdles. When services need to connect, several critical pieces of information are required:
- Service Discovery: How does a client know which service instances are alive and kicking, and what are their IP addresses?
- Smart Routing: We need to intelligently route requests, perhaps considering which services are co-located with the client for lower latency.
- Load Balancing: To prevent overwhelming servers, clients need feedback on server load.
- Failover: If a server goes down, automatic failover to a healthy instance is essential.
- Observability: Collecting metrics on latency and failure rates is vital for monitoring service health.
The overarching goal? To free application developers from the burden of complex networking logic, allowing them to focus purely on the business value their code delivers.
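To make that burden concrete, here is a purely illustrative Python sketch (hypothetical registry and addresses) of the kind of hand-rolled discovery-and-failover logic every client would otherwise carry:

```python
import random

# Hypothetical in-process "registry": address -> healthy flag.
# In real systems this data would come from a discovery service.
REGISTRY = {
    "10.0.0.1:8080": True,
    "10.0.0.2:8080": False,  # this instance is down
    "10.0.0.3:8080": True,
}

def pick_endpoint(registry):
    """Service discovery + failover by hand: filter out unhealthy
    instances, then pick one at random (naive load balancing)."""
    healthy = [addr for addr, ok in registry.items() if ok]
    if not healthy:
        raise RuntimeError("no healthy backends")
    return random.choice(healthy)

print(pick_endpoint(REGISTRY))  # never returns the unhealthy 10.0.0.2
```

Every client repeating this (plus retries, metrics, and TLS) is exactly the duplication a service mesh removes.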
The Rise of the Sidecar Proxy: A Major Leap Forward 💡
The first major breakthrough in addressing these challenges was the sidecar proxy pattern. Here’s how it works:
- A sidecar proxy runs alongside each application in its own container.
- It acts as an intermediary, handling all the intricate networking logic.
- These proxies operate at network layers L3 to L7, intercepting all incoming and outgoing traffic for the application.
Envoy stands out as a pioneering example of a sidecar proxy, and when combined with a control plane like Istio, it has become the de facto standard for many service mesh implementations today.
The Good and the Not-So-Good of Sidecars 👍👎
Advantages:
- Easy Management & Scaling: Deployed alongside applications, they are inherently easier to manage and scale.
- Single Tenancy: Proxies are dedicated to their application, aligning with the application’s lifecycle.
- Kubernetes Native: Seamless integration with Kubernetes streamlines deployment and management.
Drawbacks:
- Not Fully Transparent: Requires “injectors” to attach sidecars, potentially altering YAML files and adding overhead to the developer workflow.
- Upgrade Challenges: Upgrading proxies, especially for security vulnerabilities, can be a significant pain point.
These limitations highlighted the need for dynamic configuration and reduced operational overhead.
Enter XDS: The Dynamic Configuration Game-Changer 🌟
This is precisely where the XDS API steps in to revolutionize service mesh configurations.
- Envoy, initially designed for static configurations, evolved to embrace dynamic configuration through XDS APIs.
- The XDS API allows the control plane to dynamically configure Envoy proxies.
- Envoy, sitting alongside your application, intercepts traffic and is configured via XDS for features like load balancing, traffic management, connection security, and header handling.
The Heart of the Service Mesh: Data Plane & Control Plane ❤️
At the core of any service mesh are two key components:
- Data Plane: This is where the actual network traffic flows, managed by the control plane.
- Control Plane: The “brain” that orchestrates and controls all traffic flow.
By implementing XDS APIs, gRPC can directly subscribe to and receive configuration updates from the control plane. This opens up a powerful new possibility…
The Exciting Leap to Proxyless gRPC! 🤩
What if your application could speak the XDS language directly, bypassing the need for a sidecar proxy altogether? This is the essence of proxyless gRPC!
- gRPC natively implements the XDS client within its library.
- This allows gRPC clients and servers to directly fetch configurations from the control plane using XDS.
- The result? Elimination of the sidecar proxy and an extra network hop, leading to significant performance gains.
How to Enable XDS in Your gRPC Client 🛠️
Enabling XDS for your gRPC client is surprisingly straightforward:
- XDS Name Resolution: By default, gRPC resolves target names via DNS. To switch to XDS, prepend the xds:/// scheme to your target URI (for example, xds:///greeter-service). Beyond pointing gRPC at a bootstrap file (covered next), no code changes or complex configuration are needed.
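Under the hood, gRPC chooses its name resolver based on the target's scheme. A stdlib-only sketch of that dispatch (a toy registry, not gRPC's real resolver API):

```python
from urllib.parse import urlparse

# Toy resolver registry keyed by URI scheme, mimicking how gRPC
# dispatches "dns:///..." vs "xds:///..." targets (illustration only).
RESOLVERS = {
    "dns": lambda name: f"resolve {name} via DNS lookup",
    "xds": lambda name: f"resolve {name} via the xDS control plane",
}

def resolve(target):
    parsed = urlparse(target)
    scheme = parsed.scheme or "dns"          # gRPC defaults to DNS
    name = parsed.path.lstrip("/") or parsed.netloc
    return RESOLVERS[scheme](name)

print(resolve("xds:///greeter-service"))
print(resolve("dns:///greeter.example.com:50051"))
```

The application's call site stays identical; only the scheme in the target string changes.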
Configuring the XDS Control Plane ⚙️
But how does your application know where to find the XDS control plane? This is handled through a bootstrap configuration file:
```json
{
  "xds_servers": [
    {
      "server_uri": "dns:///control-plane.example.com:853",
      "channel_creds": [{ "type": "insecure" }],
      "server_features": ["xds_v3"]
    }
  ],
  "node": {
    "id": "my-app-node-1",
    "cluster": "my-application",
    "metadata": { "environment": "production" }
  }
}
```

(gRPC expects this file as JSON; the channel_creds entry uses insecure here for illustration only, and production deployments should use TLS-based or google_default credentials instead.)
This configuration specifies the control plane’s URI, credentials, supported features, and a unique node ID for your application.
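gRPC locates this file through the GRPC_XDS_BOOTSTRAP environment variable. A small stdlib sketch of writing and sanity-checking such a file by hand (field names follow the gRPC JSON bootstrap format; all values are made up, and the real parsing happens inside the gRPC library):

```python
import json
import os
import tempfile

# Minimal bootstrap document, same shape as the example above.
bootstrap = {
    "xds_servers": [{
        "server_uri": "dns:///control-plane.example.com:853",
        "channel_creds": [{"type": "insecure"}],
        "server_features": ["xds_v3"],
    }],
    "node": {"id": "my-app-node-1", "cluster": "my-application"},
}

# Write it where a gRPC process would look for it.
path = os.path.join(tempfile.mkdtemp(), "bootstrap.json")
with open(path, "w") as f:
    json.dump(bootstrap, f)
os.environ["GRPC_XDS_BOOTSTRAP"] = path  # how gRPC finds the file

# Sanity-check the fields the xDS client needs at startup.
with open(os.environ["GRPC_XDS_BOOTSTRAP"]) as f:
    loaded = json.load(f)
assert loaded["xds_servers"][0]["server_uri"].startswith("dns:///")
assert loaded["node"]["id"]
print("bootstrap OK:", loaded["node"]["id"])
```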
Diving Deeper: The XDS API Structure 🏊
The XDS API consists of two main parts:
- Data Model: Defines the structure of configuration resources and their fields. This is where the intelligence lies!
- Transport Protocol: Primarily a publish-subscribe mechanism (using gRPC’s bidirectional streaming) for the gRPC client to subscribe to the control plane. The content of the resources is crucial here.
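The transport side can be pictured as a tiny in-process publish-subscribe loop: the client subscribes to resource names, the control plane streams back versioned resources, and the client acknowledges each version. This is a simulation of the idea, not the actual wire protocol:

```python
class FakeControlPlane:
    """Stands in for the xDS server side of the bidirectional stream."""
    def __init__(self):
        self.version = 0
        self.resources = {}

    def publish(self, name, config):
        self.version += 1
        self.resources[name] = (self.version, config)

class XdsWatcher:
    """Stands in for gRPC's in-library xDS client: subscribe,
    receive versioned updates, and ACK each one."""
    def __init__(self, plane):
        self.plane = plane
        self.acked = {}   # resource name -> last ACKed version
        self.config = {}  # currently applied configuration

    def watch(self, name):
        version, config = self.plane.resources[name]
        if self.acked.get(name) != version:
            self.config[name] = config   # apply the update
            self.acked[name] = version   # ACK this version
        return self.config[name]

plane = FakeControlPlane()
plane.publish("listener/greeter", {"route": "greeter-route"})
watcher = XdsWatcher(plane)
print(watcher.watch("listener/greeter"))

plane.publish("listener/greeter", {"route": "greeter-route-v2"})
print(watcher.watch("listener/greeter"))  # picks up the new version
```

The real protocol runs this exchange over a gRPC bidirectional stream, but the version-then-ACK rhythm is the same.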
The Four Pillars of XDS Resources 🏛️
There are four primary resource types that are fundamental to XDS:
- Listener:
  - The entry point for XDS configuration.
  - Represents a virtual service that needs to be resolved to physical endpoints.
  - Can include L7 filter configurations, like attaching authentication tokens or custom HTTP headers.
  - Points to a Route resource.
- Route:
  - Contains a mapping table for requests.
  - Routes requests based on headers and other request fields to the appropriate service.
  - Supports advanced features like traffic splitting (e.g., for canary deployments) and configurations for retries and timeouts.
- Cluster:
  - Represents an actual deployment of a service; you can have different clusters for different versions of a service.
  - This is where load balancing policies are defined for endpoints within the cluster.
  - Manages endpoints distributed across different localities.
- Endpoint:
  - Represents a single server instance (e.g., a Kubernetes pod).
  - Endpoints are grouped within a cluster and can span multiple localities.
  - Includes health status; localities can be prioritized, and endpoints within a locality can be weighted for rotation, based on client location and desired load distribution.
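Tying the four resources together: a request conceptually walks Listener → Route → Cluster → Endpoint. A stdlib sketch of that chain (resource contents are made up for illustration):

```python
from dataclasses import dataclass

@dataclass
class Endpoint:
    address: str
    healthy: bool = True

@dataclass
class Cluster:
    name: str
    lb_policy: str
    endpoints: list

@dataclass
class Route:
    # method-prefix -> cluster name (stand-in for real match rules)
    table: dict

@dataclass
class Listener:
    route: Route

clusters = {
    "greeter-v1": Cluster("greeter-v1", "round_robin",
                          [Endpoint("10.0.0.1:8080")]),
    "greeter-v2": Cluster("greeter-v2", "round_robin",
                          [Endpoint("10.0.0.9:8080")]),
}
listener = Listener(Route({"/greeter.Greeter/": "greeter-v1",
                           "/beta.": "greeter-v2"}))

def route_rpc(listener, clusters, method):
    """Walk Listener -> Route -> Cluster -> first healthy Endpoint."""
    for prefix, cluster_name in listener.route.table.items():
        if method.startswith(prefix):
            cluster = clusters[cluster_name]
            return next(e.address for e in cluster.endpoints if e.healthy)
    raise LookupError("no route for " + method)

print(route_rpc(listener, clusters, "/greeter.Greeter/SayHello"))
```

Note how versioned clusters make traffic splitting natural: the Route table just maps different matches to different cluster names.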
How Load Balancing Works with XDS ⚖️
The process of load balancing with XDS is a sophisticated, multi-layered approach:
- Name Resolution: The XDS name resolver component translates the xds:/// target into resource definitions.
- Streaming Watch: The XDS client within gRPC establishes a bidirectional stream with the XDS server to watch for resource updates.
- Load Balancer Tree Construction: Based on the received resource definitions, a tree of load balancers is dynamically created.
- Priority Load Balancing: Handles the priority of localities based on client location.
- Weighted Round Robin: Distributes traffic among localities based on their assigned weights.
- Endpoint Load Balancing: Within a locality, policies like “pick first” or “round robin” are applied to individual endpoints, as defined in the Cluster resource.
- Connection Establishment: Connections are established from the bottom of the tree upwards.
- Status Propagation: The health status of endpoints and load balancers is propagated up the tree, ensuring that parent nodes are aware of their children’s readiness.
- RPC Routing: Once the load balancing tree is ready, RPCs are routed through it: the route definition selects a cluster’s load balancing policy, and each request travels down the tree to a healthy endpoint.
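The priority, weighted-round-robin, and endpoint-level stages above can be simulated with a tiny load balancer tree: keep only the highest-priority localities, spread traffic by locality weight, then rotate endpoints inside each locality. (Weights, priorities, and addresses are assumed for illustration.)

```python
import itertools

# locality -> (priority, weight, endpoints); lower priority number wins.
LOCALITIES = {
    "us-east1-a": (0, 2, ["10.0.1.1", "10.0.1.2"]),
    "us-east1-b": (0, 1, ["10.0.2.1"]),
    "us-west1-a": (1, 1, ["10.1.0.1"]),  # failover only
}

def build_schedule(localities):
    """Priority LB: keep only the best available priority.
    Weighted RR: repeat each kept locality by its weight.
    Endpoint RR: cycle through endpoints within each locality."""
    best = min(prio for prio, _, _ in localities.values())
    schedule, cursors = [], {}
    for name, (prio, weight, endpoints) in localities.items():
        if prio == best:
            schedule += [name] * weight
            cursors[name] = itertools.cycle(endpoints)
    return schedule, cursors

schedule, cursors = build_schedule(LOCALITIES)
picks = [next(cursors[name])
         for name in itertools.islice(itertools.cycle(schedule), 6)]
print(picks)
# ['10.0.1.1', '10.0.1.2', '10.0.2.1', '10.0.1.1', '10.0.1.2', '10.0.2.1']
```

us-east1-a receives two-thirds of the traffic (weight 2 vs. 1), and us-west1-a is untouched until every priority-0 locality fails, mirroring the priority and weighted stages of the real tree.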
This elegant system ensures that data plane connections and RPCs are dynamically configured by the control plane, offering unparalleled flexibility and performance.
Q&A Highlights: Clarifying the Nuances 🗣️
The session included some insightful questions that shed further light on XDS:
- XDS vs. DNS Speed: While both perform lookups, XDS isn’t necessarily faster than traditional DNS. DNS provides a global registry of IP addresses. XDS, however, operates within the service mesh, enabling multi-level resolution and dynamic configuration management by talking to the control plane for resource definitions like listeners and endpoints. It’s a different paradigm focused on service mesh internal routing rather than global IP discovery.
- XDS Server Location: The XDS server (control plane) is a separate component from your application or Kubernetes cluster. While Envoy proxies might run as sidecars within your cluster, the control plane (like Istio) is an external entity that communicates with these proxies (or directly with gRPC clients in the proxyless model) via the XDS API.
- XDS Generality: The XDS API is a generic specification and is not tied to a specific control plane like Istio. Any control plane can implement XDS to configure data planes.
- Comparison with eBPF Solutions: While some solutions like Cilium leverage eBPF for service mesh functionalities without sidecars, the fundamental approach differs. Cilium uses eBPF for kernel-level routing decisions, whereas XDS (in its Envoy or proxyless gRPC implementations) relies on proxy configurations or in-library logic driven by a control plane. The discussion on these advanced topics was deferred for a deeper dive post-session.
The Future is Proxyless! ✨
The evolution towards proxyless gRPC with XDS represents a significant stride in making service meshes more efficient and developer-friendly. By eliminating sidecar proxies, we unlock lower latency, reduced resource consumption, and a simpler operational model. This is a trend we’ll be watching closely, and we encourage you to explore how you can leverage this powerful combination in your own microservice architectures!
Stay tuned for more insights into the ever-evolving world of cloud-native technologies!