Load Balancing and Service Mesh
Proxy & Load Balancer
F5® Distributed Cloud's load balancing and proxy capabilities let you control the flow of application and API traffic between services, to the Internet, and from clients on the Internet. In F5® Distributed Cloud Mesh (Mesh), a load balancer is a proxy: an entity that terminates incoming TCP connections or UDP streams and initiates new connections from the proxy toward the server. The server is referred to as an endpoint, and a service is usually offered by a collection (or set) of endpoints. Clients and servers may be users, machines, applications, or microservices.
There are many reasons for a proxy to be deployed between a client and a server:
- Connecting clients and servers that are on isolated networks
- Application-level (Layer 7) routing: load balancing, API request routing, and more
- Security: URL filtering, application firewall, application policy, TLS offload, and more
- Network and Application resilience, testing, and fault injection
- Content caching
- TCP optimization

Figure: F5 Distributed Cloud Services Proxy
The proxy presents a Virtual IP (VIP) of the server to the client. The client has to discover the VIP, which is usually done by having the client's DNS server answer with the VIP when the client performs name resolution. Clients may also use other service discovery mechanisms, for example K8s discovery, Consul, Route53, and more. The proxy's control plane has to make sure that the client can discover the server's address, and this is achieved by advertising the VIP. Advertising involves allocating a VIP and publishing the name-to-VIP mapping in the service registry (for example, DNS, Route53, or Consul). Similarly, the proxy has to discover the IP addresses of the endpoints associated with the server, which is usually achieved by performing endpoint discovery using DNS, Consul, or K8s.
Forward Proxy
A forward proxy is usually configured when connecting an inside private network with an outside public network. There is no VIP in a forward proxy; traffic is usually routed to the outside network using a default route. You can configure routing between the two networks using a network connector. DNS is used to discover server/endpoint IP addresses. One characteristic of the forward proxy is that the IP address of the server/endpoints is the same in the inside network and the outside network. This allows you to solve problems such as implementing URL filtering for HTTP traffic, performing TLS inspection, filtering based on DNS names, and much more.

Figure: Forward Proxy
Forward proxy configuration is covered in the Networking section and is done using a Network Connector and Network Firewall.
Reverse proxy
A reverse proxy is used when the server is in the inside network and the clients are in the outside network; there is no reachability to the server unless a proxy is deployed. To provide connectivity to the server, the proxy is configured as a "reverse" proxy and a VIP is assigned to the server in the outside network. This allows you to solve problems such as controlling inbound traffic and implementing security controls to prevent attacks on the server.

Figure: Reverse Proxy
Subsequent discussions do not distinguish between inside and outside networks, because a reverse proxy can also be deployed within the same network. This allows you to solve the same problems, such as controlling traffic flow and performing load balancing based on constraints.

Figure: Reverse Proxy in the Same Network
In Mesh, a load balancer is the same as a reverse proxy and is configured with multiple endpoints for a server. It distributes the load of incoming client requests based on various scheduling algorithms, for example round robin, weighted least request, random, ring-hash, and more. It selects only the active endpoints, based on explicit health-check probes or implicit health checks (deductions based on the latency, error rate, and other properties of the responses); a short sketch of this selection follows the list below. This allows you to solve multiple problems:
- Set up routing rules to control the flow of application and API traffic
- Perform authentication and authorization
- Simplify configuration of service properties like circuit-breakers, retries, timeouts, and more
- Set up traffic split to endpoints based on A/B testing, canary or staging rollouts, and more
- Implement security controls and application policies to secure endpoints from attacks
- Provide comprehensive observability and tracing capabilities for troubleshooting
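The endpoint selection described above can be illustrated with a short Python sketch (illustrative only, not product code, with made-up endpoint addresses): endpoints that fail health checks are excluded from the pool, and the remaining ones are picked round-robin or weighted-randomly.

    import random

    class Endpoint:
        def __init__(self, address, weight=1, healthy=True):
            self.address = address
            self.weight = weight
            self.healthy = healthy      # set by explicit probes or implicit (passive) checks

    def healthy_only(endpoints):
        return [e for e in endpoints if e.healthy]

    def round_robin(endpoints):
        """Yield healthy endpoints in rotating order, re-checking health each time."""
        i = 0
        while True:
            pool = healthy_only(endpoints)
            yield pool[i % len(pool)]
            i += 1

    def weighted_random(endpoints):
        """Pick one healthy endpoint, biased by its configured weight."""
        pool = healthy_only(endpoints)
        return random.choices(pool, weights=[e.weight for e in pool], k=1)[0]

    endpoints = [Endpoint("10.0.0.1:8080", weight=3),
                 Endpoint("10.0.0.2:8080", weight=1),
                 Endpoint("10.0.0.3:8080", healthy=False)]   # failed health checks: never selected

    rr = round_robin(endpoints)
    print(next(rr).address, next(rr).address)   # alternates over the two healthy endpoints
    print(weighted_random(endpoints).address)   # 10.0.0.1 about 3x as often as 10.0.0.2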
A reverse proxy in F5 Distributed Cloud Services is called a "virtual host", and a virtual host needs to be assigned a VIP. It has a list of "domains" that map to that VIP. The virtual host supports the following types of reverse proxies:
- TCP Proxy: When an incoming TCP connection is sent to a selected outgoing server, it is called a "TCP proxy". Each TCP proxy requires a combination of a VIP and a TCP port.
- HTTP Proxy: When an incoming TCP connection uses the HTTP protocol, the virtual host can be configured to parse HTTP headers and provide the functionality of an "HTTP proxy". Based on the Host header and the URL requested, traffic can be routed to different sets of endpoint servers. Selecting a set of endpoints based on HTTP headers and URLs is called "application routing" and the rules are called "routes". A set of endpoints is called a "cluster". A given virtual host is identified by the Host header, which must contain a domain that is configured in the virtual host. Hence, a (VIP, TCP port) combination can be shared by multiple virtual hosts that are configured as HTTP proxies.
- HTTPS Proxy: When an incoming TCP connection is encrypted using TLS (HTTPS), the virtual host can be configured with TLS parameters that are used to decrypt the traffic and provide the functionality of an "HTTPS proxy". All the traffic routing and control features provided by the HTTP proxy are also available here.
- TCP Proxy with SNI: When an incoming stream is TLS but the virtual host does not have the TLS parameters to decrypt it, the virtual host can still use the SNI information in the TLS handshake to match the host against the domains configured in the virtual host. Once the virtual host is matched, the incoming TCP stream is sent as-is to the selected server endpoint. This is called "TCP proxy with SNI". In the case of "HTTPS proxy" and "TCP proxy with SNI", the (VIP, port) combination can be shared (see the sketch after this list).
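As a rough illustration of why HTTP/HTTPS and SNI-based proxies can share a (VIP, port), the sketch below (hypothetical domain and virtual host names) selects a virtual host from the HTTP Host header or the TLS SNI value:

    # Hypothetical domain-to-virtual-host table; several virtual hosts share one (VIP, port).
    VIRTUAL_HOSTS = {
        "app.example.com": "vh-app",
        "api.example.com": "vh-api",
    }

    def select_virtual_host(name):
        """`name` is the HTTP Host header (HTTP/HTTPS proxy) or the TLS SNI (TCP proxy with SNI)."""
        return VIRTUAL_HOSTS.get(name.lower())

    print(select_virtual_host("app.example.com"))    # -> vh-app
    print(select_virtual_host("API.example.com"))    # -> vh-api (case-insensitive match)
    print(select_virtual_host("other.example.com"))  # -> None, no virtual host claims this name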
Globally Distributed Load Balancer
Typically, to provide reachability for distributed applications or services, some kind of network connectivity and routing across the sites is needed so that the client can route to the VIP of the server. This creates all kinds of challenges from the network point of view: either private connectivity needs to be set up from the client network to the server network, or the VIP has to be exposed to the public Internet.
F5 Distributed Cloud Services provide a unique solution with their distributed proxy implementation and solve more than the problem described above:
- Security & Connectivity: Using the F5 Distributed Cloud fabric, we create private and secure connectivity across sites and then build a distributed proxy on top of this fabric. This gives the capability of application routing without network routing, providing zero trust and improved security.
- Globally Distributed Application Routing & Control: Since the proxy is distributed and the health of all the endpoints is available across multiple sites, routing and traffic control decisions can now be made based on the actual health of individual endpoints across all the sites and not just the proxies front-ending the endpoints. This is especially useful for high-traffic web services.
As a result, we recommend that sites be configured with isolated networks and that applications be connected across Sites, via the Fabric, using virtual host proxies that expose only the services that need reachability.
In the diagram below, the proxy is distributed across two sites, an ingress site and an egress site. In the degenerate case, the ingress and egress can be the same site.

Figure: Globally Distributed Load Balancer
Ingress
The set of ingress sites is decided by the "advertise policy". In the most common case, the VIP is automatically picked by the system to be the interface IP of the site; you can also specify an IP address to be used as the VIP in the advertise policy. The advertise policy also specifies the (protocol, port) on which the service is available. If the site has multiple networks, the network can also be specified for the VIP selection.
Finally, the VIP is published into the service registry mechanism using the service discovery configuration and method of the site.
The client discovers the VIP for a service and initiates a connection. This connection terminates on one of the nodes in the ingress site. Depending on the load balancing strategy and the policy configured for the cluster, the ingress proxy chooses an (egress site, egress endpoint) pair; the policy could be, for example, nearest site or local only. It then initiates a connection to the selected egress site and egress endpoint. In the case of an HTTP or HTTPS proxy, a pre-existing, persistent connection between the ingress and egress sites is reused for higher performance.
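The following sketch illustrates, under simplifying assumptions (a static table of healthy endpoints per egress site and a per-site distance metric, which the real system maintains through its distributed control plane), how an ingress proxy might pick an (egress site, endpoint) pair for the "nearest site" and "local only" policies:

    # Hypothetical control-plane data: healthy endpoints per egress site and a distance metric.
    HEALTHY = {"site-paris": ["10.1.0.5:80"], "site-nyc": ["10.2.0.7:80", "10.2.0.8:80"]}
    DISTANCE_MS = {"site-paris": 5, "site-nyc": 80}    # e.g. measured latency from the ingress site

    def pick(policy, local_site):
        if policy == "local-only":
            candidate_sites = [local_site]
        else:   # "nearest-site": closest site first, fall back to the next one if it has no endpoints
            candidate_sites = sorted(HEALTHY, key=lambda site: DISTANCE_MS[site])
        for site in candidate_sites:
            endpoints = HEALTHY.get(site, [])
            if endpoints:
                return site, endpoints[0]
        return None                                    # no healthy endpoint anywhere

    print(pick("nearest-site", "site-paris"))   # ('site-paris', '10.1.0.5:80')
    print(pick("local-only", "site-nyc"))       # ('site-nyc', '10.2.0.7:80')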
Egress
The set of egress sites is decided by the endpoint configuration. The endpoint can specify sites and a discovery method to discover endpoint servers on a given set of egress sites. The endpoint discovery method can be configured differently for each site.
Once the endpoints are discovered, the egress proxy automatically starts health checks, and if it determines that the service is healthy, it propagates the egress site and endpoint information to all ingress sites using F5 Distributed Cloud's distributed control plane.
Configuring Load Balancer
A virtual host can be configured as a front end or a load balancer for a single service or for multiple services and APIs. Every virtual host has a set of sites "where" consumers are present and a set of sites "where" providers of the service or application are present. The consumers can also be configured to be present on the Internet.
The configuration of a virtual host object requires you to configure, at a minimum, the following objects:
- advertise-policy
- routes
- clusters
- endpoints
The presence of consumers is configured using the "advertise-policy", and the presence of endpoint servers is configured using "routes", "clusters", and "endpoints".
Configuring Virtual Host
The virtual host is the primary configuration for a proxy and can only be configured as a reverse proxy.
The following are key items that need to be configured for the virtual host:
- Domains: A list of the DNS names that clients use to resolve the VIP for the virtual host. The names can also be specified as a partial match, for example *.foo.com. These names have to exactly match the names on the certificates and the Host headers. The match also includes the port number for non-standard HTTP and HTTPS ports, for example *.foo.com:8080 (a matching sketch follows this list).
- Proxy Type: The following types of proxy configurations are supported:
  - UDP proxy: Incoming UDP packets are SNAT/DNAT to the outgoing interface. There is no way to share the (VIP, port) combination.
  - SMA proxy: This is a special mode defined for secret management access (SMA) by F5 Distributed Cloud services from one site to another site. The behavior is the same as a TCP proxy, where the advertise policy uses a special network to give access to F5 Distributed Cloud's built-in internal networks. For example, it can be used to provide access to Vault to fetch certificates for the virtual host.
- Routes: This is covered in detail in the Configuring Routes section.
- Advertise Policy: This is covered in detail in the Configuring Advertise Policy section.
- WAFs: This is an optional configuration and is covered in the Globally Distributed Load Balancer section.
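As a rough sketch of the domain matching described under Domains (hypothetical names; the product's exact matching rules may differ), wildcard entries such as *.foo.com and *.foo.com:8080 can be checked against a Host header like this:

    import fnmatch

    def host_matches(host_header, domains):
        """True if the Host header matches one of the configured (possibly wildcard) domains."""
        return any(fnmatch.fnmatch(host_header.lower(), d.lower()) for d in domains)

    domains = ["*.foo.com", "*.foo.com:8080"]
    print(host_matches("www.foo.com", domains))        # True  (standard port omitted from the header)
    print(host_matches("www.foo.com:8080", domains))   # True  (non-standard port listed explicitly)
    print(host_matches("www.foo.com:9090", domains))   # False (port 9090 is not configured)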
The virtual host supports TLS and mTLS on both upstream and downstream connections:
- Downstream (clients to virtual host): The virtual host supports both server-side TLS and mTLS. It needs server-side certificates whose principals cover all domains configured in the virtual host. If clients are required to use mTLS, it also needs the trusted CA certificates to validate them.
- Upstream (virtual host to endpoints): Since the virtual host acts as a client, it needs a trusted CA for TLS; for mTLS, it also needs a client certificate from a CA that the endpoints trust.
All of these are configured in the virtual host under TLS parameters, and the private keys can be protected using F5 Distributed Cloud's Secrets Management and Blindfold.
Details on all the parameters that can be configured for this object are covered in the API Specification.
Configuring Routes
When the virtual host is of type HTTP/HTTPS, the request can be further matched based on parameters such as URLs, headers, query parameters, HTTP methods, and more. Once the request is matched, it can be sent to a specific endpoint based on the routing configuration and policy rules. This capability of performing traffic routing also allows multiple services to be controlled using a single virtual host, or a single service to appear in multiple virtual hosts.
Rules that match requests and perform traffic routing to a destination endpoint are called routes (a minimal matching sketch follows the list below). Routes can be configured with many features and parameters:
- Route the request to a given set of endpoints
- Send redirect response
- Send a response
- Change the protocol
- Add or remove headers
- Timeouts, retries, and more
- Web Application Firewall
Even though a TCP proxy does not need routes, an empty route configuration is still required to keep the object model consistent.
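The following sketch (hypothetical route definitions, not the product's route schema) illustrates the matching idea: a request's method, path, and headers are evaluated against an ordered list of routes, and the first match decides the destination cluster.

    # Hypothetical routes, evaluated in order; the first match wins.
    ROUTES = [
        {"methods": {"GET"}, "prefix": "/api/", "headers": {"x-env": "staging"}, "cluster": "api-staging"},
        {"methods": {"GET", "POST"}, "prefix": "/api/", "headers": {}, "cluster": "api-prod"},
        {"methods": set(), "prefix": "/", "headers": {}, "cluster": "web-default"},   # catch-all
    ]

    def match_route(method, path, headers):
        for route in ROUTES:
            if route["methods"] and method not in route["methods"]:
                continue
            if not path.startswith(route["prefix"]):
                continue
            if any(headers.get(k) != v for k, v in route["headers"].items()):
                continue
            return route["cluster"]
        return None

    print(match_route("GET", "/api/items", {"x-env": "staging"}))   # api-staging
    print(match_route("POST", "/api/items", {}))                    # api-prod
    print(match_route("GET", "/index.html", {}))                    # web-default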
Details on all the parameters that can be configured for this object are covered in the API Specification.
Configuring Clusters
Since "routes" direct traffic to endpoints (endpoints being the providers of a service), there is a configuration object called a cluster that represents a set (or group) of endpoints. Two things can be achieved by configuring a cluster object for a set of endpoints:
- Common configuration for the set of endpoints, for example parameters like health checks, circuit breakers, and more.
- Subsetting the endpoints into smaller groups, using labels on endpoints, to create multiple clusters with common configuration. This can be used for traffic routing, with "routes" directing traffic to one of these clusters based on version (canary testing), different features (A/B testing), and more (see the sketch after the list below).
Configuration of the following capabilities is possible for a cluster:
- Endpoint Selection based on labels
- Healthcheck mechanism selection
- Load-balancer Algorithm
- Circuit Breaker
- TLS parameters
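A small sketch of the label-based subsetting idea (hypothetical labels and addresses): one discovered endpoint set is split into subsets by label selectors, and each subset can then be used as a separate traffic destination.

    # One discovered endpoint set with hypothetical labels.
    ENDPOINTS = [
        {"addr": "10.0.1.10:9080", "labels": {"app": "reviews", "version": "v10"}},
        {"addr": "10.0.1.11:9080", "labels": {"app": "reviews", "version": "v10"}},
        {"addr": "10.0.1.12:9080", "labels": {"app": "reviews", "version": "v11"}},
    ]

    def subset(endpoints, selector):
        """Endpoints whose labels contain every key/value pair of the selector."""
        return [e for e in endpoints if all(e["labels"].get(k) == v for k, v in selector.items())]

    current = subset(ENDPOINTS, {"app": "reviews", "version": "v10"})
    canary = subset(ENDPOINTS, {"app": "reviews", "version": "v11"})
    print(len(current), len(canary))    # 2 1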
Details on all the parameters and how they can be configured for this object are covered in the API Specification.
Configuring Endpoints
An endpoint must be configured for the system to discover the IP address, port, and health of each individual endpoint that comprises a service. This is achieved by a combination of two things:
- "Where" the endpoint is located: This is configured by specifying a virtual network, an external network, a specific site, or a group of sites (virtual-site).
- "How" to discover it in that location: This is specified either by configuring a specific IP address and port for the endpoint, or by using a discovery method like DNS, K8s, or Consul. When a discovery method is specified, the service is selected based on the service name or a label selector expression.
Details on all the parameters and how they can be configured for this object are covered in the API Specification.
Configuring Advertise policy
Once the endpoints are configured, we need to specify the locations "where" this virtual host is available, which is achieved by configuring the advertise-policy object. It is possible to configure the advertisement on a given network (for example, the public Internet), on a particular site (to restrict the locations where the services are reachable), or on a virtual-site (a group of sites). The VIP is automatically assigned by the system based on the configuration of "where".
The value of "where" can be any one of the following:
- Virtual site: The configuration is applied to all sites selected by the virtual site, using the site-local network type on each site. The VIP is chosen from the HOST_IP of the outside interface.
- Virtual site, network type: The configuration is applied to all sites selected by the virtual site and to the given network type on those sites. The VIP is chosen from the HOST_IP of the interface in the given network type.
- Site, network type: The configuration is applied on the specific site and the virtual network of the given type. The VIP is chosen from the HOST_IP of the interface in the given network type.
- Virtual network: The configuration is applied to all sites "where" this network is present.
Usually, “where”, protocol, and port configuration are sufficient for advertising a VIP in a given location.
In addition, the following parameters may be configured:
- Explicit VIP configuration - this explicit VIP can be advertised using VRRP or BGP as an anycast VIP.
- TLS parameters such as protocol version, cipher suites, TLS certificates, trusted CA, and client certificate. For example, you may need a different certificate in one region (for example, China) than in the rest of the world.
Depending on the service discovery method configured for the site, the VIP will be registered in the service registry.
Details on all the parameters and how they can be configured for this object are covered in the API Specification.
Automatic Certificate Generation
In the case of an HTTPS load balancer with automatic TLS certificates, F5 Distributed Cloud generates certificates for the domains using the LetsEncrypt Automatic Certificate Management Environment (ACME) server. Automatic certificate generation is supported in either of the following two ways:
- Using delegated domains: Delegate domains to F5 Distributed Cloud, and it acts as the authoritative domain server for your domains.
- Using non-delegated domains: F5 Distributed Cloud creates a TXT record for the domain storing the ACME challenge coming from LetsEncrypt. Add this to your DNS records.
Using Delegated Domain
F5 Distributed Cloud acts as the ACME client and obtains the certificates as per the sequence shown in the diagram below.

Figure: Automatic Certificate Generation Sequence
The following is the list of activities for automatic certificate generation using delegated domains:
- F5 Distributed Cloud acts as the ACME client and creates a new order with the domain configured in the virtual host.
- LetsEncrypt issues a DNS challenge for a TXT record to be created under the domain with a specified text value. It also provides a nonce that F5 Distributed Cloud must sign with its private key.
- F5 Distributed Cloud adds the required TXT record in the delegated domain and verifies that the TXT record is resolvable.
  Note: This requires the parent DNS domain to be configured with an NS record pointing to the delegated name servers. For instructions, see the Delegate Domain guide.
- Once the record resolves, F5 Distributed Cloud notifies the LetsEncrypt CA that it is ready to finalize the validation.
- LetsEncrypt validates that the challenge is satisfied and verifies the signature on the nonce.
- F5 Distributed Cloud sends a certificate signing request asking the LetsEncrypt CA to issue a certificate for the specified domain.
- The LetsEncrypt CA verifies the signatures on the request and issues a certificate for the domain.
The certificates issued by the automatic generation process have a validity period of 90 days. F5 Distributed Cloud performs automatic renewal before the expiry date and obtains new certificates.
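For reference, the value that ends up in the _acme-challenge TXT record for a DNS-01 challenge is derived from the challenge token and the ACME account key thumbprint, as defined in RFC 8555. A minimal Python sketch with made-up values:

    import base64
    import hashlib

    def dns01_txt_value(token, account_key_thumbprint):
        """TXT value for _acme-challenge.<domain>: base64url(SHA-256(token "." thumbprint)), unpadded."""
        key_authorization = f"{token}.{account_key_thumbprint}".encode()
        digest = hashlib.sha256(key_authorization).digest()
        return base64.urlsafe_b64encode(digest).rstrip(b"=").decode()

    # Made-up token and account key thumbprint, purely for illustration:
    print(dns01_txt_value("evaGxfADs6pSRb2LAv9IZf17Dt3juxGJ-PCt92wr-oA",
                          "9jg46WB3rR_AHD-EBXdN7cBkH1WOu0tA3M9fm21mqTI"))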
Using Non-Delegated Domain
The following is the list of activities for automatic certificate generation using non-delegated domains:
- LetsEncrypt issues a DNS challenge for a TXT record to be created under the domain with a specified text value.
- F5 Distributed Cloud creates a TXT record for the domain storing the ACME challenge coming from LetsEncrypt.
- Once the record resolves, F5 Distributed Cloud notifies the LetsEncrypt CA that it is ready to finalize the validation.
  Note: This requires the parent DNS domain to be configured with a CNAME record using the TXT value generated by F5 Distributed Cloud. For instructions, see the HTTP Load Balancer guide.
- LetsEncrypt validates that the challenge is satisfied and verifies the signature on the nonce.
- F5 Distributed Cloud sends a certificate signing request asking the LetsEncrypt CA to issue a certificate for the specified domain.
- The LetsEncrypt CA verifies the signatures on the request and issues a certificate for the domain.
Service Discovery
Service discovery is the ability of a site to automatically discover service endpoints and to publish VIPs so that clients can discover the services represented by those VIPs. Automatic service discovery is needed because services can move, scale out, or be published independently and in an automated way, without coordinating with the proxy configuration.
The service discovery method can be configured per site or virtual site. There can be multiple methods configured for a given site and all the configured methods are used simultaneously to discover endpoints and publish VIPs.
DNS Method
The DNS method is the most widely used method for service discovery, and most networks support it. It simply uses the specified DNS server to resolve the names of the endpoints. There are two ways of publishing VIPs to DNS:
- APIs: The DNS server has to provide APIs to dynamically add DNS entries. While most legacy networks may not have this ability, many cloud environments like AWS, Azure, and GCP support DNS APIs.
- DNS delegation: A subdomain can be delegated to an F5 Distributed Cloud Site, and the site software answers DNS queries for any name under that subdomain that is assigned to a "virtual host".
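A minimal sketch of the DNS method for endpoint discovery, using only the standard resolver (the service name is hypothetical):

    import socket

    def discover_endpoints(service_name, port):
        """Resolve the service name and return the unique addresses as candidate endpoints."""
        infos = socket.getaddrinfo(service_name, port, proto=socket.IPPROTO_TCP)
        return sorted({sockaddr[0] for _family, _type, _proto, _canon, sockaddr in infos})

    print(discover_endpoints("example.com", 443))   # hypothetical service name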
K8S Method
When a Site is used as an ingress/egress gateway for a K8s cluster, the Kubernetes-native discovery method can be used. The Site software needs K8s credentials to use the K8s APIs to discover all services and endpoints. If a Site is external to the K8s cluster, then the services need to be configured as "node-port" services in K8s. For publishing VIPs, the site can automatically add DNS entries in the K8s cluster DNS.
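A sketch of K8s-based endpoint discovery, assuming the official kubernetes Python client, a kubeconfig with read access, and a hypothetical service name and namespace:

    # Assumes the official `kubernetes` client (pip install kubernetes); names are hypothetical.
    from kubernetes import client, config

    config.load_kube_config()                  # use load_incluster_config() when running inside a pod
    v1 = client.CoreV1Api()

    eps = v1.read_namespaced_endpoints(name="productpage", namespace="default")
    for subset in eps.subsets or []:
        for addr in subset.addresses or []:
            for port in subset.ports or []:
                print(f"{addr.ip}:{port.port}")    # candidate endpoint for the service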
Consul
If Consul is used as the service discovery method in the Site, F5 Distributed Cloud can use Consul to discover endpoints and publish VIPs. The Site needs Consul credentials to use the Consul API for discovering endpoints and publishing VIPs.
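A sketch of Consul-based endpoint discovery over Consul's HTTP API, assuming a reachable Consul agent and a hypothetical service name and ACL token:

    # Assumes a reachable Consul agent; the address, token, and service name are hypothetical.
    import requests

    CONSUL = "http://127.0.0.1:8500"
    HEADERS = {"X-Consul-Token": "<consul-acl-token>"}

    def discover(service_name):
        # Only instances passing their health checks are returned when passing=true.
        resp = requests.get(f"{CONSUL}/v1/health/service/{service_name}",
                            params={"passing": "true"}, headers=HEADERS, timeout=5)
        resp.raise_for_status()
        return [f'{e["Service"]["Address"]}:{e["Service"]["Port"]}' for e in resp.json()]

    print(discover("productpage"))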
Configuring Service Discovery
Service discovery requires per-site configuration, and depending on the type of method used, it needs different settings:
- DNS: Credentials for the DNS API endpoint in the cloud provider, or the customer has to delegate the DNS to the Site.
- Consul or K8s discovery: Credentials for K8s or Consul.
Usually, a site will have all its services using the same discovery method. In some situations, multiple service discovery methods may be configured on the same site, or there may be multiple configuration objects of the same method; for example, there may be multiple K8s clusters per site with different credentials. The service discovery configuration object uses the concept of "where" to apply the discovery configuration to sites.
Details on all the parameters and how they can be configured for this object are covered in the API Specification.
Service Mesh
Applications are typically composed of many services that are represented by more than one virtual host. It is desirable to group these virtual hosts in order to better understand service interaction and observability across the entire application. By collecting metrics from the various services and their client/server interactions, the system is able to create a service graph. In addition, using the access logs and machine learning, the system can identify API endpoints across the service mesh graph: per virtual host, per service, and for each edge of the graph (source and destination). For each of these API endpoints, the system is able to apply AI and statistical algorithms for anomaly detection, probability distributions, and performance metrics.
Configuring Service Mesh
The collection of virtual hosts for a service mesh can be created by assigning the same well-known label ves.io/app_type to all virtual hosts that need to be in the collection.
You can also use the ves.io/app_type label to add Virtual Kubernetes (vK8s) workloads to the collection. The value for this label can be the name of the application you are deploying or the namespace name.
This can be set in the following:
- Virtual host
- vK8s service
- vK8s container (use when the container does not have a service associated with it)
The following is an example label for launching vK8s resources:
"labels": {
"ves.io/app_type": "bookinfo-japan"
}
Note: The following applies:
- The default value for the ves.io/app_type label is the namespace name.
- In the case of a standalone pod, such as a traffic generator, set this label on both the pod and the service so that the service mesh displays an accurate service graph.
Canary Rollout and A/B Testing
Canary and A/B rollout techniques are used to control the introduction of a newer version (canary) or a newer feature (A/B) to a small percentage of user traffic and, depending on the results, gradually increase that percentage while phasing out the other version.
Using the virtual host and appropriate configuration of "routes", "clusters", and "endpoints", it is possible to send traffic to different versions of a service with very simple configuration within the distributed load balancer. Two common cases can be achieved using this:
- Canary testing: The client sends requests to a service; 5% of the calls go to a newer version of the service and the remaining 95% go to the current version.
- A/B testing: The client sends requests to a service; 5% of the calls go to a version with the new feature and the remaining 95% go to the current version.
The example below shows what can be done for a canary rollout using "Routes", "Clusters", and "Endpoint" definitions:
    Route1: match on URL1, Header2
        For subset version-10, go to ClusterA: 95% of traffic
        For subset version-11, go to ClusterA: 5% of traffic
    ClusterA:
        List of endpoints
        ...
        Subset version-10 (label expression 1)
        Subset version-11 (label expression 2)
    Endpoint:
        Discover endpoints from K8s and use labels on the endpoints to classify each one into a subset
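A short sketch of the 95/5 split itself (illustrative only): each request picks a subset according to the configured weights, and load balancing then happens within the chosen subset.

    import random

    SPLIT = [("version-10", 95), ("version-11", 5)]    # weights taken from the route configuration

    def pick_subset():
        names, weights = zip(*SPLIT)
        return random.choices(names, weights=weights, k=1)[0]

    counts = {"version-10": 0, "version-11": 0}
    for _ in range(10_000):
        counts[pick_subset()] += 1
    print(counts)    # roughly {'version-10': 9500, 'version-11': 500}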