Kubernetes Service
In simple words, a Kubernetes Service is like a permanent address for your application. In Kubernetes, Pods (the units that actually run your containers) are born and die very frequently. When a new Pod is created, it gets a brand-new IP address. If your frontend application needs to talk to your database, it cannot keep track of these constantly changing IPs. A Kubernetes Service sits in front of the Pods, gives them a single, permanent IP address, and automatically distributes (load balances) traffic to the healthy Pods behind it.
Cheat Sheet
| Service Type | Where is it accessible? | How it works (One-Liner) | Primary Use Case |
| --- | --- | --- | --- |
| ClusterIP | Inside the cluster only. | The default type. Gives a stable internal IP. | Internal microservices talking to each other. |
| NodePort | Outside the cluster (via Node IP). | Opens a specific port (30000-32767) on every Node. | Quick external testing or non-standard protocols. |
| LoadBalancer | Outside the cluster (via Cloud). | Provisions a cloud provider’s physical load balancer. | Exposing a web application directly to the internet. |
| ExternalName | Inside the cluster (DNS mapping). | Acts as an alias mapping to an external DNS name. | Pointing an internal app to an external managed DB. |
| Headless | Inside the cluster (Direct IPs). | Has no ClusterIP. Returns IPs of all target Pods directly. | Databases (StatefulSets) where you need to reach a specific replica. |
In Kubernetes, Pods are ephemeral. They are constantly created, destroyed, and rescheduled due to scaling, deployments, or node failures. Every time a Pod is recreated, it gets a new internal IP address.
If your frontend application is trying to communicate with your backend database, it cannot rely on the database Pod’s IP address because it will inevitably change.
A Kubernetes Service solves this by providing a stable, persistent virtual IP address (ClusterIP) and a reliable DNS name for a logical grouping of Pods. As long as the Service exists, its IP and DNS will never change, and it acts as an internal load balancer, distributing traffic across the healthy Pods underneath it.
Under the Hood: How Services Actually Work
A Service isn’t a physical process or a standalone container; it is an abstraction maintained by the Kubernetes control plane and network components.
- The Virtual IP (VIP): When you create a ClusterIP service, Kubernetes gives it a VIP. This IP does not belong to any physical network interface or hardware. It is purely logical.
- Labels and Selectors: A Service doesn’t explicitly link to Pods by name. Instead, it uses a `selector` to look for Pods with specific `labels` (e.g., `app: backend`). Any Pod matching those labels is automatically added to the Service’s pool (a short example follows this list).
- Endpoints & EndpointSlices: A Service doesn’t actually talk to Pods directly. It creates an `Endpoints` object. If you have 3 Pods matching your selector, the Endpoints object will contain those 3 IP addresses. In massive clusters, `EndpointSlices` break these lists into smaller chunks to keep the API server performant.
- kube-proxy: This is the real unsung hero working behind the scenes. It runs on every single worker node in your cluster. Its job is to watch the API server for changes to Services and their target Pods, and then update the local networking rules on that node (usually using `iptables`) so that traffic sent to the Service VIP gets intercepted and seamlessly redirected to an actual Pod IP.
- CoreDNS: Kubernetes has a built-in DNS server. When you create a service named `my-database` in the `dev` namespace, CoreDNS automatically creates a DNS record: `my-database.dev.svc.cluster.local`. This means your application code just needs to connect to the hostname `my-database`, and Kubernetes will resolve the IP for you!
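To make the label/selector mechanism concrete, here is a minimal Deployment sketch; the name, image, and the `app: backend` label are illustrative, matching the example above. Any Service whose selector is `app: backend` would automatically pick up the three Pods this Deployment creates.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: backend
spec:
  replicas: 3
  selector:
    matchLabels:
      app: backend          # the Deployment manages Pods carrying this label
  template:
    metadata:
      labels:
        app: backend        # a Service with selector "app: backend" selects these Pods
    spec:
      containers:
        - name: backend
          image: nginx:1.27  # placeholder image
          ports:
            - containerPort: 80
```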
The 5 Types of Kubernetes Services
Kubernetes gives you five different “doors” to access your applications. Some doors are only for people already inside the building (ClusterIP), some doors open to the street but are hard to find (NodePort), some doors have a dedicated security guard and a VIP driveway (LoadBalancer), some are just signboards pointing to another building entirely (ExternalName), and finally, some let you skip the front desk and go straight to the worker’s desk (Headless).
Imagine a massive corporate IT park in Bengaluru.
- ClusterIP: This is the internal intercom system. Employees can call each other’s desks, but no one from outside the IT park can dial these numbers.
- NodePort: This is like giving every single entry gate of the IT park a specific pin code. If an outsider knows the gate number and the pin code, they can drop a package.
- LoadBalancer: This is hiring a professional courier company. The courier company gives you one single, premium public address. They receive all the packages and intelligently distribute them to the right gates.
- ExternalName: This is the mailroom keeping a forwarding address book. When an internal employee tries to send mail to a “Partner API”, the mailroom automatically rewrites the address to the partner’s actual global address.
- Headless Service: Instead of talking to the department manager to get work done, you are given the direct cabin numbers of all the engineers in that department so you can talk to them individually.
ClusterIP (The Default)
ClusterIP is the default Kubernetes Service type. It provides a stable, virtual IP address that is only accessible from within the cluster.
- Stable Identity: Unlike Pods, which are ephemeral (they die and get new IPs), a Service’s ClusterIP remains constant throughout its lifecycle.
- Internal Scope: You cannot ping or curl a ClusterIP from your laptop unless you use a port-forward, proxy, or VPN into the cluster network.
- Automatic Load Balancing: It distributes traffic across all Pods that match the service’s selector.
ClusterIP Manifest
To create one, you define a Service resource. If you don’t specify the type, Kubernetes defaults to ClusterIP.
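Here is a minimal sketch of such a manifest; the name `backend-service`, the selector `app: api-server`, and the port numbers are illustrative and line up with the fields discussed below.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: backend-service
spec:
  type: ClusterIP            # the default; may be omitted entirely
  selector:
    app: api-server          # the "glue": every Pod with this label becomes an endpoint
  ports:
    - protocol: TCP
      port: 80               # virtual port other Pods use to reach the Service
      targetPort: 8080       # port the application listens on inside the container
```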
Key Fields to Master:
- `port`: The internal “virtual” port other Pods use to talk to this Service.
- `targetPort`: The actual port your application is listening on inside the container.
- `selector`: This is the “glue.” The Service finds every Pod with the label `app: api-server` and adds their IPs to its Endpoints list.
How it Works Under the Hood
This is where most people get confused. The ClusterIP isn’t a “real” IP assigned to a physical or virtual network interface. It’s a Virtual IP (VIP) managed by kube-proxy.
- The Watcher: `kube-proxy` runs on every node in your cluster. It constantly watches the Kubernetes API Server for new Services and Endpoints.
- The Rule Maker: When you create `backend-service`, `kube-proxy` creates iptables (or IPVS) rules on every single node.
- The Redirection: When Pod A tries to send a packet to the ClusterIP `10.96.0.10`, the Linux kernel intercepts it before it even leaves the node. The `iptables` rules say: “Wait! Don’t look for a real machine at 10.96.0.10. Instead, randomly pick one of these Pod IPs (e.g., 192.168.1.5) and send the packet there.”
Service Discovery (DNS)
You rarely use the actual IP address (e.g., 10.96.x.x) in your application code. Instead, you use the DNS name. Kubernetes comes with CoreDNS built-in, which automatically creates DNS records for your services.
- Same Namespace: If a frontend Pod wants to talk to the backend in the same namespace, you just call `http://backend-service`.
- Different Namespace: If they are in different namespaces, use the Fully Qualified Domain Name (FQDN): `http://backend-service.prod.svc.cluster.local` (a short example follows this list).
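As a hedged sketch of how an application might consume that DNS name, the environment variable below (`BACKEND_URL`) and the image are hypothetical; the app simply reads the URL instead of any hard-coded IP.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: frontend
spec:
  replicas: 2
  selector:
    matchLabels:
      app: frontend
  template:
    metadata:
      labels:
        app: frontend
    spec:
      containers:
        - name: frontend
          image: my-frontend:1.0        # placeholder image
          env:
            - name: BACKEND_URL         # hypothetical variable the app reads
              # same-namespace short name; switch to the FQDN for cross-namespace calls
              value: "http://backend-service"
```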
- Key Components: Service object, `kube-proxy`, iptables/IPVS, CoreDNS, Endpoints/EndpointSlices.
- Key Characteristics: Ephemeral backends, persistent virtual frontend, strictly internal (Layer 4 TCP/UDP).
- Use Case: Connecting a React frontend container to a Node.js backend API container securely inside the cluster.
- Benefits: Decouples microservices, provides automatic load balancing, and offers highly reliable internal DNS discovery.
- Best Practices:
  - Keep `port` and `targetPort` identical when possible to avoid mental gymnastics during debugging.
  - Always use explicit, standard labels (e.g., `app.kubernetes.io/name: my-backend`) for your selectors.
https://kubernetes.io/docs/concepts/services-networking/service/#type-clusterip
–
NodePort
A NodePort Service is a way to expose your Kubernetes applications to external traffic. It builds directly on top of the ClusterIP service: when you create a NodePort service, the Kubernetes control plane allocates a static port for it (by default, from the range 30000-32767).
This exact port is then opened on the IP address of every single worker node in your cluster.
The Traffic Flow: External Client Request → Hits NodeIP:NodePort (on any worker node) → Gets routed to the ClusterIP of the Service → Finally gets load-balanced to the actual Pod running the application.
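A minimal NodePort sketch, reusing the illustrative `app: api-server` selector; the explicit `nodePort` is optional, and Kubernetes picks one from the 30000-32767 range if you omit it.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: api-nodeport
spec:
  type: NodePort
  selector:
    app: api-server
  ports:
    - protocol: TCP
      port: 80            # ClusterIP port (NodePort still creates a ClusterIP underneath)
      targetPort: 8080    # container port
      nodePort: 30080     # optional; must fall inside the default 30000-32767 range
```

With this in place, the application is reachable at `<any-node-IP>:30080`.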
While this is incredibly helpful for quick access, it comes with strict drawbacks for production. Exposing ports directly on nodes means you are bypassing standard gateway network security unless strict firewall rules (like AWS Security Groups or Azure NSGs) are applied. Furthermore, it places the burden of load balancing on the client side: if the client keeps hitting Node A and Node A goes down, the connection drops, even if the application is perfectly healthy on Node B.
- Default Range: 30000-32767.
- Scope: Opens the same port on all nodes, even if the pod is not running on that specific node.
- Dependency: NodePort automatically creates a ClusterIP underneath.
- Security: Not recommended for direct public exposure in production without an external Load Balancer or WAF.
- Key Components: `kube-proxy`, iptables/IPVS, node network interface, `apiserver` (for allocating the port).
- Key Characteristics: Static port range, applied globally across all cluster nodes.
- Use Case: Quick debugging, local development (like Minikube/Kind), or acting as backend targets for traditional, non-cloud-native external load balancers.
- Benefits: Simplest way to get external traffic into a bare-metal cluster without relying on Cloud Provider LoadBalancers.
- Best Practices: Restrict node access via strict VPC Security Groups. Use an Ingress controller for HTTP/HTTPS traffic instead of direct NodePorts. Monitor node health, as client traffic will fail if they target a dead node.
- Challenge: Port Exhaustion. You only have 2,768 ports (30000-32767) available per cluster.
- Solution: Use Ingress. One Ingress (using one NodePort/LoadBalancer) can route hundreds of applications based on hostnames.
- Problem: Node IPs are ephemeral. If a cloud node dies and is replaced, its IP changes, breaking client access.
- Solution: Place an external Load Balancer or a DNS Round-Robin system in front of the nodes.
- Issue: Connection refused.
- Solution: Check if the Node’s firewall (e.g., `ufw` or the AWS Security Group) actually allows inbound TCP traffic on the specific 30xxx port. Also verify the Pod is `Running` and passing its Readiness Probe.
Kubernetes Official Documentation: Service – NodePort
–
LoadBalancer
A LoadBalancer Service is the standard, most straightforward way to expose your Kubernetes application to the public internet. It does not replace the NodePort or ClusterIP; instead, it builds on top of them.
When you declare a Service of type LoadBalancer, the Kubernetes Control Plane (specifically the Cloud Controller Manager) communicates directly with your cloud provider’s API (like AWS, Google Cloud, or Azure). It automatically provisions a physical or virtual load balancer outside of your Kubernetes cluster.
The Traffic Flow: External User Request → Hits the Cloud Load Balancer IP → The Load Balancer routes it to a NodeIP:NodePort (on your worker nodes) → kube-proxy rules route it to the internal ClusterIP → Finally, it reaches the destination Pod.
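A minimal LoadBalancer sketch; the names and ports are illustrative, and the AWS annotation shown is the legacy in-tree one discussed below, included only to show how cloud-specific behaviour is driven by annotations.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: web-lb
  annotations:
    # example only: asks the legacy in-tree AWS provider for an internal (non-internet-facing) LB
    service.beta.kubernetes.io/aws-load-balancer-internal: "true"
spec:
  type: LoadBalancer
  selector:
    app: web
  ports:
    - protocol: TCP
      port: 80
      targetPort: 8080
```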
- Cloud Dependency: Requires a supported cloud provider (AWS, GCP, Azure) or a bare-metal solution like MetalLB.
- Automatic Provisioning: Creating the Service automatically creates the external cloud resource.
- Cost Factor: Every LoadBalancer Service usually spins up a dedicated cloud load balancer, which costs real money.
- Layer 4 vs Layer 7: By default, it provisions a Layer 4 (TCP/UDP) load balancer.
For production-grade DevSecOps architectures, simply deploying a LoadBalancer is not enough.
- Annotations are Key: You heavily customize cloud LBs using annotations. For example, in AWS, adding `service.beta.kubernetes.io/aws-load-balancer-internal: "true"` creates an internal LB instead of an internet-facing one, which is crucial for secure microservices. Adding ACM certificate ARNs via annotations handles SSL/TLS termination at the LB level, keeping internal cluster traffic fast and unencrypted.
- Security Groups: The Cloud Controller Manager dynamically updates node Security Groups/firewalls to allow traffic only from the Load Balancer, effectively shielding the NodePorts from direct internet access.
- Relevant Tools & Links:
- AWS Load Balancer Controller: A Kubernetes controller for AWS that manages AWS Elastic Load Balancers for a Kubernetes cluster (highly recommended over the legacy in-tree controller).
- MetalLB: A load-balancer implementation for bare metal Kubernetes clusters using standard routing protocols (ARP, BGP).
- kube-vip: High availability and load balancing for both the Kubernetes control plane and Services.
The Bare Metal Gap: Everything above assumes a cloud environment. On bare-metal or on-premises servers, a LoadBalancer Service will not work out of the box; you must install a tool like MetalLB to provide this behavior.
- Key Components: Cloud Controller Manager (CCM), cloud provider API, external LB infrastructure, `kube-proxy`.
- Key Characteristics: Provides a single, stable ingress point (IP or DNS) for external traffic.
- Use Case: Exposing web servers, APIs, or custom TCP/UDP services directly to the internet or a corporate intranet.
- Benefits: Highly available, integrates natively with cloud security (WAF, DDoS protection), and removes the need to manage node IPs manually.
- Best Practices:
  - Do not overuse it. Creating a LoadBalancer for every microservice is an anti-pattern and gets very expensive. Use it to expose an Ingress Controller (like NGINX), which then routes traffic to multiple services using just one LoadBalancer.
  - Always terminate TLS at the Load Balancer to offload CPU usage from your Pods.
- Challenge: High Cloud Costs. 10 Services = 10 Cloud LBs = A huge monthly bill.
- Solution: Implement an Ingress Controller. You only pay for one LB, and the Ingress handles routing based on URL paths (e.g., `example.com/api` vs `example.com/web`).
- Problem: Service stuck in `<pending>` state.
- Solution: Verify your cluster is integrated with a cloud provider. If on bare metal, install MetalLB. If in AWS/GCP, check the IAM roles/permissions assigned to your Kubernetes control plane to ensure it has the rights to create LB resources.
- Issue: Losing the original Client IP address.
- Solution: Configure `externalTrafficPolicy: Local` or rely on the `X-Forwarded-For` HTTP header if using a Layer 7 setup.
https://kubernetes.io/docs/concepts/services-networking/service/#loadbalancer
–
ExternalName
An ExternalName Service is a special type of Kubernetes Service that does not use label selectors, does not allocate a ClusterIP, and does not route traffic through kube-proxy. Instead, it acts purely at the DNS level as an internal alias for an external service.
When you create an ExternalName Service (let’s say db-svc), the cluster’s DNS service (usually CoreDNS) creates a CNAME (Canonical Name) record. When your Pods want to connect to the database, they simply query db-svc. CoreDNS catches this query and responds with the CNAME, pointing the Pod directly to my-database.example.com.
The Traffic Flow: Application Pod asks CoreDNS for db-svc → CoreDNS returns the CNAME my-database.example.com → Pod resolves the external IP of my-database.example.com → Pod connects directly to the external database, completely bypassing kube-proxy.
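A minimal sketch of the `db-svc` example above; the external hostname is a placeholder.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: db-svc
spec:
  type: ExternalName
  externalName: my-database.example.com   # CoreDNS answers queries for db-svc with this CNAME
```

Pods in the same namespace then simply connect to `db-svc`, and DNS resolution handles the redirection.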
This is a fantastic pattern because it abstracts the external environment. If you move from a staging AWS RDS database to a production one, you just update the ExternalName Service in Kubernetes. You never have to touch your application code or rebuild your container images to change the database URL!
At a production DevSecOps level, routing egress traffic requires careful security considerations.
- Egress Network Policies: Because ExternalName relies on DNS and external IPs, standard Kubernetes NetworkPolicies (which rely on Pod selectors) are hard to enforce. If you restrict egress traffic, you must allow egress to the specific IP blocks of the external service, which can be dynamic and painful to manage.
- Service Mesh Integration: Advanced architects often avoid `ExternalName` for strict production egress. Instead, they use an Egress Gateway or tools like Istio ServiceEntry. This allows for mutual TLS (mTLS), retries, timeouts, and deep observability for traffic leaving the cluster.
- Relevant Tools & Links:
- CoreDNS: The default DNS server for K8s that handles the CNAME translation.
- Cilium Egress Gateway: Excellent for controlling the source IP of egress traffic going to external databases.
- Istio: For managing secure, observable egress traffic routing.
- Key Components: CoreDNS, External API/Database, Application Pod.
- Key Characteristics: No label selectors, no proxying, pure DNS aliasing.
- Use Case: Connecting to managed databases (AWS RDS, Azure SQL), cross-namespace service referencing, or external third-party APIs.
- Benefits: Prevents hardcoding of environmental variables, simplifies CI/CD pipelines, completely decouples the application from external infrastructure changes.
- Best Practices: Use this mainly for databases or internal corporate services. For external HTTPS APIs, be very cautious of the Host Header issue. Ensure your worker nodes have routeable access (NAT Gateways, Internet Gateways) to the external resource.
- Challenge: SSL/TLS Certificate validation failures.
- Solution: As mentioned above, configure your HTTP client in the application to explicitly set the Host/SNI header to the external domain name, not the Kubernetes service name.
- Problem: IP address of the external service changes frequently.
- Solution: This is actually perfectly handled by ExternalName, because it returns a CNAME. When the Pod resolves the CNAME, it will always get the freshest IP address from the external DNS provider.
- Issue: Egress traffic is blocked by strict enterprise firewalls.
- Solution: You must ensure the actual Worker Nodes (EC2, VMs) have firewall rules (Security Groups) allowing outbound traffic on the required port (e.g., 5432 for PostgreSQL) to the destination.
https://kubernetes.io/docs/concepts/services-networking/service/#externalname
–
Headless Service
A Headless Service is a unique configuration where you explicitly tell Kubernetes not to load balance traffic. You achieve this by setting the `clusterIP` field to `None` in your Service YAML file.
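A minimal Headless Service sketch; the name `nginx-headless` and the selector are illustrative and line up with the StatefulSet example later in this section.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: nginx-headless
spec:
  clusterIP: None        # the "magic" setting: no VIP, no kube-proxy rules
  selector:
    app: nginx
  ports:
    - protocol: TCP
      port: 80
      targetPort: 80
```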
- The Magic Configuration: Set `clusterIP: None` in your Service YAML.
- No Load Balancing: Kubernetes skips `kube-proxy` rules and bypasses virtual IP creation.
- DNS Magic: Querying the Service name returns multiple A records (the direct IP addresses of all ready Pods).
- StatefulSet’s Best Friend: Headless Services are almost always paired with StatefulSets to provide stable, predictable network identities (e.g., `pod-0.service-name`).
| Feature | Standard Service (ClusterIP) | Headless Service |
| --- | --- | --- |
| `clusterIP` value | Randomly assigned virtual IP | `None` |
| Load Balancing | Handled by K8s (kube-proxy) | Handled by the Client application |
| DNS Resolution | Returns 1 IP (the Service’s ClusterIP) | Returns multiple IPs (the actual Pod IPs) |
| Primary Use Case | Stateless web apps & APIs | Stateful apps (Databases, Message Queues) |
Consequently, kube-proxy does not create any iptables or IPVS rules for this service. Instead, the magic happens entirely at the DNS level. When a client application makes a DNS query to the CoreDNS server for the Headless Service, CoreDNS responds with a list of all the individual IP addresses of the Pods currently matching the Service’s label selector. This allows the client application (like a Cassandra node or a Kafka broker) to maintain a pool of direct connections to its peers, enabling complex leader-election and data replication protocols that would be impossible behind a traditional load balancer.
The Traffic Flow: Client queries Headless Service DNS → CoreDNS returns a list of direct Pod IPs (e.g., Pod A IP, Pod B IP) → Client application chooses one (or connects to all) → Client connects directly to the target Pod(s), completely bypassing Kubernetes load balancing.
- Why do databases hate load balancers? In a PostgreSQL cluster, you have one “Master” (who writes data) and several “Slaves/Replicas” (who read data). If an application wants to write data, it must talk strictly to the Master. If a load balancer randomly sends the write request to a Replica, the database will throw an error! Headless Services solve this by letting the application discover exactly who the Master is.
- StatefulSets and Predictable DNS: When you pair a Headless Service with a StatefulSet, K8s creates predictable DNS records for each individual Pod, for example `web-0.nginx-headless.default.svc.cluster.local`. This is critical for cluster bootstrapping (e.g., an Elasticsearch node needs to know the exact hostname of the master node to join the cluster). A sketch follows this list.
- SRV Records: CoreDNS doesn’t just return A records for Headless Services; it also returns SRV (Service) records. These records provide the port numbers and hostnames of the Pods, which is essential for auto-discovery mechanisms in modern distributed systems.
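A hedged sketch of how a StatefulSet references that Headless Service to get the predictable per-Pod DNS names; the names, image, and replica count are illustrative.

```yaml
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: web
spec:
  serviceName: nginx-headless   # must point at the Headless Service governing this StatefulSet
  replicas: 3
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
        - name: nginx
          image: nginx:1.27
          ports:
            - containerPort: 80
# Pods become web-0, web-1, web-2, reachable as
# web-0.nginx-headless.default.svc.cluster.local, and so on.
```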
- Key Components: CoreDNS, Endpoints Controller, Pod IPs, StatefulSet.
- Key Characteristics: `clusterIP: None`, no load balancing, peer-to-peer routing.
- Use Case: Database clustering (Cassandra, MongoDB, PostgreSQL), message brokers (RabbitMQ, Kafka), and stateful applications requiring leader election.
- Benefits: Complete control over network connections. Removes the network bottleneck and latency of passing through a proxy.
- Best Practices:
  - Always pair it with a `StatefulSet`.
  - Use a normal `ClusterIP` service alongside your Headless Service if you also want normal web clients to access the database without writing complex client-side load balancing code (e.g., Headless for node-to-node replication, ClusterIP for app-to-db queries).
- Challenge: Client-side load balancing complexity. Since K8s isn’t load balancing, your application code (or database driver) must be smart enough to handle connection failures and distribute traffic among the Pod IPs.
- Problem: DNS Caching. Some applications cache DNS lookups forever. If a Pod dies and comes back with a new IP, the application might keep trying to talk to the dead IP.
- Solution: Ensure your application respects DNS TTL (Time To Live) or use a sidecar proxy (like Envoy) to manage connections.
- Issue: Debugging is harder. You can’t just `curl` the service IP.
- Solution: You must use `nslookup` or `dig` from inside a debug container to verify that the Headless Service is correctly returning the backend Pod IPs. A minimal debug Pod sketch follows this list.
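A hedged sketch of a throwaway debug Pod for that DNS check; `busybox` ships a small `nslookup`, and `nginx-headless` is the illustrative Service name used earlier.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: dns-debug
spec:
  restartPolicy: Never
  containers:
    - name: dnsutils
      image: busybox:1.36
      # should print one A record per ready Pod behind the Headless Service
      command: ["nslookup", "nginx-headless.default.svc.cluster.local"]
```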
https://kubernetes.io/docs/concepts/services-networking/service/#headless-services
Advanced Traffic Routing Concepts
To truly master Services, you need to understand how to optimize them for large-scale environments:
- Internal Traffic Policy (`internalTrafficPolicy: Local`): By default, traffic sent from a Pod to a Service may be routed to a backend Pod on a completely different node, causing unnecessary network hops. Setting this policy to `Local` forces the node to send traffic only to Pods running on that same node, reducing latency (a sketch follows this list).
- Topology Aware Routing: In multi-zone clusters, cross-zone network traffic costs money and adds latency. Topology Aware Routing instructs Kubernetes to prefer routing traffic to Endpoints (Pods) that are in the same geographical zone as the requester.
- The Limits of Services (Why we need Ingress): Kubernetes Services operate at Layer 4 (TCP/UDP). They do not understand HTTP/HTTPS, meaning they cannot route traffic based on URL paths (e.g., sending `/api` to one Service and `/web` to another) or handle SSL termination. For Layer 7 routing, you must put an Ingress Controller or Gateway API in front of your Services.
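A minimal sketch of the internal traffic policy setting on an otherwise ordinary ClusterIP Service; the name and selector are illustrative.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: backend-service
spec:
  selector:
    app: api-server
  internalTrafficPolicy: Local   # only route to endpoints on the same node as the calling Pod
  ports:
    - protocol: TCP
      port: 80
      targetPort: 8080
```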
DevSecOps Architect Level
For the architects designing production-grade, highly secure clusters, Services are just the beginning.
- kube-proxy Modes: In production, rely on `IPVS` mode instead of `iptables` for `kube-proxy`. `iptables` evaluates rules sequentially and degrades heavily in performance if you have thousands of Services. `IPVS` uses hash tables for routing and supports advanced load-balancing algorithms like least connections.
- Gateway API & Ingress: Exposing hundreds of services via the `LoadBalancer` type is a bad practice, as it wastes massive amounts of cloud provider cost. Instead, use an Ingress Controller like NGINX Ingress or the modern Gateway API (https://gateway-api.sigs.k8s.io/) to route external HTTP/HTTPS traffic through a single, secure entry point to your internal `ClusterIP` services.
- Service Mesh Security (mTLS): By default, Pod-to-Pod traffic via Services is completely unencrypted. A DevSecOps professional must secure this laterally. Implement a Service Mesh like Istio (https://istio.io/) or Linkerd (https://linkerd.io/). They inject sidecar proxies that handle automatic Mutual TLS (mTLS) encryption between your services, ensuring zero-trust network security without changing a single line of application code.
- eBPF Networking: Modern high-performance clusters are replacing `kube-proxy` entirely using eBPF-based tools like Cilium (https://cilium.io/). Cilium provides massive performance gains, deep network observability, and highly granular network policies at the kernel level.
- ExternalTrafficPolicy: If you use a NodePort or LoadBalancer, by default traffic might hit Node A while the target Pod is running on Node B. Kubernetes will route the traffic from Node A to Node B, adding an extra network hop and losing the original client IP address. Setting `externalTrafficPolicy: Local` forces the node to send traffic only to Pods on that same node. This preserves the client IP and reduces latency, but can lead to uneven load balancing if your Pods are not distributed evenly (a sketch follows this list).
- Dual-Stack Networking: Modern Kubernetes Services support IPv4/IPv6 dual-stack. You can assign both an IPv4 and an IPv6 ClusterIP to a single Service, which is critical for telecom, 5G, and modern IoT deployments.
- Topology Aware Routing: In multi-zone clusters, cross-zone traffic costs money and adds latency. Topology Aware Routing tells the service to prefer sending traffic to endpoints in the same availability zone as the client making the request, optimizing both performance and billing.
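A hedged sketch combining `externalTrafficPolicy: Local` and dual-stack networking from the list above; the names are illustrative, and `PreferDualStack` assumes the cluster has dual-stack enabled.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: web-lb
spec:
  type: LoadBalancer
  selector:
    app: web
  externalTrafficPolicy: Local      # keep traffic on the receiving node and preserve the client IP
  ipFamilyPolicy: PreferDualStack   # request both IPv4 and IPv6 where the cluster supports it
  ipFamilies:
    - IPv4
    - IPv6
  ports:
    - protocol: TCP
      port: 80
      targetPort: 8080
```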