AWS Platform Guide

Horizontal Scaling

You can scale your application horizontally by adding more pods to your deployment. The simplest way is to specify the number of replicas in your deployment manifest. This example will run three pods for a deployment, balancing requests between them:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: example
spec:
  replicas: 3

Autoscaling

Most applications experience fluctuations in traffic both seasonally and throughout the day. You can use the horizontal pod autoscaler to automatically add and remove replicas based on metrics observed in your cluster. Here as an example which automatically scales deployment based on the number of incoming requests per second:

apiVersion: autoscaling/v2beta2
kind: HorizontalPodAutoscaler
metadata:
  name: example
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: example
  minReplicas: 2
  maxReplicas: 10
  metrics:
  - type: Pods
    pods:
      metric:
        name: http_requests_per_second
      target:
        type: AverageValue
        averageValue: 10

AWS Platform Guide

The guide for building and maintaining production-grade Kubernetes clusters with built-in support for SRE best practices.

Work with us to scale your application, improve stability, and increase the rate of defect-free deployments.