Skip to main content
Google Cloud Documentation
Technology areas
  • AI and ML
  • Application development
  • Application hosting
  • Compute
  • Data analytics and pipelines
  • Databases
  • Distributed, hybrid, and multicloud
  • Generative AI
  • Industry solutions
  • Networking
  • Observability and monitoring
  • Security
  • Storage
Cross-product tools
  • Access and resources management
  • Costs and usage management
  • Infrastructure as code
  • Migration
  • SDK, languages, frameworks, and tools
/
Console
  • English
  • Deutsch
  • Español
  • Español – América Latina
  • Français
  • Indonesia
  • Italiano
  • Português
  • Português – Brasil
  • 中文 – 简体
  • 中文 – 繁體
  • 日本語
  • 한국어
Sign in
  • Google Kubernetes Engine (GKE)
Start free
Overview Overview Guides Reference Samples Resources
Google Cloud Documentation
  • Technology areas
    • More
    • Overview
    • Overview
    • Guides
    • Reference
    • Samples
    • Resources
  • Cross-product tools
    • More
  • Console
  • Discover
  • Product overview
  • Explore GKE documentation
    • Overview
    • Main GKE documentation
    • GKE AI/ML documentation
    • GKE networking documentation
    • GKE security documentation
    • GKE fleet management documentation
  • Use GKE or Cloud Run?
  • Try it
    • Create a cluster in the console
    • Create a cluster with Terraform
    • Explore your cluster
  • Fine-tune GKE services with Gemini assistance
  • Learn fundamentals
  • Start learning about GKE
  • Learn Kubernetes fundamentals
    • Start learning about Kubernetes
    • Introducing containers
    • Kubernetes comic
    • Kubernetes.io
    • Video playlist: Learn Kubernetes with Google
  • Learn GKE essentials
    • GKE modes of operation
    • Video playlist: GKE Essentials
  • Common GKE user roles and tasks
  • Get started
  • Cluster lifecycle
  • Cluster administration overview
  • Cluster configuration
  • Deploying workloads
  • GKE cluster architecture
  • Workflows and tools
    • gcloud CLI overview
    • GKE in the Google Cloud console
    • Provision GKE resources with Terraform
    • Install kubectl and configure cluster access
    • Simplify deployment using your IDE
  • Learning path: Containerize your app
    • Overview
    • Understand the monolith
    • Modularize the monolith
    • Prepare for containerization
    • Containerize the modular app
    • Deploy the app to a cluster
  • Learning path: Scalable apps
    • Overview
    • Create a cluster
    • Monitor with Prometheus
    • Scale workloads
    • Simulate failure
    • Centralize changes
    • Production considerations
  • Design and plan
  • Code samples
  • Architectures and best practices
    • Develop and deliver apps with Cloud Code, Cloud Build, and Google Cloud Deploy
    • Address continuous delivery challenges
  • Set up GKE clusters
  • Plan clusters for running your workloads
    • Compare features in GKE Autopilot and Standard
    • About regional clusters
    • About feature gates
    • About alpha clusters
  • Set up Autopilot clusters
    • About GKE Autopilot
    • Create Autopilot clusters
    • Extend the run time of Autopilot Pods
  • Set up Standard clusters
    • Create a zonal cluster
    • Create a regional cluster
    • Create an alpha cluster
    • Create a cluster using Windows node pools
  • Prepare to use clusters
    • Use labels to organize clusters
    • Manage GKE resources using Tags
  • Configure node pools
    • About node pools
    • Add and manage node pools
    • About node images
    • About Containerd images
    • Specify a node image
    • About Arm workloads on GKE
    • Create Standard clusters and node pools with Arm nodes
    • Plan GKE Standard node sizes
    • About Spot VMs
    • About Windows Server containers
    • Auto-repair nodes
    • Automatically bootstrap GKE nodes with DaemonSets
    • Update Kubernetes node labels and taints for node pools
  • Set up clusters for multi-tenancy
    • About cluster multi-tenancy
    • Plan a multi-tenant environment
    • Prepare GKE clusters for third-party tenants
    • Set up multi-tenant logging
  • Use fleets to simplify multi-cluster management
    • About fleets
    • Create fleets
  • Set up service mesh
    • Provision Cloud Service Mesh in an Autopilot cluster
  • Enhance scalability for clusters
    • About GKE scalability
    • Plan for scalability
    • Plan for large GKE clusters
    • Plan for large workloads
    • Provision extra compute capacity for rapid Pod scaling
    • Consume reserved zonal resources
    • About quicker workload startup with fast-starting nodes
  • Reduce and optimize costs
  • Plan for cost-optimization
  • View GKE costs
    • View cluster costs breakdown
    • View cost-related optimization metrics
  • Optimize GKE costs
    • Right-size your GKE workloads at scale
    • Reduce costs by scaling down GKE clusters during off-peak hours
    • Identify underprovisioned and overprovisioned GKE clusters
    • Identify idle GKE clusters
  • Configure autoscaling for infrastructure
    • About cluster autoscaling
    • Configure cluster autoscaling
    • About node pool auto-creation
    • Configure node pool auto-creation
    • View cluster autoscaling events
  • Configure autoscaling for workloads
    • Scaling deployed applications
    • About autoscaling workloads based on metrics
    • Optimize Pod autoscaling based on metrics
    • About horizontal Pod autoscaling
    • Autoscale deployments using horizontal Pod autoscaling
    • Configure autoscaling for LLM workloads on GPUs
    • Configure autoscaling for LLM workloads on TPUs
    • View horizontal Pod autoscaler events
    • Scale to zero using KEDA
    • About vertical Pod autoscaling
    • Configure multidimensional Pod autoscaling
    • Scale container resource requests and limits
  • Provision storage
  • About storage for GKE clusters
  • Use Kubernetes features, primitives, and abstractions for storage
    • Use persistent volumes and dynamic provisioning
    • Use StatefulSets
    • About volume snapshots