Skip to content

Operations

Once your clusters and workloads are running, day-2 operations keep them healthy. Kupe Cloud handles infrastructure operations (node patching, control plane maintenance) — your team owns application-level operations and Kubernetes version upgrades.

Kupe managesYour team manages
Host node OS patching and security updatesApplication deployments and rollbacks
Control plane availability and backupsNamespace and RBAC design
Kupe platform component upgrades (Argo CD, Grafana, and other managed platform services)Kubernetes minor version upgrades (console-guided)
Storage and network infrastructureWorkload health, scaling, and capacity planning

Kupe supports in-place Kubernetes minor version upgrades through the console. Upgrades follow a guided workflow:

  1. Check compatibility — review release notes and test in a non-production cluster.
  2. Initiate upgrade — start the upgrade from the console cluster settings.
  3. Monitor progress — the console shows upgrade status and any issues.
  4. Validate — confirm workloads are healthy, APIs are responding, and no deprecation warnings appear.
  • Investigate unhealthy pods — events, logs, and describe output in the console.
  • Roll back a bad deployment — revert the Git commit and let Argo CD reconcile.
  • Scale workloads — adjust replica counts or HPA targets in your manifests.
  • Respond to alerts — follow the triage path from alert to dashboard to logs.
  • Decommission a cluster — remove workloads, verify no external dependencies, then delete.