Skip to the content.

Bad Pods

kubectl get pods --all-namespaces -o wide | egrep -v ‘Running|Completed|Error|BackOff|OOMKilled’

Autoscaler State ConfigMap

kubectl -n kube-system describe configmap/cluster-autoscaler-status

Autoscaler logs (all pods)

kubectl logtail --namespace=kube-system kube-cluster-autoscaler --since=10m

Deleting a pod that’s stuck in “Terminating” state for more than 5 minutes

kubectl delete pod <pod-name> --force --grace-period=0

Cluster spot instances per node

kubectl get nodes -L lifecycle-status -L 'failure-domain.beta.kubernetes.io/zone' -L beta.kubernetes.io/instance-type -L aws.amazon.com/spot