Difference between revisions of "Kubernetes troubleshooting"

From wikieduonline
Jump to navigation Jump to search
 
(33 intermediate revisions by 3 users not shown)
Line 1: Line 1:
* https://learnk8s.io/troubleshooting-deployments
+
* [[Kubernetes troubleshooting steps]]
  
 +
== Commands ==
 
* <code>[[kubectl logs]] [[your_pod]]</code>
 
* <code>[[kubectl logs]] [[your_pod]]</code>
 
* <code>[[kubectl get events -A]]</code>
 
* <code>[[kubectl get events -A]]</code>
 
* <code>[[kubectl describe pod]] your_pod</code>
 
* <code>[[kubectl describe pod]] your_pod</code>
 
* <code>[[kubectl describe nodes]]</code>, review <code>[[kubectl describe nodes (conditions:)|conditions:]]</code>
 
* <code>[[kubectl describe nodes]]</code>, review <code>[[kubectl describe nodes (conditions:)|conditions:]]</code>
 +
* <code>[[kubectl top]]</code>
 +
* <code>[[kubectl cluster-info dump]]</code>
  
  
* Tools: <code>[[kubectl top]]</code>, <code>[[K9s]]</code> and <code>[[crictl]]</code></code>
+
* Tools: <code>[[K9s]]</code> and <code>[[crictl]]</code></code>
  
== [[Events]] ==
+
== [[Kubernetes events|Events]] ==
 
* {{FailedScheduling}}
 
* {{FailedScheduling}}
 
* {{kubectl get events}}
 
* {{kubectl get events}}
Line 17: Line 20:
  
 
[[Load Balancer]]
 
[[Load Balancer]]
* [[UnAvailableLoadBalancer]]
+
* <code>[[UnAvailableLoadBalancer]]</code>
  
 
[[Kubelet]]
 
[[Kubelet]]
 
* <code>[[PLEG is not healthy]]</code>
 
* <code>[[PLEG is not healthy]]</code>
 +
* <code>[[/var/log/kubelet.log]]</code>
 +
 +
[[Scheduling]]
 +
* [[Kubernetes scheduling]]
 +
* [[Kubernetes Pod Topology Spread Constraints]]
 +
* [[Kubernetes pod affinity and anti affinity]]
 +
* [[Karpenter]]
 +
* <code>[[ttlSecondsUntilExpired]]</code>,  <code>[[controller.node]] [[Triggering termination for expired node after]] 168h0m0s .../...</code>
 +
 +
[[etcd]]
 +
 +
== Log ==
 +
* <code>[[Karpenter logs]]</code>
 +
* <code>[[Kubelet logs]]</code>
 +
* <code>[[/var/log/kubelet.log]]</code>
  
 
== Related ==
 
== Related ==
* [[Kubernetes node events]]: <code>[[BackoffLimitExceeded]], [[BackOff]], [[NodeNotReady]], [[FailedScheduling]]</code>
+
* [[Readiness]], [[Liveness]], <code>[[Readiness probe errored]]</code>
* <code>[[Readiness probe errored]]</code>
+
* <code>[[Reason]]: [[ProbeWarning]]</code>
* [[Readiness]], [[Liveness]]
 
* [[Reason]]: [[ProbeWarning]]
 
 
* [[Kubernetes Pod Disruptions]]
 
* [[Kubernetes Pod Disruptions]]
 
* <code>[[Unable to connect to the server]], [[~/.kube/config]]</code>
 
* <code>[[Unable to connect to the server]], [[~/.kube/config]]</code>
 
* <code>[[DiskPressure]]</code>
 
* <code>[[DiskPressure]]</code>
 
* <code>[[CalculateExpectedPodCountFailed]]</code>
 
* <code>[[CalculateExpectedPodCountFailed]]</code>
 +
* <code>[[aws eks create-cluster --logging]]</code>
 +
* <code>[[Node-pressure Eviction]]</code>
 +
* <code>[[karpenter.sh/do-not-evict: true]]</code>
 +
* <code>[[NodeNotReady]]</code>
 +
* <code>[[kubectl-node-shell]]</code>
 +
* <code>[[kubectl exec]]</code>
 +
* <code>[[kubectl attach]]</code>
 +
* [[EKS troubleshooting]]
 +
 +
== Activities ==
 +
* Review: https://learnk8s.io/troubleshooting-deployments
 +
* [[Kubernetes debugging with an ephemeral debug container]]: <code>[[kubectl debug]]</code>
  
 
== See also ==
 
== See also ==

Latest revision as of 11:55, 28 February 2024

Commands[edit]


Events[edit]

Kubernetes components[edit]

Load Balancer

Kubelet

Scheduling

etcd

Log[edit]

Related[edit]

Activities[edit]

See also[edit]

Advertising: