Difference between revisions of "Kubernetes troubleshooting"

From wikieduonline
Jump to navigation Jump to search
 
(64 intermediate revisions by 4 users not shown)
Line 1: Line 1:
* https://learnk8s.io/troubleshooting-deployments
+
* [[Kubernetes troubleshooting steps]]
  
 +
== Commands ==
 
* <code>[[kubectl logs]] [[your_pod]]</code>
 
* <code>[[kubectl logs]] [[your_pod]]</code>
 
* <code>[[kubectl get events -A]]</code>
 
* <code>[[kubectl get events -A]]</code>
 
* <code>[[kubectl describe pod]] your_pod</code>
 
* <code>[[kubectl describe pod]] your_pod</code>
* <code>[[kubectl describe nodes]]</code>, review [[kubectl describe nodes (conditions:)|conditions:]]
+
* <code>[[kubectl describe nodes]]</code>, review <code>[[kubectl describe nodes (conditions:)|conditions:]]</code>
 +
* <code>[[kubectl top]]</code>
 +
* <code>[[kubectl cluster-info dump]]</code>
  
  
* <code>[[kubectl top]] or [[K9s]]</code>
+
* Tools: <code>[[K9s]]</code> and <code>[[crictl]]</code></code>
 +
 
 +
== [[Kubernetes events|Events]] ==
 +
* {{FailedScheduling}}
 +
* {{kubectl get events}}
 +
* {{Kubernetes nodes events}}
 +
 
 +
== [[Kubernetes components]] ==
 +
 
 +
[[Load Balancer]]
 +
* <code>[[UnAvailableLoadBalancer]]</code>
 +
 
 +
[[Kubelet]]
 +
* <code>[[PLEG is not healthy]]</code>
 +
* <code>[[/var/log/kubelet.log]]</code>
 +
 
 +
[[Scheduling]]
 +
* [[Kubernetes scheduling]]
 +
* [[Kubernetes Pod Topology Spread Constraints]]
 +
* [[Kubernetes pod affinity and anti affinity]]
 +
* [[Karpenter]]
 +
* <code>[[ttlSecondsUntilExpired]]</code>,  <code>[[controller.node]] [[Triggering termination for expired node after]] 168h0m0s .../...</code>
 +
 
 +
[[etcd]]
 +
 
 +
== Log ==
 +
* <code>[[Karpenter logs]]</code>
 +
* <code>[[Kubelet logs]]</code>
 +
* <code>[[/var/log/kubelet.log]]</code>
  
 
== Related ==
 
== Related ==
* <code>[[kubectl --cluster]]</code>
+
* [[Readiness]], [[Liveness]], <code>[[Readiness probe errored]]</code>
* [[Kubernetes deployments]]
+
* <code>[[Reason]]: [[ProbeWarning]]</code>
* [[Kubernetes components]]
 
* [[Readiness probe errored]]
 
* [[Readiness]], [[Liveness]]
 
* [[BackOff]]
 
* [[BackoffLimitExceeded]]
 
* [[Type]]
 
* [[Reason]]: [[ProbeWarning]]
 
 
* [[Kubernetes Pod Disruptions]]
 
* [[Kubernetes Pod Disruptions]]
* [[NodeNotReady]]
+
* <code>[[Unable to connect to the server]], [[~/.kube/config]]</code>
* <code>[[Unable to connect to the server]]</code>
+
* <code>[[DiskPressure]]</code>
[[Insufficient cpu]]
+
* <code>[[CalculateExpectedPodCountFailed]]</code>
[[Insufficient memory]]
+
* <code>[[aws eks create-cluster --logging]]</code>
[[MemoryPressure]]
+
* <code>[[Node-pressure Eviction]]</code>
[[DiskPressure]]
+
* <code>[[karpenter.sh/do-not-evict: true]]</code>
* <code>[[crictl]]</code>
+
* <code>[[NodeNotReady]]</code>
 +
* <code>[[kubectl-node-shell]]</code>
 +
* <code>[[kubectl exec]]</code>
 +
* <code>[[kubectl attach]]</code>
 +
* [[EKS troubleshooting]]
 +
 
 +
== Activities ==
 +
* Review: https://learnk8s.io/troubleshooting-deployments
 +
* [[Kubernetes debugging with an ephemeral debug container]]: <code>[[kubectl debug]]</code>
  
 
== See also ==
 
== See also ==
 +
* {{kubectl get events}}
 +
* {{Kubernetes nodes events}}
 
* {{K8s troubleshooting}}
 
* {{K8s troubleshooting}}
  
  
 
[[Category:K8s]]
 
[[Category:K8s]]

Latest revision as of 11:55, 28 February 2024

Commands[edit]


Events[edit]

Kubernetes components[edit]

Load Balancer

Kubelet

Scheduling

etcd

Log[edit]

Related[edit]

Activities[edit]

See also[edit]

Advertising: