Difference between revisions of "Monitoring Kubernetes"
Jump to navigation
Jump to search
(10 intermediate revisions by the same user not shown) | |||
Line 7: | Line 7: | ||
* [[CKA]]: [[Understand how to monitor applications in Kubernetes]] | * [[CKA]]: [[Understand how to monitor applications in Kubernetes]] | ||
− | * [[K8s log collection]]: <code>[[fluentd]], [[fluentbit]], [[promtail]]</code> | + | * [[K8s log collection]]: <code>[[fluentd]], [[fluentbit]], [[promtail]], [[kubectl logs]]</code> |
+ | |||
+ | * [[OpenMetrics]] ([[CNCF]]) | ||
+ | * [[EKS monitoring]]: [[AWS CloudWatch Container Insights]] | ||
+ | |||
+ | * [[Prometheus monitoring Mixin for Kubernetes]]: https://github.com/kubernetes-monitoring/kubernetes-mixin | ||
== Activities == | == Activities == | ||
Line 13: | Line 18: | ||
== Related == | == Related == | ||
− | * <code>[[kubectl | + | * [[Kubernetes events]]: <code>[[kubectl get events]]</code> |
* [[Kubernetes node conditions]] | * [[Kubernetes node conditions]] | ||
* <code>[[Pod The node had condition:]]</code> | * <code>[[Pod The node had condition:]]</code> | ||
Line 24: | Line 29: | ||
* [[Kubernetes node-problem-detector]] | * [[Kubernetes node-problem-detector]] | ||
* [[Kubernetes troubleshooting]] | * [[Kubernetes troubleshooting]] | ||
+ | * <code>[[kubectl get svc -n monitoring]]</code> | ||
== See also == | == See also == |
Latest revision as of 10:01, 1 December 2023
https://kubernetes.io/docs/tasks/debug/debug-cluster/resource-usage-monitoring/
- Pixie (New Relic)
- Prometheus
- Kube-state-metrics (KSM):
helm install myprometheus prometheus-community/prometheus
- LogicMonitor
- CKA: Understand how to monitor applications in Kubernetes
- K8s log collection:
fluentd, fluentbit, promtail, kubectl logs
- Prometheus monitoring Mixin for Kubernetes: https://github.com/kubernetes-monitoring/kubernetes-mixin
Activities[edit]
- Read https://www.datadoghq.com/blog/monitoring-kubernetes-performance-metrics/#cluster-state-metrics
Related[edit]
- Kubernetes events:
kubectl get events
- Kubernetes node conditions
Pod The node had condition:
- Container:
is approaching memory limit
(Datadog) - Prometheus, VictoriaMetrics, Grafana
node-problem-detector
aws-for-fluent-bit
monitoring
namespace- GKE: integrated logging and monitoring
- Kubernetes node-problem-detector
- Kubernetes troubleshooting
kubectl get svc -n monitoring
See also[edit]
Advertising: