Monitoring Kubernetes
- OpenMetrics (CNCF)
- EKS monitoring: AWS CloudWatch Container Insights
Latest revision as of 10:01, 1 December 2023
https://kubernetes.io/docs/tasks/debug/debug-cluster/resource-usage-monitoring/
- Pixie (New Relic)
- Prometheus
- Kube-state-metrics (KSM): helm install myprometheus prometheus-community/prometheus
- LogicMonitor
- CKA: Understand how to monitor applications in Kubernetes
- K8s log collection: Fluentd, Fluent Bit, Promtail, kubectl logs
- Prometheus monitoring Mixin for Kubernetes: https://github.com/kubernetes-monitoring/kubernetes-mixin
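The helm install command in the list above only works once the prometheus-community chart repository is known to Helm. A minimal sketch of the full sequence follows; the function name install_prometheus and the default monitoring namespace are assumptions made for illustration, not part of the original page.

```shell
# Sketch: install the prometheus-community/prometheus chart with Helm 3.
# install_prometheus is a hypothetical helper; the release name defaults to
# "myprometheus" (from the list above), the namespace to "monitoring" (assumed).
install_prometheus() {
  release="${1:-myprometheus}"
  namespace="${2:-monitoring}"

  # Register the community chart repository and refresh the local index,
  # then install the chart, creating the namespace if it does not exist.
  helm repo add prometheus-community https://prometheus-community.github.io/helm-charts &&
    helm repo update &&
    helm install "$release" prometheus-community/prometheus \
      --namespace "$namespace" --create-namespace
}
```

Calling install_prometheus with no arguments uses the defaults; a different release name and namespace can be passed as the first and second argument.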
Activities
- Read https://www.datadoghq.com/blog/monitoring-kubernetes-performance-metrics/#cluster-state-metrics
Related
- Kubernetes events: kubectl get events
- Kubernetes node conditions
- Pod: The node had condition:
- Container: is approaching memory limit (Datadog)
- Prometheus, VictoriaMetrics, Grafana
- node-problem-detector
- aws-for-fluent-bit
- monitoring namespace
- GKE: integrated logging and monitoring
- Kubernetes node-problem-detector
- Kubernetes troubleshooting
- kubectl get svc -n monitoring
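The kubectl commands in this list can be wrapped into a few small helpers for a quick cluster check. This is a sketch only: the function names warning_events, node_conditions and monitoring_services are hypothetical, and the monitoring namespace is assumed to exist as in the list above.

```shell
# Show only Warning events from every namespace, sorted by last occurrence.
warning_events() {
  kubectl get events --all-namespaces \
    --field-selector type=Warning \
    --sort-by=.lastTimestamp
}

# Print each node name followed by its conditions as Type=Status pairs
# (for example Ready, MemoryPressure, DiskPressure).
node_conditions() {
  kubectl get nodes -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{range .status.conditions[*]}{.type}={.status}{" "}{end}{"\n"}{end}'
}

# List services in the monitoring namespace, or in the namespace given as $1.
monitoring_services() {
  kubectl get svc -n "${1:-monitoring}"
}
```

Running warning_events first narrows the event stream to problems, and node_conditions then shows whether any node reports pressure conditions.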
See also