Difference between revisions of "Monitoring Kubernetes"
Jump to navigation
Jump to search
Line 13: | Line 13: | ||
== Related == | == Related == | ||
− | |||
* [[Kubernetes node conditions]] | * [[Kubernetes node conditions]] | ||
* <code>[[Pod The node had condition:]]</code> | * <code>[[Pod The node had condition:]]</code> |
Revision as of 07:25, 20 September 2023
https://kubernetes.io/docs/tasks/debug/debug-cluster/resource-usage-monitoring/
- Pixie (New Relic)
- Prometheus
- Kube-state-metrics (KSM):
helm install myprometheus prometheus-community/prometheus
- LogicMonitor
- CKA: Understand how to monitor applications in Kubernetes
- K8s log collection:
fluentd, fluentbit, promtail
Activities
- Read https://www.datadoghq.com/blog/monitoring-kubernetes-performance-metrics/#cluster-state-metrics
Related
- Kubernetes node conditions
Pod The node had condition:
- Container:
is approaching memory limit
(Datadog) - Prometheus, VictoriaMetrics, Grafana
node-problem-detector
aws-for-fluent-bit
monitoring
namespace- GKE: integrated logging and monitoring
- Kubernetes node-problem-detector
- Kubernetes troubleshooting
See also
Advertising: