apps: add back api server availability recording rules#2754
apps: add back api server availability recording rules#2754kristiangronas wants to merge 1 commit intoelastisys:mainfrom
Conversation
| {{- if .Values.defaultRules.recordlabels }} | ||
| {{ toYaml .Values.defaultRules.recordlabels | indent 4 }} |
There was a problem hiding this comment.
For these to be picked up by our prometheus we need the evaluate_prometheus label, which was not getting added as we have this variable in camel case:
| {{- if .Values.defaultRules.recordlabels }} | |
| {{ toYaml .Values.defaultRules.recordlabels | indent 4 }} | |
| {{- if .Values.defaultRules.recordLabels }} | |
| {{ toYaml .Values.defaultRules.recordLabels | indent 4 }} |
There was a problem hiding this comment.
Thanks, that explains why it was only working in wc, also i had to keep the apiserver_request_sli_duration_seconds_bucket and apiserver_request_sli_duration_seconds_count metrics for half of the panels, but i'm not too sure if the space usage of that is worth it, what do you think?
There was a problem hiding this comment.
Could you have a look at how many extra time series this adds, and also if you could check how resource usage for Prometheus differs after adding this?
The reason for this being removed in the past is most likely due to us wanting to reduce resource usage for Prometheus, and the dashboard was never removed when doing this change.
If this is something you really want then you could always add templating to be able to toggle these record rules as well as the dashboard, and keep it default false.
Warning
This is a public repository, ensure not to disclose:
What kind of PR is this?
Required: Mark one of the following that is applicable:
Optional: Mark one or more of the following that are applicable:
Important
Breaking changes should be marked
kind/admin-changeorkind/dev-changedepending on typeCritical security fixes should be marked with
kind/securityWhat does this PR do / why do we need this PR?
The
Kubernetes / API serverdashboard still uses this data, originally it was removed due to the rules not being usedThe rule was copied from https://github.com/prometheus-operator/kube-prometheus/blob/main/manifests/kubernetesControlPlane-prometheusRule.yaml#L877
Information to reviewers
Checklist