Drop restart-scoped labels such as container_id, uid, pod_ip, and image identity labels |
Collapses series that only differ by ephemeral Kubernetes identity |
Removes labels that are not used by dashboards in this repo |
Drop unused metric families such as traces_service_graph_request_*, network drop counters, and PV phase |
Removes whole unused series families before ingestion |
PV phase inventory is no longer sent to Grafana Cloud, while PVC and kubelet volume usage remain available |
Drop most kube_pod_* metrics for tuist-runners |
Removes the highest-churn pod-name fan-out from runner scale events |
Replaces runner phase visibility with a pool-level controller metric |
Keep kube_pod_status_unschedulable |
Preserves the placement failure signal used by the runner dashboard and ad-hoc cluster checks |
Retains one pod-scoped KSM metric family cluster-wide |
Add tuist_runners_pool_phase_replicas{pool,phase} |
Preserves linux alive plus macOS ready vs cold-booting visibility with pool x phase cardinality |
Adds one low-cardinality controller gauge family |
| Split metric cleanup by owner |
Keeps static RunnerPools from losing phase samples when autoscaling is disabled |
Leaves full cleanup to the primary RunnerPool reconciler when the pool object is gone |
| Defer phase metric publishing during reconcile |
Keeps ready and cold-booting counts fresh on create/delete retry paths |
Publishes latest known counts even when a later mutation fails and reconcile retries |
| Narrow Alloy self metrics to config-load health and scrape health |
Keeps stale-config alerting while dropping broad Alloy runtime series |
Detailed Alloy runtime internals are no longer retained in Grafana Cloud |
| Allow-list macOS node exporter metrics |
Keeps host health signals used for Mac mini operations |
Drops long-tail node exporter metrics that are not queried by Tuist dashboards |
| Switch RunnerPool dashboard to controller phase telemetry |
Removes dashboard dependency on KSM pod phase cardinality |
Phase visibility is rolled up by pool and platform, without per-Pod drilldown |
Remove the unverified macOS runner instance relabel |
Avoids normalizing the wrong scrape path and preserves per-machine host health |
Ephemeral runner instance cardinality remains a follow-up until its live source is verified |