Is there any documentation or roadmap which problems on K8s are automatically detected and alerted?
E.g. Cpu throttling, OOM, pods crashing,…
based on my current experience and the Event Documentation there are no problems detected at all? We always need to create custom alerts based on some metrics?
are there plans to change this?