cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

How to reduce time it takes to perform a health check on dead pod

heramb_sawant
Organizer

Hi Team,
Node/host having k8s pods , later for some reason pods were killed. Though pods were not exist(already killed) , host having oneagent , was still looking for tcp connectivity with process running on that pod. 

Can someone please tell me how we can minimize the time it takes to perform a health check on dead pod?  currently its more than 15 min  and alert remain open for long time,  it is  too long.  I would say DT should look for the pod for one minute and if doesn’t exist don’t report.

Regards,
Heramb Sawant

 

1 REPLY 1

heramb_sawant
Organizer

Any help will be appreciated. 

Featured Posts