04 May 2023 04:36 PM - edited 17 Aug 2023 08:07 AM
Hello
We are seeing gaps (no data) in host metrics reported by OneAgent on AIX 7.2 TL 5 SP 5 (PowerPC 64-bit). This is on Dynatrace Managed, observed with OneAgent v 1.257.250.20230202-090730 and v 1.261.201.20230315-145944. Affected metrics include CPU, disk space, disk throughput, memory (RAM), process availability...
This renders the monitoring on these hosts 100% useless.
Is anyone else experiencing this? Any ideas?
Regards.
For the record, we are already in touch with support:
https://one.dynatrace.com/hc/en-us/requests/152458
https://one.dynatrace.com/hc/en-us/requests/159914
05 May 2023 07:36 AM - last edited on 09 May 2023 09:58 AM by MaciejNeumann
Hello. Gaps like these sometimes indicate network connectivity issues between OneAgent and the ActiveGate or the Dynatrace Cluster Node.
Default log location (ruxitagent_host_* files):
/var/log/dynatrace/oneagent/os/*
Example string to search for:
info [comm ] URL https://172.172.172.172:9999/communication not working (Connection timed out after 2000 milliseconds) (occurred X times in the last 1h 0m 0s)
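If you would rather not grep by hand, here is a minimal Python sketch of that search. It assumes the log directory above is readable and that communication failures look like the example line (a `[comm ]` tag followed by "not working"); the exact wording may differ between OneAgent versions. The function names `find_comm_errors` and `scan_logs` are made up for this example and are not part of any Dynatrace tooling.

```python
import glob
import re

# Pattern derived from the example log line above (assumption: other
# OneAgent versions may word this message differently).
COMM_ERROR = re.compile(r"\[comm\s*\].*not working")


def find_comm_errors(lines):
    """Return the lines that look like OneAgent communication failures."""
    return [line.rstrip("\n") for line in lines if COMM_ERROR.search(line)]


def scan_logs(pattern="/var/log/dynatrace/oneagent/os/*"):
    """Scan every file matching the glob; map file path -> matching lines."""
    hits = {}
    for path in sorted(glob.glob(pattern)):
        try:
            with open(path, errors="replace") as handle:
                matches = find_comm_errors(handle)
        except OSError:
            continue  # directory or unreadable file: skip it
        if matches:
            hits[path] = matches
    return hits


if __name__ == "__main__":
    for path, matches in scan_logs().items():
        print(f"{path}: {len(matches)} communication error line(s)")
        for line in matches[-5:]:  # show only the most recent few
            print("  " + line)
```

If this prints nothing, connectivity is probably not the cause and the gaps come from somewhere else.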
In any case, contacting support is a good option.
Regards,
Alex Romanenkov
06 May 2023 10:06 AM - edited 06 May 2023 10:06 AM
Hi,
Totally agree with @Romanenkov_Al3x.
You can also read "Troubleshoot monitoring interruptions" and run OneAgent diagnostics; the results will be helpful if you contact Dynatrace ONE/support.
Best regards
24 May 2023 08:55 AM
It turns out that the DB monitoring tool AIX ITM Tivoli GSMA DB2 (including kuddb2, /opt/IBM/ITM/aix526/ud/bin/kuddb2) seems to drastically slow down calls to perfstat_process_util. This makes OneAgent's own calls to it very slow, which leads to the metrics gaps. I am not sure whether the root cause lies in kuddb2 or in OneAgent.
We uninstalled the Tivoli tool and ta-da: the problem was gone at once.
24 May 2023 08:53 PM
@gilles_tabary there can be many reasons for that. If you download the support archive and check the OneAgent (OS agent) logs, you will find timings for metrics collection. From those you can focus on the area that takes longer than expected and results in monitoring gaps.