cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

AIX Host OneAgent v 1.257.250 / 1.261.201 : host metric gap (no data)

gilles_tabary
Advisor

Hello

We are experiencing Host metrics gaps (no data) from OneAgent on AIX ver. 7.2 TL 5 SP 5 PowerPC 64-bit. Observed on Managed with OneAgent v 1.257.250.20230202-090730 and v 1.261.201.20230315-145944. Like CPU, disk space, disk throughput, memory RAM, process availability...

AIX_OneAgent_host_metrics_gap.png

This render the monitoring on theses hosts 100% useless.

Any one experiencing this ? Any idea ?

Regards.

For the record : already in touch with support : 
https://one.dynatrace.com/hc/en-us/requests/152458
https://one.dynatrace.com/hc/en-us/requests/159914

 

 

4 REPLIES 4

Romanenkov_Al3x
DynaMight Champion
DynaMight Champion

Hello. Sometimes gaps point that there is network connectivity issues between OneAgent & ActiveGate or Dyntrace Cluster Node.

default log location ruxitagent_host_*:

/var/log/dynatrace/oneagent/os/*

Example String for search:

info [comm ] URL https: //172.172.172.172:9999/communication not working (Connection timed out after 2000 milliseconds) (occurred X times in the last 1h 0m 0s)

 

In any case, contacting support is a good option. 

 

Regards, 

Alex Romanenkov

DT_NGINX_FORCE_UNKNOWN_VERSION_INSTRUMENTATION=1

Hi,

Totally agree with @Romanenkov_Al3x

You can also read troubleshoot monitoring interruptions and run OneAgent diagnostics, it will be helpful if you contact Dynatrace ONE/support.

Best regards

❤️ Emacs ❤️ Vim ❤️ Bash ❤️ Perl

gilles_tabary
Advisor

Turns out DB monitoring tool AIX ITM Tivoli GSMA DB2 (among which kuddb2 (/opt/IBM/ITM/aix526/ud/bin/kuddb2) ) seams to be slowing down drastically calls to perfstat_process_util, making OneAgent calls to this utility very slow leading to metrics gaps. I am not sure about what is the root cause kuddb2 or oneagent 

We uninstalled the Tivoli thing and tadaa: at once problem gone.

Julius_Loman
DynaMight Legend
DynaMight Legend

@gilles_tabary  there can be any reason for that. If you download the support archive, check the logs for the OneAgent (os agent) and there will be timings about metrics collection. From that you can focus on the area which takes longer than expected and results into monitoring gaps.

Certified Dynatrace Master | Alanata a.s., Slovakia, Dynatrace Master Partner

Featured Posts