13 Nov 2024 02:41 PM
Hi Community,
After upgrading to OA v1.303, I have several hosts and processes running on them with connectivity issues. Are there others with the same behavior?
I see that the latest release includes several updates related to the Infrastructure Observability | Hosts category. And some networking issues have been resolved:
After reading the lasts updates to last version of OA deployed. I think the issues that have occurred now and not before on some hosts could be related to this update.
Solved! Go to Solution.
14 Nov 2024 12:17 PM
Hi, We have the same problem yesterday over kubernetes platforms. Big problem.
We open a ticket but no answer at this time.
Juan
14 Nov 2024 01:52 PM
Hi,
The affected metric is TCP connection timeout which is causing the activation of a problem that apparently does not really exist. But on Windows platforms we have detected it in the TCP connection refused metric. This has been happening since yesterday after the update to 1.303.
Regards
14 Nov 2024 07:01 PM
thanks @jcurbina I also create a ticket, I keep you posted here of any advance.
14 Nov 2024 10:25 PM
Thank you very much Daniel, so far we have no feedback on our ticket. What we have done is to disable the anomaly detection for TCP connectivity since from the information we have had it is only a false positive.
We will be attentive to any news. Thank you very much. A couple of examples:
15 Nov 2024 06:40 PM
Support has informed us that this is happening to several customers and is under investigation.
15 Nov 2024 07:28 PM
Hi,
After seeing this, i just check 3 client and this is the
updated to 1.303
another client updated
another without the update - 1.295
but in my case in any of them a problem was arise. so we didn't notice until now.
15 Nov 2024 07:41 PM
Hi Ruben,
The problem has been quite strange, because it has not been massive, it occurs in some Hosts and in some clients, there are even similar hosts but it has not had the same behavior. In some cases it has been immediate after a restart, it may be that after the process restart the problem is activated.
In one Windows case the affected metric was TCP connection refused and it generated TCP connectivity type problems in IIS app pools.
16 Nov 2024 12:25 AM
@Mohamed_Hamdy I think we faced the same problem.
19 Nov 2024 08:54 AM
Hello everyone,
We're in the process of fixing this issue. I'll post here when the changes will go live.
20 Nov 2024 07:01 AM
I've seen in the Jira ticket that the fix went live with the newest OneAgent update, and the issue was resolved 😊
20 Nov 2024 01:27 PM
Hello Maciej Neumann, could you tell us the date when the version with the correction will be released?
21 Nov 2024 06:04 PM
Hey jpinto! OneAgent version 1.303.50+ has the fix for this issue and should be rolling out now to environments. If you're still not seeing it in your environment, please reach out via in-product assistance (live chat) or make a support ticket to have our teams push that version to your tenant(s). Thanks!
20 Nov 2024 10:24 PM
Hi Macie,
At one of our customers we have already verified that 15 updated Linux hosts have fixed the problem.
20 Nov 2024 02:53 PM
Hi Maciej
i can see a difference since 1.303.50.20241118-133432 ..
I notice a change after the update, thanks
20 Nov 2024 05:16 PM
Hi Ruben,
At this point, we have not had the opportunity to update any customers or check a case. We hope to be able to check the fix as soon as possible. But this is good news. Thanks for the feedback.
21 Nov 2024 03:16 PM
Buenos días a todos, en mi caso, a pesar de actualizar a la version 1.303.50 continuo con el problema A partir de la actualización de la version 1.303.42 estoy con algunos equipos que la métrica de Availability paso del 100% al 83%
21 Nov 2024 06:30 PM
Linux or Windows? Our validation was done on Linux platforms only.
21 Nov 2024 08:56 PM
En este caso ambos contenedores son Linux - Alpine Linux (kernel 5.10.226-214.880.amzn2.x86_64)