I'm trying to get a list of services that have a high number of calls to child-services. The data explorer seemed like the best place to do this, but I'm getting confusing results.
As a test to make sure we're using the right metrics....
Using a multi-dimensional view for a specific service, I'm seeing the Number of calls to other services metric report the following:
90th percentile: 2
Good so far, we know there are some higher outliers...
When I open a data explorer and filter to the same service, same time period:
This is using the builtin:service.nonDbChildCallCount metric, which is what we get when searching for "Number of calls to other services."
What's going on here?
Also, if we create a calculated service metric using "Number of calls to other services", no splittings or dimensions, and add that to the Data Explorer, it shows the correct numbers we see in the MDA.
Is there a bug with the "Number of calls to other services" built-in metric in the data explorer?
Solved! Go to Solution.
That's because the data source are different for both.
Unlike multidimensional analysis, the Data explorer uses metrics data, not session storage, so values on custom charts might differ from values in a multidimensional analysis.
@mdouds in the MDA - it's per request. The metric is a SUM of the child calls per chart resolution (lowest being one minute).
So you are comparing number of child calls per request vs. number of child calls per minute that's why you have such odd numbers. Just use the code level tab in data explorer and divide the metric builtin:service.nonDbChildCallCount with number builtin:service.requestCount.total to get number of calls per request.
@mdouds no this is not possible. It will still be a metric and you can't use aggregations there.
However, you can use the global MDA view to find such services, then focus on them and find the requests with excessive calls. You don't have to go to the MDA for each individual service.