dt-evals: an open source continuous evaluation tool for LLM apps in Dynatrace AI Observability
TL;DR: We open-sourced dt-evals, a CLI toolkit for evaluating LLM and agent quality using real GenAI traces in Dynatrace AI Observability. Run evaluations on live or recent gen_ai.* spans, score respo...