Evaluate AI agents systematically with Agent-EvalKit
Amazon Web Services released Agent-EvalKit, an open-source toolkit under Apache 2.0, to systematically evaluate AI agents by tracing their full execution paths rather than relying solely on output-lev…