Releases: agentevals-dev/agentevals
Releases · agentevals-dev/agentevals
v0.6.4
What's Changed
- (docs): Add k8s example by @krisztianfekete in #86
- Expose all tool_trajectory_avg_score match types by @ossama-ferjani in #89
- Add zero code Ollama example by @ajimenez1503 in #91
- npm audit fix by @krisztianfekete in #103
- Improve MCP server by @krisztianfekete in #104
- Update README by @krisztianfekete in #105
- gRPC OTel Receiver support by @ajimenez1503 in #98
- Extend SDK with perf metrics by @krisztianfekete in #108
- Support OTel gen_ai semconv v1.40 by @krisztianfekete in #109
- Create StreamingTraceManager once by @ajimenez1503 in #107
New Contributors
- @ossama-ferjani made their first contribution in #89
- @ajimenez1503 made their first contribution in #91
Full Changelog: v0.6.3...v0.6.4
v0.6.3
What's Changed
- Fix otlp session fragmentation by @krisztianfekete in #83
- fix dropping invocation scores and details and UI by @krisztianfekete in #85
Full Changelog: v0.6.2...v0.6.3
v0.6.2
v0.6.1
What's Changed
- Consolidate trace-to-invocation conversion by @krisztianfekete in #76
- Replace _trace_manager variables with FastAPI dependency injection by @krisztianfekete in #78
- Consolidate text extraction into a single source of truth by @krisztianfekete in #79
- Consolidate duplicated tool call extractions by @krisztianfekete in #80
- add OpenAI Agents SDK zero-code example by @shahar-dagan in #72
- agentevals docker build and helm deploy by @esara in #66
New Contributors
- @shahar-dagan made their first contribution in #72
- @esara made their first contribution in #66
Full Changelog: v0.6.0...v0.6.1
v0.6.0
What's Changed
- Updating the readme.md file by @ProfessorSeb in #64
- Readme add overview sections by @ProfessorSeb in #63
- Add venv support for custom evals by @krisztianfekete in #65
- Add openai_eval type to delegate evals to OpenAI APIs by @krisztianfekete in #73
- document OpenAI Graders by @krisztianfekete in #74
- add per eval time metrics to CLI by @krisztianfekete in #75
New Contributors
- @ProfessorSeb made their first contribution in #64
v0.5.3
v0.5.2
v0.5.1
What's Changed
- Fix metrics panel render problem by @krisztianfekete in #56
Full Changelog: v0.5.0...v0.5.1
v0.5.0
What's Changed
- Fix metadata extraction and UI elements for eval results by @krisztianfekete in #35
- Add inline tool call results to conversation transcript views by @krisztianfekete in #36
- add support for custom graders (eval metrics) by @peterj in #33
- add ci by @peterj in #37
- Cleanup server startup by @krisztianfekete in #50
- Add ADK zero code example by @krisztianfekete in #51
- Update docs on OTel best practices by @krisztianfekete in #52
- Add responsible AI usage guidelines by @krisztianfekete in #53
- Update docs with custom evaluators and hand crafting evalsets by @krisztianfekete in #54
- Update README and docs by @krisztianfekete in #55
New Contributors
Full Changelog: v0.4.1...v0.5.0
v0.4.1
What's Changed
- API cleanup by @krisztianfekete in #27
- Add integration and e2e test suites by @krisztianfekete in #29
- Fix npm cve in flatted by @krisztianfekete in #30
- Fix incorrect span groupping on UI by @krisztianfekete in #31
- Fix conversation element ordering issue with batching by @krisztianfekete in #32
Full Changelog: v0.4.0...v0.4.1