keyword-metrics

Here are 2 public repositories matching this topic...

🔍 Run efficient evaluations for prompt and LLM regression testing with this lightweight, secret-free evaluation harness.

A small **LLM evaluation harness** designed as an educational and portfolio project.

python nlp testing benchmark machine-learning evaluation ai-safety mlops llm prompt-engineering llm-evaluation keyword-metrics
