Skip to content

Feautre/make question agent great again#152

Merged
bernomone merged 7 commits into
mainfrom
feautre/make-question-agent-great-again
Jan 31, 2026
Merged

Feautre/make question agent great again#152
bernomone merged 7 commits into
mainfrom
feautre/make-question-agent-great-again

Conversation

@bernomone
Copy link
Copy Markdown
Collaborator

@bernomone bernomone commented Jan 25, 2026

ho aggiunto logfire anche alle evals così si vede meglio che succede. Per ora solo claude haiku 4.5 segue le istruzioni:

===== Model: claude-aws-bedrock =====

Running question evals...
Evaluating case: Who solved Fermat's last theorem?
Evaluating case: In which experimental framework did AlphaFold2 demonstrate high capability in predicting protein structure?
Evaluating case: In which year did AlexNet come out?
Evaluating case: What percentage of DNA has been found to be shared between Sapiens and Neandertals?
Total cases: 4
✅ Passed: 4

Nova Lite 2 fa chiamate infinite ai tools. Ho messo un limite a 20 dopodiché da errore, altrimenti uno va rovinato

Copy link
Copy Markdown
Owner

@martinapugliese martinapugliese left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

vai mergiamo?

@bernomone bernomone marked this pull request as ready for review January 31, 2026 15:40
@bernomone bernomone merged commit ab6eb12 into main Jan 31, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants