# Issue Run the experiment to evaluate the model ## ToDo - [ ] Run evaluation for #89 - [ ] [Optional] (8 hrs) Evaluation with the `AITGPT dataset`. - [ ] Try NLI with A6000 - [ ] Pilot Human Evaluation (6 participants)
Issue
Run the experiment to evaluate the model
ToDo
AITGPT dataset.