Skip to content

WIP: Package Generalization#27

Open
duncaneddy wants to merge 3 commits intomainfrom
de/generalization
Open

WIP: Package Generalization#27
duncaneddy wants to merge 3 commits intomainfrom
de/generalization

Conversation

@duncaneddy
Copy link
Contributor

Pull Request

Description

This PR is a work in progress on laying the ground work for generalizing the ASTRA-RL package for language model evaluation. It seeks to make sure we can address multiple different types of LM evaluations from running simple prompt-response evaluations to full-on adversarial red-teaming. It tries to make the scoring system more flexible and move to a more dynamic system with user-defined scoring.

Added

  • Some

Changed

  • Lots

Fixed

  • N/A

Removed

  • Need to write

Note to Reviewers

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant