WIP: Package Generalization by duncaneddy · Pull Request #27 · sisl/astra-rl

duncaneddy · 2025-10-13T18:07:03Z

Pull Request

Description

This PR is a work in progress on laying the ground work for generalizing the ASTRA-RL package for language model evaluation. It seeks to make sure we can address multiple different types of LM evaluations from running simple prompt-response evaluations to full-on adversarial red-teaming. It tries to make the scoring system more flexible and move to a more dynamic system with user-defined scoring.

Added

Some

Changed

Lots

Fixed

N/A

Removed

Need to write

Note to Reviewers

duncaneddy added 3 commits October 6, 2025 16:28

Rename probe to utterance

d155826

WIP: Commit generalization

8971d24

Rename utterance to challenge everywhere

1d7c29e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Package Generalization#27

WIP: Package Generalization#27
duncaneddy wants to merge 3 commits intomainfrom
de/generalization

duncaneddy commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

duncaneddy commented Oct 13, 2025

Pull Request

Description

Added

Changed

Fixed

Removed

Note to Reviewers

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant