YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents

This repository contains the codebase and resources for “YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents”

Dataset & Tools

Setup

Create config/config.yaml:

paths:
  proj_store: "/data/yield"
  models: "/data/models"

proj_store: root directory for datasets, outputs, and experiments
models: directory containing downloaded base models

Dataset Construction

For generating data, use the pipeline in the yield/dataset/ folder.
For computing factual novelty, use the pipeline in the yield/factual_novelty/ folder.

Model Training

Dataset

If you are not constructing the dataset, then download YIELD and place it under proj_store/data, e.g.:

proj_store/data/
  ├── yield/
  ├── yield-finetuning/

Models

Download base models from Hugging Face:

LLama: https://huggingface.co/meta-llama
DeepSeek: https://huggingface.co/deepseek-ai

Supported models:

meta-llama/Llama-3.1-8B-Instruct
meta-llama/Llama-3.2-3B-Instruct
deepseek-ai/DeepSeek-R1-Distill-Llama-8B

SFT

The main supervised fine-tuning script is yield/experiments/supervised_finetuning.py. Run this file from the root folder. Training setup must be changing depending to your available resources.

Example usage:

accelerate launch --config_file config/accelerate_config.yaml  ./yield/experiments/supervised_finetuning.py --model_choice meta-llama/Llama-3.1-8B-Instruct --dataset_choice yield_v1_finetuning

ORL

The main supervised fine-tuning script is yield/experiments/agent_llama.py. Run this file from the root folder. Training setup must be changing depending to your available resources. The script is configured to use accelerate.

Example usage:

With DeepSpeed

accelerate launch --deepspeed_config_file config/deepspeed.json   yield/experiments/agent_llama.py   --model_choice meta-llama/Llama-3.1-8B-Instruct   --dataset_choice yield_v1_factualnovelty_rl

No DeepSpeed

accelerate launch yield/experiments/agent_llama.py   --model_choice meta-llama/Llama-3.1-8B-Instruct   --dataset_choice yield_v1_factualnovelty_rl

Evaluating Models

To generate utterances from the models and performing evaluation, use scripts in the yield/evaluation/ folder. The scripts use the companion Elicitation package:

pip install elicitation

Available metrics:

Conformity
Progression
Turn-Length Ratio

Documentation

Citing YIELD

If you use this resource in your projects, please cite the following paper.

@misc{De_Lima_YIELD_A_Large-Scale_2026,
author = {De Lima, Victor and Yang, Grace Hui},
doi = {10.48550/arXiv.2604.10968},
title = {{YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents}},
url = {https://arxiv.org/abs/2604.10968},
year = {2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
config		config
docs		docs
yield		yield
.gitignore		.gitignore
CITATION.CFF		CITATION.CFF
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents

Dataset & Tools

Setup

Dataset Construction

Model Training

Dataset

Models

SFT

ORL

Evaluating Models

Documentation

Citing YIELD

About

Uh oh!

Releases

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents

Dataset & Tools

Setup

Dataset Construction

Model Training

Dataset

Models

SFT

ORL

Evaluating Models

Documentation

Citing YIELD

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Contributors

Uh oh!

Languages