This repository contains the codebase and resources for “YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents”
Create config/config.yaml:
paths:
proj_store: "/data/yield"
models: "/data/models"proj_store: root directory for datasets, outputs, and experimentsmodels: directory containing downloaded base models
- For generating data, use the pipeline in the
yield/dataset/folder. - For computing factual novelty, use the pipeline in the
yield/factual_novelty/folder.
If you are not constructing the dataset, then download YIELD and place it under proj_store/data, e.g.:
proj_store/data/
├── yield/
├── yield-finetuning/
Download base models from Hugging Face:
- LLama: https://huggingface.co/meta-llama
- DeepSeek: https://huggingface.co/deepseek-ai
Supported models:
meta-llama/Llama-3.1-8B-Instructmeta-llama/Llama-3.2-3B-Instructdeepseek-ai/DeepSeek-R1-Distill-Llama-8B
The main supervised fine-tuning script is yield/experiments/supervised_finetuning.py. Run this file from the root folder. Training setup must be changing depending to your available resources.
Example usage:
accelerate launch --config_file config/accelerate_config.yaml ./yield/experiments/supervised_finetuning.py --model_choice meta-llama/Llama-3.1-8B-Instruct --dataset_choice yield_v1_finetuning
The main supervised fine-tuning script is yield/experiments/agent_llama.py. Run this file from the root folder. Training setup must be changing depending to your available resources. The script is configured to use accelerate.
Example usage:
With DeepSpeed
accelerate launch --deepspeed_config_file config/deepspeed.json yield/experiments/agent_llama.py --model_choice meta-llama/Llama-3.1-8B-Instruct --dataset_choice yield_v1_factualnovelty_rl
No DeepSpeed
accelerate launch yield/experiments/agent_llama.py --model_choice meta-llama/Llama-3.1-8B-Instruct --dataset_choice yield_v1_factualnovelty_rl
To generate utterances from the models and performing evaluation, use scripts in the yield/evaluation/ folder. The scripts use the companion Elicitation package:
pip install elicitation
Available metrics:
- Conformity
- Progression
- Turn-Length Ratio
If you use this resource in your projects, please cite the following paper.
@misc{De_Lima_YIELD_A_Large-Scale_2026,
author = {De Lima, Victor and Yang, Grace Hui},
doi = {10.48550/arXiv.2604.10968},
title = {{YIELD: A Large-Scale Dataset and Evaluation Framework for Information Elicitation Agents}},
url = {https://arxiv.org/abs/2604.10968},
year = {2026}
}