Agent Action Controller

Layout

agent-action-controller/
  classification/
    01_csv_dataset_creation.py
    02_csv_dataset_EDA.py
    03_cls_zero_shot.py
    04_cls_train_lora_sft.py
    04b_cls_train_lora_sft_binary.py
    05_cls_train_lora_seqcls.py
    06_binary_ensemble_eval.py
    06_cls_train_probe.py
    07_cls_ml_baseline.py
    08_cls_compare.py
    08_cls_inference_benchmark.py
    09_cls_encoder.py
    cls_utils.py
    generative_utils.py

  e2e_search/
    agent_env.py
    build_corpus.py
    run_agent.py
    run_pipeline.py
    evaluate.py
    compare.py
    compare_controller.py
    compare_fixed_generator.py
    analyze_repeated_e2e.py
    e2e_utils.py
    retriever.py
    openrouter_client.py

Workflow

The classification/ directory contains the trajectory-action classification workflow:

01_csv_dataset_creation.py: build the CSV dataset from trace files.
02_csv_dataset_EDA.py: compute dataset statistics and exploratory analysis.
03_cls_zero_shot.py: run zero-shot generative classification.
04_cls_train_lora_sft.py: train multiclass LoRA SFT models.
04b_cls_train_lora_sft_binary.py: train binary LoRA SFT models.
05_cls_train_lora_seqcls.py: train LoRA sequence-classification models.
06_binary_ensemble_eval.py: evaluate the binary ensemble setup.
06_cls_train_probe.py: train probing classifiers.
07_cls_ml_baseline.py: run classical ML baselines.
08_cls_compare.py: compare classification runs.
08_cls_inference_benchmark.py: benchmark inference.
09_cls_encoder.py: train/evaluate encoder-based classifiers.

The files cls_utils.py and generative_utils.py are local helper modules required by the classification scripts.
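
The numeric prefixes on the classification scripts suggest an execution order. The following is a minimal sketch, assuming (the README does not say so explicitly) that the prefix is a stage number and that scripts sharing a prefix, such as the two 06_ and two 08_ scripts, are independent steps at the same stage. The helper names stage_key and group_by_stage are hypothetical and not part of the repository.

```python
import re

# Script names copied from the classification/ listing in this README.
SCRIPTS = [
    "01_csv_dataset_creation.py",
    "02_csv_dataset_EDA.py",
    "03_cls_zero_shot.py",
    "04_cls_train_lora_sft.py",
    "04b_cls_train_lora_sft_binary.py",
    "05_cls_train_lora_seqcls.py",
    "06_binary_ensemble_eval.py",
    "06_cls_train_probe.py",
    "07_cls_ml_baseline.py",
    "08_cls_compare.py",
    "08_cls_inference_benchmark.py",
    "09_cls_encoder.py",
]

# Leading digits, an optional variant letter (as in "04b"), then "_".
_PREFIX = re.compile(r"^(\d+)([a-z]?)_")


def stage_key(name):
    """Return (stage number, variant letter, name) as a sort key."""
    match = _PREFIX.match(name)
    if match is None:
        raise ValueError(f"no numeric stage prefix: {name}")
    return int(match.group(1)), match.group(2), name


def group_by_stage(names):
    """Group script names by stage number, in ascending stage order."""
    stages = {}
    for name in sorted(names, key=stage_key):
        stages.setdefault(stage_key(name)[0], []).append(name)
    return stages


if __name__ == "__main__":
    # Assumption: stage numbers encode the intended run order.
    for stage, names in group_by_stage(SCRIPTS).items():
        print(f"stage {stage}: {', '.join(names)}")
```

The sketch only orders file names. It does not run the scripts, since their command-line arguments are not documented here.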

The e2e_search/ directory contains the end-to-end search-control evaluation code:

build_corpus.py: construct the retrieval corpus.
run_agent.py: run the local vLLM end-to-end agent.
run_pipeline.py: run the controller/generator pipeline variants.
evaluate.py: compute answer-quality and trajectory metrics.
compare.py: compare base and fine-tuned end-to-end runs.
compare_controller.py: compare controller-ablation runs.
compare_fixed_generator.py: compare fixed-generator controller runs.
analyze_repeated_e2e.py: aggregate repeated runs and compute uncertainty estimates.

The files agent_env.py, e2e_utils.py, retriever.py, and openrouter_client.py are local support modules used by the E2E scripts.
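
This README does not state which uncertainty estimate analyze_repeated_e2e.py computes. As an illustration of what aggregating repeated runs can look like, the sketch below uses a percentile bootstrap over per-run scores. This is one common choice and not necessarily the script's method. The function name bootstrap_mean_ci and its parameters are hypothetical.

```python
import random
import statistics


def bootstrap_mean_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Return (mean, low, high): a percentile-bootstrap interval for the mean.

    values: one scalar metric per repeated run (assumed input format).
    """
    if len(values) < 2:
        raise ValueError("need at least two runs to estimate uncertainty")
    rng = random.Random(seed)  # fixed seed keeps the interval reproducible
    n = len(values)
    # Resample the runs with replacement and record each resample's mean.
    means = sorted(
        statistics.fmean(rng.choices(values, k=n)) for _ in range(n_resamples)
    )
    # Take the alpha/2 and 1 - alpha/2 percentiles of the resampled means.
    low = means[int((alpha / 2) * n_resamples)]
    high = means[min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))]
    return statistics.fmean(values), low, high


if __name__ == "__main__":
    # Placeholder scores for illustration only, not results from this project.
    example_scores = [0.50, 0.60, 0.70, 0.55, 0.65]
    mean, low, high = bootstrap_mean_ci(example_scores)
    print(f"mean={mean:.3f}, 95% CI=[{low:.3f}, {high:.3f}]")
```

With only a handful of repeated runs the bootstrap interval is coarse, so it is best read as a rough indication of run-to-run spread.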
