
Learnable Concept-Based Language Model

Project overview

  1. Embedding extraction
    1. Tokenize dataset
    2. Pass tokenized dataset through backbone LLM
    3. Save the last-layer embeddings to disk (see the embedding-extraction sketch after this list)
  2. Baseline classifiers
    1. Finetune original LLM's head on current dataset
    2. Train linear classifier from scratch on current dataset
  3. Sparse AutoEncoder
    1. Train the SAE on the backbone LLM's embeddings (see the SAE sketch after this list)
    2. Train linear classifier from SAE's latent space to do next-token prediction
    3. (optional) Train linear classifier from SAE's reconstruction to do next-token prediction
  4. Learnable Concept-Based Language Model
    1. Train LCBLM on the backbone LLM's embeddings
    2. Train linear classifier from LCBLM's latent space to do next-token prediction
    3. (optional) Train linear classifier from LCBLM's reconstruction to do next-token prediction
  5. Evaluation metrics
    1. Perplexity: generate sentences open-endedly with each of the methods listed below, then evaluate the perplexity of a third-party LLM on those sentences (see the perplexity sketch after this list)
      • Backbone LLM + original head
      • Backbone LLM + finetuned head
      • Backbone LLM + new head
      • Backbone LLM + SAE latents + new head
      • Backbone LLM + SAE recon + original head
      • Random text
      • (optional) Backbone LLM + SAE recon + new head
      • Backbone LLM + LCBLM latents + new head
      • Backbone LLM + LCBLM recon + original head
      • (optional) Backbone LLM + LCBLM recon + new head
    2. Concept labelling: do SAE and LCBLM latents correspond to human-understandable concepts?
      • SAE
      • LCBLM
    3. Intervenability: does turning off or amplifying certain latents affect the generated sentences in the expected way?
    • Train all models with at least 3 different seeds
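
A minimal sketch of step 1 (embedding extraction), assuming a Hugging Face backbone; the model name ("gpt2"), the example texts, and the output path are placeholders rather than the project's actual choices:

```python
# Sketch of step 1: tokenize, run the backbone, keep the last-layer embeddings.
# "gpt2", the example texts, and the output path are placeholders.
import torch
from transformers import AutoModel, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = AutoTokenizer.from_pretrained("gpt2")
backbone = AutoModel.from_pretrained("gpt2").to(device).eval()

texts = ["An example sentence.", "Another example sentence."]  # placeholder dataset

embeddings = []
with torch.no_grad():
    for text in texts:
        tokens = tokenizer(text, return_tensors="pt").to(device)
        output = backbone(**tokens, output_hidden_states=True)
        # hidden_states[-1] holds the embeddings after the last layer,
        # one vector per token: shape (seq_len, d_model) after squeezing.
        embeddings.append(output.hidden_states[-1].squeeze(0).cpu())

torch.save(embeddings, "embeddings_last_layer.pt")  # step 1.3: save to disk
```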
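
A minimal sketch of step 3.1, a sparse autoencoder trained on the backbone embeddings; the dimensions, L1 penalty, and optimizer settings are illustrative assumptions, not the project's actual values. The linear classifier of step 3.2 would then be a single `nn.Linear` from the latent dimension to the vocabulary size, trained with cross-entropy on the latents `z`.

```python
# Sketch of step 3.1: a sparse autoencoder over backbone embeddings.
# d_model, d_latent, lr, and the L1 weight are illustrative values.
import torch
import torch.nn as nn

class SparseAutoEncoder(nn.Module):
    def __init__(self, d_model: int, d_latent: int):
        super().__init__()
        self.encoder = nn.Linear(d_model, d_latent)
        self.decoder = nn.Linear(d_latent, d_model)

    def forward(self, x):
        z = torch.relu(self.encoder(x))  # non-negative, sparsity-encouraged latents
        recon = self.decoder(z)          # reconstruction of the input embedding
        return z, recon

sae = SparseAutoEncoder(d_model=768, d_latent=4096)
optimizer = torch.optim.Adam(sae.parameters(), lr=1e-4)
l1_weight = 1e-3

x = torch.randn(32, 768)  # one batch of saved backbone embeddings
z, recon = sae(x)
loss = nn.functional.mse_loss(recon, x) + l1_weight * z.abs().mean()  # recon + sparsity
loss.backward()
optimizer.step()
```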
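
A minimal sketch of the perplexity metric (step 5.1): score generated sentences with a third-party judge LLM. The judge model ("gpt2-large") and the example sentence are placeholders.

```python
# Sketch of step 5.1: perplexity of a third-party judge LLM on generated text.
# "gpt2-large" as the judge model is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

judge_tokenizer = AutoTokenizer.from_pretrained("gpt2-large")
judge = AutoModelForCausalLM.from_pretrained("gpt2-large").eval()

def perplexity(sentence: str) -> float:
    tokens = judge_tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        # With labels == input_ids the model returns the mean token cross-entropy.
        loss = judge(**tokens, labels=tokens["input_ids"]).loss
    return torch.exp(loss).item()

print(perplexity("A sentence generated by one of the pipelines above."))
```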

Next steps

  1. Create a config.toml file where the user picks the dataset, the backbone LLM, and all other project parameters (see the config sketch after this list)
  2. Use a retry pattern on LLM concept annotations (see the retry sketch after this list)
  3. Generalize to different datasets
  4. Generalize to different backbone LLMs
  5. Use the embeddings of different layers of the backbone LLM
  6. Perform random search for hyperparameter tuning (see ML07 page 60)
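
A minimal sketch of reading the planned config.toml (next step 1) with the standard-library tomllib (Python 3.11+); every key name below is hypothetical.

```python
# Sketch of next step 1: load user-chosen settings from config.toml.
# All key names here are hypothetical placeholders.
import tomllib

with open("config.toml", "rb") as f:
    config = tomllib.load(f)

dataset_name = config["data"]["dataset"]      # which dataset to tokenize
backbone_name = config["model"]["backbone"]   # which backbone LLM to load
seeds = config["training"]["seeds"]           # e.g. [0, 1, 2] for the 3-seed runs
```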
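
A minimal sketch of the retry pattern for LLM concept annotations (next step 2); `annotate` stands in for whatever callable performs the actual annotation request and is purely hypothetical.

```python
# Sketch of next step 2: retry LLM concept annotations with exponential backoff.
# `annotate` is whatever callable performs the actual LLM annotation request.
import time
from typing import Callable

def annotate_with_retry(annotate: Callable, latent_examples,
                        max_attempts: int = 3, backoff: float = 2.0):
    for attempt in range(1, max_attempts + 1):
        try:
            return annotate(latent_examples)  # may raise on API or parsing errors
        except Exception:
            if attempt == max_attempts:
                raise  # give up after the last attempt
            time.sleep(backoff ** attempt)  # wait longer after each failure
```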

Important notes

  1. Saving the SAE's latent space to disk in dense format is a cumbersome, manual process, while saving it in sparse format makes all downstream tasks unfeasibly slow. To avoid both problems, do not save the latent space to disk; instead, pass the backbone LLM's embeddings through the SAE whenever the latents are needed (see the sketch below). This adds a small computational overhead to every downstream task, but it is negligible compared to the alternatives.
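
A minimal sketch of this on-the-fly approach, assuming an SAE whose forward pass returns `(latents, reconstruction)` as in the SAE sketch above; the embedding file path is a placeholder.

```python
# Sketch of the on-the-fly approach: encode saved backbone embeddings through
# the SAE at access time instead of storing the latent space on disk.
import torch
from torch.utils.data import Dataset

class OnTheFlyLatents(Dataset):
    def __init__(self, embedding_path: str, sae: torch.nn.Module):
        self.embeddings = torch.load(embedding_path)  # list of per-token tensors
        self.sae = sae.eval()

    def __len__(self):
        return len(self.embeddings)

    def __getitem__(self, idx):
        with torch.no_grad():
            latents, _ = self.sae(self.embeddings[idx])  # computed only when needed
        return latents
```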
