Skip to content

LounesMD/LLaDA_Arithmetic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLaDA: From Text Generation to Arithmetic Operations

In this repository, we provide a complete implementation of the training and inference pipeline for LLaDA, applied to arithmetic operations and sorting tasks.

Installation

Install the dependencies:

conda env create -f environment.yaml
conda activate llm_project

Run the project

To run train and run LLaDA:

python src/main.py --method llada --tokenizer group_pad --num_epochs 5 --number_bits 20 --device cpu --data_size 64000 --batch_size 32 --learning_rate 5e-4 --seq_length 21

To train the model using Kaggle's GPU, ensure you have a Kaggle account and API key, adapt the kaggle/kernel-metadata.json file to your Kaggle username, and run:

kaggle kernels push -p kaggle/

Acknowledgements

We would like to thank:

Contact and Contributors

This project is conducted by: Nicolas Sereyjol-Garros, Tom Ravaud, Christopher Marouani, and Lounès Meddahi.

About

LLaDA implementation

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages