Hello, thank you for the great work.
I would like to reproduce the results in your paper, but I couldn't find the hyperparameter settings used for evaluation.
For example, for evaluation on the GSM8K test set, what are the specific output length, number of diffusion steps, unmasking strategy, and system prompt construction?
I noticed that LLaMA-Factory/examples/inference/llama2_full_ddm-gsm-inf.yaml specifies these values, but I'm not sure how to use this .yaml file.
Could you tell me which script or command to run so that evaluation follows this YAML configuration?
Also, may I ask how much time and how many GPUs were required to produce the GSM8K fine-tuned checkpoint?
