Chi-Shan0707

💭

Leaving the world better than I found it.

Yuhan Chi Chi-Shan0707


I have no idea what happened / but now I am not the same.

32 followers · 83 following

Chi-Shan0707/README.md

Stochasticity makes algorithms more robust. The same goes for humans.
Hence, I shall embrace the uncertainty.

🔗 github-unflag-playbook-cn

Pinned

TinyLoRA-GRPO-Coder (Public)

Inspired by "Learning to Reason in 13 Parameters", this project uses TinyLoRA + GRPO (32 parameters) to fine-tune Qwen2.5-Coder-3B-Instruct (or other models) for competitive programming.

Python · 22 stars · 3 forks
Qwen4Luogu-RL (Public)

This repo works, but I have continued development in a new repo. See https://github.com/Chi-Shan0707/TinyLoRA-Qwen-Coder for more.

Python · 8 stars
github-unflag-playbook-cn (Public)

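GitHub Unflag Playbook CN: a self-rescue handbook and record of existence written for developers in mainland China. If this document helps you, could you please leave a star~(￣▽￣)~*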

5 stars
microgpt.cpp (Public)

microgpt.cpp in 300 lines!

C++ · 4 stars · 2 forks
FDUGuideBook/nav-site (Public)

A navigation site for Fudan University information resources

15 stars · 1 fork
Cot-Knot (Public)

COT Knot

Python