IndexGuc chenmoacr

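Mechanistic interpretability · Human visual attention · Independent researcher. Reverse-engineering large models to explore unknown routes to improving their capabilities. Building on the past to open the way forward.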

Pinned

amr_wtf Public

GHOST (Ghost-Host Output Steering Toolkit): a toolkit that extracts and applies behavior vectors from language models by swapping the model's own answers for teacher answers.

Python
ScanThink Public

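This work presents ScanThink, a framework that uses native human eye-movement fixation points and scanpaths to guide an existing image model in image classification.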

Python
amr_honesty Public

A technique for quantifying a model's internal self-confidence: a 100K-parameter prefix module that lets a frozen Qwen express its own internal uncertainty (read-out, not retraining).

Python 2
AMR_ReplaceNeuron Public

Neuron editing and recycling. Two ways to "replace" a neuron's behaviour in a frozen Gemma 4 E2B-it: a catgirl single-column rewrite and a LeetCode 233 LoRA/delta-W.

Python 1
neuron_transplant Public

Transplants neurons from Gemma 4 31B into E2B.

Python
amr_CrabStep Public

CrabStep (螃蟹步, "crab step"): a shiftable technique for single-sample fine-tuning and vertical-domain fine-tuning.

Python