rango

Personalized LLM via Chameleon approach

🧠 Chameleon + PriME: Personalized & Privacy-Preserving LLM Optimization

This repository contains code, configuration, and experimental framework for a three-month research project that integrates Chameleon-style embedding personalization and PriME-style evolutionary model merging to achieve efficient, private, and high-performance LLM-based recommendation systems.

🚀 Project Overview

This project aims to:

Personalize LLMs using user-specific directions in embedding space (via Chameleon)
Optimize merged models via evolutionary algorithms (PriME)
Support multi-user inference with limited data and strong privacy constraints

📁 Project Structure

chameleon_prime_personalization/ ├── configs/ # YAML/JSON configs for models, training, evaluation ├── data/ │ ├── raw/ # Raw datasets (LaMP-2, LaMP-3, etc.) │ └── processed/ # Tokenized or embedded versions ├── models/ │ ├── base_model/ # Base LLM (e.g., LLaMA 2 7B or Mistral) │ └── user_adapters/ # LoRA/PEFT modules for users ├── notebooks/ # Experiment tracking or visualization notebooks ├── scripts/ │ ├── download_models.py │ ├── download_datasets.py │ └── setup_environment.sh └── README.md

🛠️ Setup

1. Environment

bash scripts/setup_environment.sh
source env/bin/activate
2. Download base models

python scripts/download_models.py --model mistralai/Mistral-7B-v0.1
3. Download datasets

python scripts/download_datasets.py --dataset lamp2
🧩 Method Summary
Embedding Personalization (Chameleon)
SVD-based decomposition of user history embeddings

Extracts user-specific direction vs general direction

Adjust embeddings by shifting along user-personalized vector

Evolutionary Merge (PriME)
Each user gets LoRA/IA3 fine-tuned PEFT modules

Cosine similarity used to identify similar shared users

Evolutionary strategy (CMA-ES or NSGA-II) to merge modules

Objective: maximize user utility (F1, ROUGE), minimize privacy leakage

📊 Evaluation Metrics
ROUGE / F1 / BLEU (text output quality)

Cosine similarity / KL divergence (privacy leakage)

Model diversity and memorization metrics (advanced)

## Strict Compliance Prompt Pack (LaMP-2)

**目的**: 出力を `Answer: <TAG>` の単一行に強制し、形式準拠率 ≥ 0.95 を安定達成。

### ✅ 実測
- 形式準拠率: **98.0%**（目標 95% 超）
- テスト規模: **50 samples**
- 出力形式: 単一行 `Answer: <TAG>`
- デコード制約: `temperature=0, top_p=0, max_tokens=8, stop=["\n"]`
- 厳格検証パターン: `^Answer:\s*([A-Za-z0-9_\- ]+)\s*$`

### 🔧 プロンプト
**SYSTEM**

You are a strict single-line tag classifier.

RULES (絶対遵守):

Output EXACTLY one line: Answer:
No explanations, no extra words, no punctuation after , no emojis.
NO NEWLINES. Output must be a single line only.
Choose ONE best tag from the allowed list, case-sensitive.
If uncertain, still pick the single best tag.

FORBIDDEN: • Multiple lines or trailing spaces • Any text before/after Answer:

Allowed tags: {{ALLOWED_TAGS}} Required output format: Answer:


**USER**

Task

Classify the following movie description into exactly one tag from the allowed list.

Description

User Profile (optional)

Allowed tags (pick ONE, case-sensitive)

Output constraints (絶対遵守) • Single line only. • EXACT string format: Answer: • Nothing else before or after. • No newline characters.

Your response

Answer:


### 🧪 使い方
```bash
# 基本テスト
python run_strict_compliance_test.py --samples 10

# 実データでのテスト
python run_strict_compliance_test.py --data path/to/lamp2_test.jsonl

# カスタムプロンプト
python run_strict_compliance_test.py \
  --system-prompt prompts/lamp2_system_strict.txt \
  --user-template prompts/lamp2_user_template_strict.txt \
  --target-compliance 0.95

🎉 効果

• 無関係出力の排除（例: "Source: …" 等） • LaMP-2向けに最適化された単一タグ選択 • 決定論的デコード設定で再現性担保 • ≥95% 準拠で評価の信頼性向上

📄 License This project is licensed under the Apache License 2.0. See LICENSE for details.

📚 References Chameleon: Personalized Prompt Editing for Large Language Models

PriME: Personalized Model Merging via Evolutionary Search

✍️ Acknowledgments This repository was developed for academic research on privacy-preserving personalization using large language models, with support for datasets such as LaMP-2 and LaMP-3, and tested on NVIDIA A100×2 environment.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.github/workflows		.github/workflows
artifacts/20250810_053000		artifacts/20250810_053000
assets		assets
causal_inference		causal_inference
chameleon_prime_personalization		chameleon_prime_personalization
config		config
docs		docs
embeddings		embeddings
eval		eval
faiss		faiss
graphrag_cfs_weights		graphrag_cfs_weights
graphrag_cfs_weights_strict		graphrag_cfs_weights_strict
manifold_optimization		manifold_optimization
ppr_results		ppr_results
ppr_results_user100		ppr_results_user100
processed/LaMP-2		processed/LaMP-2
production		production
prompts		prompts
rag		rag
results/verification		results/verification
runs		runs
scripts		scripts
templates		templates
tests		tests
tools		tools
utils		utils
.env_faiss310.txt		.env_faiss310.txt
.gitignore		.gitignore
.pip_freeze.txt		.pip_freeze.txt
ADAPTIVE_PIECE_FUSION_COMPLETE_REPORT.md		ADAPTIVE_PIECE_FUSION_COMPLETE_REPORT.md
BENCHMARK_USAGE.md		BENCHMARK_USAGE.md
CAUSAL_INFERENCE_PHASE1_COMPLETION_REPORT.md		CAUSAL_INFERENCE_PHASE1_COMPLETION_REPORT.md
CFS_CHAMELEON_CRITICAL_FIXES_SUMMARY.md		CFS_CHAMELEON_CRITICAL_FIXES_SUMMARY.md
CFS_CHAMELEON_ENHANCEMENT_COMPLETE_SUMMARY.md		CFS_CHAMELEON_ENHANCEMENT_COMPLETE_SUMMARY.md
CFS_CHAMELEON_FINAL_INTEGRATION_REPORT.md		CFS_CHAMELEON_FINAL_INTEGRATION_REPORT.md
CFS_CHAMELEON_IMPLEMENTATION_SUMMARY.md		CFS_CHAMELEON_IMPLEMENTATION_SUMMARY.md
CFS_CHAMELEON_PROBLEM_ANALYSIS_REPORT.md		CFS_CHAMELEON_PROBLEM_ANALYSIS_REPORT.md
CHAMELEON_BATCHED_SCORING_TECHNICAL_REPORT.md		CHAMELEON_BATCHED_SCORING_TECHNICAL_REPORT.md
CHAMELEON_FROZEN_IMPLEMENTATION_SUMMARY.md		CHAMELEON_FROZEN_IMPLEMENTATION_SUMMARY.md
CHAMELEON_PERFORMANCE_ANALYSIS_REPORT.md		CHAMELEON_PERFORMANCE_ANALYSIS_REPORT.md
CLAUDE.md		CLAUDE.md
COMPREHENSIVE_BIAS_BREAKTHROUGH_STRATEGY.md		COMPREHENSIVE_BIAS_BREAKTHROUGH_STRATEGY.md
INTERIM_BREAKTHROUGH_ANALYSIS.md		INTERIM_BREAKTHROUGH_ANALYSIS.md
LAMP2_COMPREHENSIVE_BENCHMARK_REPORT.md		LAMP2_COMPREHENSIVE_BENCHMARK_REPORT.md
LICENSE		LICENSE
PARAMETER_OPTIMIZATION_REPORT.md		PARAMETER_OPTIMIZATION_REPORT.md
PARAMETER_SWEEP_USAGE.md		PARAMETER_SWEEP_USAGE.md
PHASE3A_COMPREHENSIVE_EVALUATION_REPORT.md		PHASE3A_COMPREHENSIVE_EVALUATION_REPORT.md
PHASE3A_CRITICAL_FINDINGS_REPORT.md		PHASE3A_CRITICAL_FINDINGS_REPORT.md
PHASE3_COMPLETION_SUMMARY.md		PHASE3_COMPLETION_SUMMARY.md
PHASE_A_BREAKTHROUGH_ANALYSIS.md		PHASE_A_BREAKTHROUGH_ANALYSIS.md
PHASE_A_CRITICAL_FINDINGS.md		PHASE_A_CRITICAL_FINDINGS.md
PHASE_A_REVISED_STRATEGY.md		PHASE_A_REVISED_STRATEGY.md
PRIOR_PROVIDER_IMPLEMENTATION_REPORT.md		PRIOR_PROVIDER_IMPLEMENTATION_REPORT.md
PRODUCTION_FIX_SUMMARY.md		PRODUCTION_FIX_SUMMARY.md
README.md		README.md
README_CHAMELEON.md		README_CHAMELEON.md
README_lamp2_usage.md		README_lamp2_usage.md
REPORT_fakeit_alignit_audit.md		REPORT_fakeit_alignit_audit.md
REPOSITORY_AUDIT_REPORT_2025-08-25.md		REPOSITORY_AUDIT_REPORT_2025-08-25.md
ROBUSTNESS_ENHANCEMENTS_REPORT.md		ROBUSTNESS_ENHANCEMENTS_REPORT.md
SEMANTIC_SIMILARITY_ENHANCEMENT_REPORT.md		SEMANTIC_SIMILARITY_ENHANCEMENT_REPORT.md
STIEFEL_MANIFOLD_PHASE2_COMPLETION_REPORT.md		STIEFEL_MANIFOLD_PHASE2_COMPLETION_REPORT.md
STRICT_ORCHESTRATOR_READY.md		STRICT_ORCHESTRATOR_READY.md
STRICT_VALIDATION_COMPLETE.md		STRICT_VALIDATION_COMPLETE.md
SYSTEM_WORKFLOW_INVENTORY.md		SYSTEM_WORKFLOW_INVENTORY.md
TASK_BASED_QUALITY_EVALUATION_REPORT.md		TASK_BASED_QUALITY_EVALUATION_REPORT.md
VALIDATION_TOOLKIT_SUMMARY.md		VALIDATION_TOOLKIT_SUMMARY.md
WORKFLOW_fakeit_alignit_samples.md		WORKFLOW_fakeit_alignit_samples.md
adaptive_fusion_cfs_integration.py		adaptive_fusion_cfs_integration.py
adaptive_piece_fusion.py		adaptive_piece_fusion.py
alpha_batch_config.yaml		alpha_batch_config.yaml
alpha_optimization.py		alpha_optimization.py
apply_best_config.py		apply_best_config.py
apply_best_config_detailed.py		apply_best_config_detailed.py
build_faiss_index.py		build_faiss_index.py
calculate_token.ipynb		calculate_token.ipynb
causal_chameleon_evaluator.py		causal_chameleon_evaluator.py
cfs_chameleon_demo.py		cfs_chameleon_demo.py
cfs_chameleon_extension.py		cfs_chameleon_extension.py
cfs_chameleon_graphrag_implementation_report.md		cfs_chameleon_graphrag_implementation_report.md
cfs_chameleon_with_graphrag_part2.md		cfs_chameleon_with_graphrag_part2.md
cfs_comprehensive_evaluation.py		cfs_comprehensive_evaluation.py
cfs_comprehensive_results_report.md		cfs_comprehensive_results_report.md
cfs_config.yaml		cfs_config.yaml
cfs_evaluation_utils.py		cfs_evaluation_utils.py
cfs_improved_integration.py		cfs_improved_integration.py
cfs_quality_integration.py		cfs_quality_integration.py
cfs_quick_evaluation.py		cfs_quick_evaluation.py
cfs_semantic_integration.py		cfs_semantic_integration.py
chameleon_cfs_integrator.py		chameleon_cfs_integrator.py
chameleon_evaluator.py		chameleon_evaluator.py
chameleon_evaluator.py.ast_backup		chameleon_evaluator.py.ast_backup
chameleon_evaluator_fixed.py		chameleon_evaluator_fixed.py
chameleon_frozen_base.py		chameleon_frozen_base.py
chameleon_paper_compliant.py		chameleon_paper_compliant.py
chameleon_paper_evaluation.py		chameleon_paper_evaluation.py
collab_fusion_loader.py		collab_fusion_loader.py
config.yaml		config.yaml
config_optimized.yaml		config_optimized.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rango

🚀 Project Overview

📁 Project Structure

🛠️ Setup

1. Environment

🎉 効果

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

rango

🚀 Project Overview

📁 Project Structure

🛠️ Setup

1. Environment

🎉 効果

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages