LLM Harness Engineering Framework

AI Agent를 Production-ready하게 만드는 마크다운 기반 Harness Engineering Framework

Overview

이 저장소는 LLM(대형 언어 모델) 기반 AI Agent를 신뢰할 수 있게 Production 환경에 배포하기 위한 Harness Engineering Framework입니다.

핵심 철학: "LLM에게 말을 잘하게 하는 것은 Prompt Engineering이고, AI Agent를 믿을 수 있게 만드는 것은 Harness Engineering이다."

Quick Start

1. 프롬프트 사용법

# prompts/analysis/paper-critique.md 참조
# 필요한 프롬프트를 복사하여 Claude Code에서 사용

2. 파이프라인 실행

# Research → Analysis → Influence 파이프라인
# workflows/research-to-influence-pipeline.md 참조

3. 평가 진행

python tools/eval_runner.py --input ./evaluation/benchmarks/sample.md

Directory Structure

harness-engineering/
├── prompts/              # 프롬프트 레지스트리
│   ├── registry.md      # 모든 프롬프트 인덱스
│   ├── analysis/        # 분석용 프롬프트
│   │   ├── paper-summarize.md
│   │   └── paper-critique.md
│   └── generation/      # 생성용 프롬프트
│       └── linkedin-post.md
├── evaluation/          # 평가 시스템
│   ├── metrics.md       # 메트릭스 정의
│   └── results/        # 실험 결과
├── workflows/          # 에이전트 워크플로우
│   └── research-to-influence-pipeline.md
├── configs/            # 모델 설정
│   └── models.yaml
├── guardrails/          # 제약 조건
│   ├── safety.md
│   └── format.md
└── tools/              # 유틸리티 스크립트
    └── eval_runner.py

Core Concepts

Harness Engineering 3 Layers

Layer	설명	예시
Layer 1	Prompt/Instruction	시스템 프롬프트, few-shot examples
Layer 2	Workflow/Tooling/Guardrails	에이전트 워크플로우, 출력 검증
Layer 3	Infrastructure/Monitoring	로깅, 메트릭 수집, CI/CD

Research-to-Influence Pipeline

[논문/트렌드] → [READ & ANALYZE] → [THINK & CRITIQUE] → [BUILD & EXPERIMENT] → [LinkedIn/프로젝트]
     │                  │                    │                    │                    │
  Input          prompts/           evaluation/         tools/            Output
                 analysis           metrics             eval_runner

Author

GitHub: github.com/waylake
LinkedIn: linkedin.com/in/waylake

Contributing

기여를 환영합니다! Please read CONTRIBUTING.md for details.

License

This project is MIT licensed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Harness Engineering Framework

Overview

Quick Start

1. 프롬프트 사용법

2. 파이프라인 실행

3. 평가 진행

Directory Structure

Core Concepts

Harness Engineering 3 Layers

Research-to-Influence Pipeline

Author

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
configs		configs
evaluation		evaluation
guardrails		guardrails
prompts		prompts
tools		tools
workflows		workflows
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

LLM Harness Engineering Framework

Overview

Quick Start

1. 프롬프트 사용법

2. 파이프라인 실행

3. 평가 진행

Directory Structure

Core Concepts

Harness Engineering 3 Layers

Research-to-Influence Pipeline

Author

Contributing

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages