🧭 Awesome Enterprise AI — the CAIO List

An adoption-first index of open-source AI for the enterprise — curated through the eyes of a CAIO.

面向 CAIO（首席 AI 官 / AI 负责人）的「可引入性」开源索引。 Not the coolest projects — the ones you can actually bring into a company.

🌐 caiohome.com · 🏷️ Legend · 🧱 Starter Stack · 🗺️ Decision Chain · 🗂️ Contents · 🤝 Contribute

The one question this list answers: "Can this open-source project be brought into my company — to which layer, used by whom, under what license, and at how much compliance risk?"

本清单只回答一件事：「这个开源项目能不能引入公司、引入到哪一层、谁来用、什么许可证、合规风险多大」。 Maintained by CAIO之家 · caiohome.com — the home for Chief AI Officers.

Every other awesome list organizes by technical category or research topic, serving engineers and researchers. This one flips the axis: it is organized by the CAIO adoption decision chain, and every entry carries a set of enterprise metadata tags and a direct link to its source. Anyone can copy the entries — nobody can copy the judgment. That judgment is the only reason this list exists.

📖 Why another list? · 为什么再做一个清单

Awesome-LLM, Awesome-LLMOps, awesome-mcp-servers and friends are excellent — but they answer "what exists?". A CAIO needs to answer "what can I safely ship, and what will legal/security say?" The market is full of 50k-star projects that no enterprise can touch (wrong license, no air-gap, single-vendor lock-in), and 800-star projects that are perfect to adopt.

So here the organizing principle is the introduction decision itself. A 🔴-license 50k-star repo can be worth less to your company than a 🟢-license 800-star one. Stars are not an inclusion criterion.

市面上的 awesome 清单按技术类别组织，服务工程师与研究者。本清单换一根轴 —— 按 CAIO 的引入决策链 分层，并要求每个条目挂一套企业元数据 + 一个可点击的源头链接。抄条目容易，抄不走这套「引入判断」。

Legend

Tags are decision aids, not endorsements. Final adoption rests with your legal, compliance, and security review. 标签是判断辅助，不是背书。最终引入决策以公司法务、合规、安全评审为准。

License 🟢🟡🔴 and Maturity ⭐🧪👀 are required on every entry. Other tags are best-effort. Every project name is a link to its first-party source.

Axis	Values
License · 许可证	🟢 Permissive — Apache-2.0 / MIT / BSD, commercial use out of the box · 🟡 Conditional — MPL / LGPL / model "community" licenses, commercial with strings (read the terms) · 🔴 Restricted — GPL / AGPL / RAIL / non-commercial / custom, legal review mandatory
Maturity · 成熟度	⭐ Production — proven at scale, de-facto standard · 🧪 Pilot — engineering-complete, good for a 30–90 day trial · 👀 Watch — important direction, still moving, track before adopting
Deployment · 部署	🏠 Self-hostable — on-prem / air-gap, data never leaves · ☁️ Cloud-first — private deploy is costly · 📱 Edge — runs on-device / edge box
Origin · 来源	🇨🇳 China-led · 🌍 Global / overseas-led
Compliance · 合规	🛡️ Xinchuang-ready — adapted to Ascend / domestic silicon · ⚠️ Sensitive — data-residency / privacy / needs medical-affairs or legal sign-off

✅ Inclusion criteria · 收录标准

Included — must satisfy all:

Open source with a stated OSS license (and we note the license type).
Enterprise-introducible — real production or pilot value; not a demo / toy.
Maps to a layer of the CAIO decision chain (see Contents).
Active or de-facto standard — meaningful update in the last ~6 months, or already the standard in its niche.
Metadata complete — at minimum license + maturity.
First-party first — official org accounts beat personal forks beat second-hand aggregators.

Not included / footnoted only:

Closed SaaS (unless it has an OSS core; commercial products appear only as "for comparison" notes).
Pure academic repros with no engineering, unmaintained and without replacement value.
Repos with unclear or contradictory licensing (until clarified).
Hype forks / mirrors / marketing repos.

The CAIO Decision Chain

The figure the original notes referred to, drawn for real. Read it top→bottom: each layer is a place where a CAIO makes an introduce / don't-introduce call. The private/Xinchuang layer underpins everything when data residency is a hard constraint.

flowchart TB
    MOD["🧠 §01 Foundation<br/>open-weight models"]
    TRN["🏗️ §02–03 Train · Tune<br/>fine-tune · RL · kernels"]
    SRV["⚙️ §04–05 Serve · Schedule<br/>inference engines · orchestration"]
    GW["🚪 §06–07 Gateway · Context<br/>routing · keys · token efficiency"]
    APP["🤖 §08–10 · §12 Apps · Agents<br/>agents · RAG · MCP · auto-research"]
    GOV["🛡️ §11 Govern · Observe<br/>eval · tracing · guardrails"]
    INF["🏠 §13 Private · Xinchuang · Edge<br/>air-gap · Ascend · on-device"]
    HUB["🛰️ §14 Source · Host<br/>hubs · clouds · registries"]

    MOD --> TRN --> SRV --> GW --> APP --> GOV
    INF -. underpins .-> MOD
    INF -. underpins .-> SRV
    HUB -. supplies .-> MOD

Reference Starter Stack

🧱 The list's strongest CAIO opinion: an assembled, coherent open-source stack you could actually pilot — not 150 disconnected links. Swap freely, but this is a sane default for a regulated/enterprise pilot with a multi-source model policy.

Layer	Default pick (OSS)	Xinchuang / air-gap swap	Why
Models	`Qwen` + `DeepSeek` (+ `Llama` backup)	same (all self-hostable)	Multi-source: 2 China-led + 1 global backup
Inference	vLLM	`vllm-ascend`	Throughput standard; Ascend backend exists
Orchestration	Ray Serve / llm-d	Ray Serve	Autoscaling, multi-model, K8s-native
Gateway	LiteLLM / One-API	One-API (self-host)	Keys · quota · billing · audit in one place
Agents / Apps	LangGraph + Dify	Dify (private)	Controllable, auditable orchestration
RAG	RAGFlow + Milvus + BGE + MinerU	same (all 🏠)	Parsing quality is the real RAG bottleneck
Govern	Langfuse + promptfoo + NeMo Guardrails	Langfuse (self-host)	Without this layer, nothing is board-defensible
Base	MindSpore / CANN	—	Domestic training/inference stack

🏷️ Legend · 🗺️ Decision Chain · 🧱 Starter Stack
01 · Foundation Models & Open Weights
02 · Training / Fine-tuning / Post-training (incl. RL)
03 · High-performance Kernels & Low-level Systems
04 · Inference Engines
05 · Compute Scheduling & Serving Orchestration
06 · Gateway / Routing / API & Cost Governance
07 · Context Engineering & Token Efficiency
08 · Orchestration Frameworks & Agents
09 · MCP / Tools / Skills
10 · RAG / Knowledge Base / Data Processing
11 · Evaluation / Observability / Guardrails / Governance
12 · Autonomous Research & Scientific Discovery
13 · Private / Xinchuang / Edge Deployment
14 · Platforms, Hubs & Registries — where CAIOs source & host
15 · Orgs & People to Follow (open-source account index)
16 · Other Awesome Lists (meta-index)
🤝 Contributing · ⚖️ Disclaimer

Each section below gives representative anchor entries that demonstrate the tagging format; full coverage is community-driven (see Contributing).

01 · Foundation Models & Open Weights

Adoption logic: fix your base first. A "2 China-led + 1 global/open backup" multi-source policy hedges supply risk. Weight licenses frequently differ from the code license — check each. 引入逻辑：先定底座来源，注意权重许可证常与代码许可证不同。

DeepSeek (deepseek-ai) 🇨🇳 ⭐ 🟢 — V/R-series open weights, mostly MIT. Strong reasoning & code.
Qwen (QwenLM) 🇨🇳 ⭐ 🟢 — Full size range + multimodal, mostly Apache-2.0; the enterprise-friendly default.
GLM (zai-org / THUDM) 🇨🇳 ⭐ 🟡 — GLM series; some versions carry usage terms — confirm before commercial use.
Kimi (moonshotai) 🇨🇳 ⭐ 🟢 — Long-context & agentic models, open weights.
MiniMax (MiniMax-AI) 🇨🇳 ⭐ 🟢 — Open-weight large models with long context.
MiniCPM (OpenBMB) 🇨🇳 ⭐ 🟡 📱 — Edge-friendly small models.
Llama (meta-llama) 🌍 ⭐ 🟡 — Llama Community License (includes an MAU-threshold clause); not pure OSS.
Mistral / Gemma / Phi (mistralai / google / microsoft) 🌍 ⭐ 🟢🟡 — overseas open-weight reference points.
gpt-oss (openai) 🌍 ⭐ 🟢 — OpenAI's first open-weight models (120B / 20B), Apache-2.0; enterprise-safe license, a Western open-weight anchor.
OLMo (allenai) 🌍 🧪 🟢 — Fully open model (training data + code + weights); the reference when full reproducibility / auditability is the requirement.

⚠️ Verify weight licenses one by one: Apache/MIT are commercial-safe; "community licenses" need legal to read the clauses (commercial caps, naming, acceptable-use).

02 · Training / Fine-tuning / Post-training (incl. RL)

Adoption logic: 99% of enterprises do not pretrain. The action is in fine-tuning (domain adaptation) and post-training / RL (alignment + reasoning gains). 引入逻辑：重点是微调（领域适配）与后训练/RL（对齐与推理增强）。

Distributed pretraining / large-scale training

Megatron-LM (NVIDIA) 🌍 ⭐ 🟢 — A de-facto standard for large-scale parallel training.
DeepSpeed (deepspeedai) 🌍 ⭐ 🟢 — ZeRO optimization; memory & throughput.
NeMo (NVIDIA) 🌍 ⭐ 🟢 — End-to-end training framework; 🛡️ evaluate on non-Ascend domestic chips.
ColossalAI (hpcaitech) 🧪 🟢 — Parallel-training toolbox.
TorchTitan (pytorch) 🧪 🟢 — PyTorch-native large-model training reference.

Fine-tuning (most common)

LLaMA-Factory (hiyouga) 🇨🇳 ⭐ 🟢 🏠 — One-stop fine-tuning; the most widely deployed in China.
ms-swift (modelscope) 🇨🇳 ⭐ 🟢 🏠 🛡️ — ModelScope training/tuning suite; good domestic-ecosystem fit.
Unsloth (unslothai) 🌍 ⭐ 🟢 — Efficient single-GPU fine-tuning; saves VRAM.
Axolotl (axolotl-ai-cloud) 🌍 ⭐ 🟢 — Config-driven fine-tuning.
TRL (huggingface) 🌍 ⭐ 🟢 — HF post-training library (SFT / DPO / GRPO).

RLHF / RL (the reasoning-boost hot zone)

verl (volcengine / verl-project) 🇨🇳 ⭐ 🟢 — ByteDance Seed's production-grade RL framework (HybridFlow).
OpenRLHF (OpenRLHF) 🌍 ⭐ 🟢 — Early-popular, approachable RLHF library.
ROLL (alibaba) 🇨🇳 🧪 🟢 — Alibaba large-scale RL framework.
AReaL (inclusionAI / Ant) 🇨🇳 🧪 🟢 — Asynchronous RL, throughput-focused.
slime (THUDM) 🇨🇳 🧪 🟢 — Zhipu/Tsinghua-lineage RL scaling.
NeMo-RL (NVIDIA-NeMo) 🌍 🧪 🟢 — NVIDIA post-training RL.
DAPO (BytedTsinghua-SIA) 🇨🇳 👀 🟢 — ByteDance × Tsinghua open RL system/algorithm + dataset (built on verl).

Learn from scratch (capability-building for your seed engineers)

Karpathy (karpathy) 🌍 ⭐ 🟢 — nanoGPT / llm.c / nanochat / micrograd; the best "understand LLMs from zero" teaching code.

03 · High-performance Kernels & Low-level Systems

Adoption logic: in-house teams need these; almost everyone else only uses them, never edits them. Understanding them is what lets you push inference/training cost down. 引入逻辑：绝大多数公司只「用」不「改」，但理解它们决定你能不能压低成本。

DeepSeek open-infra-index (deepseek-ai) 🇨🇳 ⭐ 🟢 — Index of FlashMLA (MLA decode kernel), DeepEP (MoE comm library), DeepGEMM (FP8 GEMM), DualPipe (pipeline parallel), 3FS (parallel filesystem). Production-validated, mostly MIT.
FlashAttention (Dao-AILab) 🌍 ⭐ 🟢 — The attention-acceleration standard.
Triton (triton-lang) 🌍 ⭐ 🟢 — Language for writing GPU kernels.
CUTLASS (NVIDIA) 🌍 ⭐ 🟢 — CUDA matrix-op template library.
Liger-Kernel (linkedin) 🌍 🧪 🟢 — Fused training kernels; saves VRAM.

04 · Inference Engines

Adoption logic: this is the main cost battleground. Engine choice directly sets per-GPU throughput and concurrency cost. 引入逻辑：降本主战场，引擎选型直接决定单卡吞吐与并发成本。

vLLM (vllm-project) 🌍 ⭐ 🟢 🏠 — High-throughput inference standard (PagedAttention); 🛡️ vllm-ascend Ascend fork exists.
SGLang (sgl-project) 🌍 ⭐ 🟢 🏠 — High-performance serving; common for RAG / structured output.
TensorRT-LLM (NVIDIA) 🌍 ⭐ 🟢 — Peak optimization on NVIDIA GPUs.
LMDeploy (InternLM) 🇨🇳 ⭐ 🟢 🏠 — InternLM team's deploy stack; domestic-ecosystem friendly.
llama.cpp (ggml-org) 🌍 ⭐ 🟢 🏠 📱 — CPU / edge / quantized deployment.
Xinference (xorbitsai) 🇨🇳 🧪 🟢 🏠 — Multi-model local inference server.

05 · Compute Scheduling & Serving Orchestration

Adoption logic: once you have multi-GPU / multi-node clusters, you need a layer above the engine for load balancing, autoscaling, and multi-tenancy. 引入逻辑：引擎之上需要调度与编排做负载均衡、扩缩容、多租户。

llm-d (llm-d) 🌍 🧪 🟢 🏠 — vLLM/SGLang orchestration on K8s: smart routing, tiered KV-cache, prefill/decode disaggregation, SLO autoscaling.
llmaz (InftyAI) 🌍 🧪 🟢 🏠 — Lightweight inference platform on K8s.
K8s scheduler-plugins (kubernetes-sigs) 🌍 ⭐ 🟢 🏠 — GPU scheduling plugins.
Ray / Ray Serve (ray-project) 🌍 ⭐ 🟢 🏠 — Distributed serving & orchestration; elastic, multi-model.
KServe (kserve) 🌍 ⭐ 🟢 🏠 — The K8s model-serving standard.
NVIDIA Triton + Dynamo (triton-inference-server / ai-dynamo) 🌍 ⭐ 🟢 — Official inference serving, enterprise support.
BentoML / OpenLLM (bentoml) 🌍 ⭐ 🟢 🏠 — Packaging & deployment.
AIBrix (vllm-project) 🌍 🧪 🟢 🏠 — Cost-efficient, pluggable K8s infra for LLM inference (ByteDance-origin): LLM-aware autoscaling, KV-cache offload, routing.
LMCache (LMCache) 🌍 🧪 🟢 🏠 — KV-cache layer that accelerates serving via cross-request reuse / offload; pairs with vLLM and llm-d.

06 · Gateway / Routing / API & Cost Governance

Adoption logic: the first piece of enterprise AI infrastructure — unify multi-source access, keys/quota/billing, audit, rate-limit, fallback. 引入逻辑：企业级 AI「第一块基础设施」，统一接入与治理。

LLM gateways / API aggregation

LiteLLM (BerriAI) 🌍 ⭐ 🟢 🏠 — Widest ecosystem, fastest to integrate, self-hostable.
Portkey Gateway (Portkey-AI) 🌍 ⭐ 🟢 🏠 — Routing / caching / guardrails / observability / budget control.
Kong AI Gateway (Kong) 🌍 ⭐ 🟢 — AI features on a mature API gateway; first choice if you already run Kong.
Helicone (Helicone) 🌍 🧪 🟢 🏠 — Lightweight gateway/observability, easy to integrate.
One-API (songquanpeng, 🟢 MIT) / New-API (Calcium-Ion, 🔴 AGPL-3.0) 🇨🇳 ⭐ 🏠 — Multi-tenant gateways with keys/quota/billing/audit; a top self-host choice in China. ⚠️ New-API is AGPL — legal review before SaaS redistribution.
Higress (higress-group / Alibaba) 🇨🇳 ⭐ 🟢 🏠 — AI-native API gateway (Envoy-based) with token rate-limiting, semantic cache and content-safety plugins; China-origin, good Xinchuang/medical-content-safety story.

Model routing (pick a model per prompt — cut cost, raise quality)

RouteLLM (lm-sys) 🌍 🧪 🟢 — Strong/weak model routing framework.
Semantic Router (aurelio-labs) 🌍 🧪 🟢 — Routing on semantic embeddings.

⚠️ The gateway is where keys and logs converge — design security / audit / PII-redaction here; it's the cheapest place to do it.

07 · Context Engineering & Token Efficiency

Adoption logic: cuts the API/inference bill directly. As "Vibe Coding" scales across your dev team, token cost rises — this layer is explicit ROI. 引入逻辑：直接砍 API/推理账单，显性 ROI。

rtk (Rust Token Killer) (rtk-ai) 🌍 ⭐ 🟢 🏠 — CLI proxy that filters/compresses command output before it enters context; saves 60–90% tokens; transparent hooks into Claude Code / Codex / Cursor / Gemini CLI.
LLMLingua (microsoft) 🌍 🧪 🟢 — Prompt / context compression.
Repomix (yamadashy) 🌍 ⭐ 🟢 — Pack a repo into a single file to feed a model.
files-to-prompt / code2prompt (simonw / mufeedvh) 🌍 🧪 🟢 — Code-context assembly.

08 · Orchestration Frameworks & Agents

Adoption logic: the main vehicle for internal enablement (sales / support / docs / R&D). In regulated/medical settings, prefer controllable, auditable, deterministic frameworks. 引入逻辑：内部赋能落地的主载体；合规场景优先可控、可审计、确定性强的框架。

LangGraph (langchain-ai) 🌍 ⭐ 🟢 🏠 — Graph-based agent orchestration; production-ready, highly controllable.
LlamaIndex (run-llama) 🌍 ⭐ 🟢 🏠 — RAG / agent data framework.
AutoGen (microsoft) 🌍 ⭐ 🟢 — Multi-agent orchestration.
DSPy (stanfordnlp) 🌍 🧪 🟢 — Declarative prompt / program optimization.
DeerFlow (bytedance) 🇨🇳 🧪 🟢 🏠 — Deep-research multi-agent; production-base candidate.
Dify (langgenius) 🇨🇳 ⭐ 🟢 🏠 — LLM app platform; low-code + self-hostable.
MetaGPT (FoundationAgents) 🇨🇳 ⭐ 🟢 — Multi-agent "software team" paradigm.
OpenHands (All-Hands-AI) 🌍 ⭐ 🟢 🏠 — Open-source coding agent.
n8n (n8n-io) 🌍 ⭐ 🟡 🏠 — Workflow automation (Sustainable Use License — confirm terms).
CrewAI (crewAIInc) 🌍 ⭐ 🟢 — Role-playing multi-agent orchestration; popular for collaborative agent teams.
Pydantic AI (pydantic) 🌍 ⭐ 🟢 — Type-safe agent framework; strong fit for controllable, testable enterprise agents.
Google ADK (google) 🌍 ⭐ 🟢 — Code-first Agent Development Kit: build, evaluate, deploy.
Agno (agno-agi) 🌍 ⭐ 🟢 — High-performance framework to build and run agent platforms.
Bisheng (dataelement) 🇨🇳 ⭐ 🟢 🏠 — Enterprise LLM DevOps platform: workflow + RAG + agents + fine-tune + observability, self-hostable.

⚠️ Medical Class II/III: fully dynamic multi-agent systems conflict with NMPA "deterministic, auditable" requirements — freeze a traceable chain at the orchestration layer.

09 · MCP / Tools / Skills

Adoption logic: the protocol + capability layer that lets agents safely touch enterprise systems. The enterprise focus is gatewayed MCP + audit + permissions. 引入逻辑：让 Agent 安全接入企业系统的协议层与能力层。

Model Context Protocol (modelcontextprotocol) 🌍 ⭐ 🟢 — Official protocol + SDKs.
awesome-mcp-servers (punkpeye) 🌍 — The master index of MCP servers.
awesome-mcp-enterprise (bh-rat) 🌍 — Enterprise-grade MCP subset.
MCP Gateway (lasso-security) + mcpo 🌍 🧪 🟢 🏠 — MCP gateway / auth / audit.
Composio (ComposioHQ) 🌍 🧪 🟢 — Tool integration.
FastMCP (PrefectHQ) 🌍 ⭐ 🟢 — The fast, Pythonic way to build MCP servers and clients.
awesome-claude-skills (ComposioHQ) 🌍 — Skills asset index.

10 · RAG / Knowledge Base / Data Processing

Adoption logic: the base for internal-knowledge enablement and product RAG. Document-parsing quality is usually the real make-or-break of RAG. 引入逻辑：文档解析质量往往是 RAG 成败的真正瓶颈。

Vector DB / retrieval

Milvus (milvus-io / Zilliz) 🇨🇳 ⭐ 🟢 🏠 — Mainstream vector database.
Qdrant (qdrant) 🌍 ⭐ 🟢 🏠 — Rust vector DB.
pgvector (pgvector) 🌍 ⭐ 🟢 🏠 — Postgres extension; lowest ops overhead.
TurboVec (RyanCodrai) 🌍 🧪 🟢 🏠 — Embedded vector index (FAISS-class, not a server DB) on Google Research's TurboQuant quantizer (ICLR 2026); ~~16× compression vs float32 (10M→~~4GB), recall on par with FAISS-PQ, modest speed edge (1–20%). MIT. New & ~single-maintainer — fit for local/private-RAG PoC; vet maturity before production.

🧪 = emerging/assess. Note: an embedded index (like FAISS/ScaNN), not a server-side vector DB (Milvus/Qdrant/pgvector) — no distributed/clustering, filtering is id-allowlist only.

Embedding / rerank

BGE (FlagOpen / BAAI) 🇨🇳 ⭐ 🟢 🏠 — BGE-M3 / BGE-Reranker; first choice for Chinese-language RAG retrieval.

RAG frameworks / app platforms

RAGFlow (infiniflow) 🇨🇳 ⭐ 🟢 🏠 — RAG engine with deep document understanding.
LightRAG (HKUDS) 🌍 🧪 🟢 — Graph-augmented RAG.
GraphRAG (microsoft) 🌍 🧪 🟢 — Knowledge-graph RAG.
FastGPT (labring) 🇨🇳 ⭐ 🟢 🏠 — Knowledge-base Q&A platform.

Document parsing (the real bottleneck)

MinerU (opendatalab) 🇨🇳 ⭐ 🟡 🏠 — PDF / layout parsing; the domestic first choice. ⚠️ Apache-2.0 + additional terms — confirm for large-scale commercial use.
Docling (docling-project / IBM) 🌍 ⭐ 🟢 🏠 — Documents → structured data.
Unstructured (Unstructured-IO) 🌍 ⭐ 🟡 — Multi-format parsing.
markitdown (microsoft) 🌍 ⭐ 🟢 — Convert Office / PDF / HTML and more into clean Markdown for LLMs.

Web ingestion & memory

Crawl4AI (unclecode) 🌍 ⭐ 🟢 — LLM-friendly web crawler / scraper for RAG data ingestion.
mem0 (mem0ai) 🌍 ⭐ 🟢 — Memory layer for agents; persistent user / agent memory across sessions.

11 · Evaluation / Observability / Guardrails / Governance

Adoption logic: the CAIO's shield. Without this layer, everything above is unauditable and indefensible to the board. 引入逻辑：CAIO 的「盾」。没有这一层，前面所有引入都不可审计、不可向董事会交代。

Evaluation

lm-evaluation-harness (EleutherAI) 🌍 ⭐ 🟢 — The academic eval standard.
OpenCompass (open-compass) 🇨🇳 ⭐ 🟢 — Comprehensive China-origin eval.
promptfoo (promptfoo) 🌍 ⭐ 🟢 🏠 — Engineering-grade prompt/model eval & red-teaming.
Ragas (vibrantlabsai) 🌍 ⭐ 🟢 — RAG evaluation toolkit (faithfulness, answer relevancy, context metrics).
DeepEval (confident-ai) 🌍 ⭐ 🟢 — LLM evaluation / unit-testing framework with many built-in metrics.

Observability / tracing

Langfuse (langfuse) 🌍 ⭐ 🟢 🏠 — Open-source LLM observability, self-hostable.
Phoenix (Arize-ai) 🌍 ⭐ 🟢 🏠 — Tracing & evaluation.
OpenLLMetry (traceloop) 🌍 🧪 🟢 — OpenTelemetry semantic conventions for LLMs.

Guardrails / safety

NeMo Guardrails (NVIDIA) 🌍 ⭐ 🟢 🏠 — Conversational guardrails.
Guardrails AI (guardrails-ai) 🌍 🧪 🟢 — Output validation.
Llama Guard / PurpleLlama (meta-llama) 🌍 ⭐ 🟡 — Safety classification.
garak (NVIDIA) 🌍 🧪 🟢 — LLM vulnerability / red-team scanner.
Presidio (microsoft) 🌍 ⭐ 🟢 🏠 ⚠️ — PII / PHI detection, redaction and anonymization; the gateway-side control for medical / financial data.

12 · Autonomous Research & Scientific Discovery

Adoption logic: the "research accelerator" for R&D / medical affairs. Mostly 👀 Watch for now — mind the non-standard licenses and reproducibility. 引入逻辑：研发/医学事务的「研究加速器」，当前多为观察期。

AI Scientist / v2 (SakanaAI) 🌍 👀 🔴 — Pioneering end-to-end autonomous research. ⚠️ Non-standard license (RAIL-derived, with a mandatory-disclosure clause) — legal review mandatory before any enterprise use.
AI-Researcher (HKUDS) 🌍 👀 🟢 — HKU Data Intelligence Lab; autonomous research innovation.
open-ai-co-scientist (llnl) 🌍 👀 🟢 — Open reproduction of Google's AI co-scientist multi-agent system.
GPT-Researcher (assafelovic) 🌍 🧪 🟢 🏠 — Autonomous deep-research agent.
DeerFlow (see §08) 🇨🇳 — Deep-research orchestration, self-hostable.

13 · Private / Xinchuang / Edge Deployment

Adoption logic: the hard constraints of medical / government / SOE — data never leaves, domestic substitution, edge/on-device. This layer decides whether everything above can legally land. 引入逻辑：数据不出域、国产化替代、边缘端侧 —— 决定前面所有项目能不能合规落地。

awesome-private-ai (tdi) 🌍 — On-prem / air-gap / self-hosted curated list.
MindSpore / CANN (mindspore-ai / Huawei Ascend) 🇨🇳 ⭐ 🟢 🏠 🛡️ — Ascend training/inference stack.
vllm-ascend (vllm-project) 🇨🇳 🧪 🟢 🏠 🛡️ — vLLM Ascend backend; the Xinchuang inference path.
LocalAI (mudler) 🌍 ⭐ 🟢 🏠 📱 — OpenAI-compatible local inference.
Jan (menloresearch) 🌍 ⭐ 🟢 🏠 📱 — Offline AI assistant.
Ollama (ollama) 🌍 ⭐ 🟢 🏠 📱 — Local model runtime (MIT; mind trademark & commercial positioning, not the license).

14 · Platforms, Hubs & Registries — where CAIOs source & host

Adoption logic: the repos are only half the story. A CAIO also needs the sourcing & hosting map — where models actually live, where you pull them behind the firewall, and which managed clouds and domestic silicon you can stand on. 引入逻辑：知道仓库还不够，还要知道「去哪取模型、在哪托管、靠哪个国产栈」。

⚠️ Managed clouds below are listed as sourcing/hosting venues, not as OSS entries — included because that is where CAIOs source & host in practice.

Model hubs & registries · 模型枢纽

Hugging Face 🌍 — The default global hub for models, datasets & Spaces.
ModelScope 魔搭 🇨🇳 — Alibaba-backed hub; the de-facto China mirror for weights & datasets.
GitCode AI / 模型广场 🇨🇳 — CSDN-backed mirror of OSS & models.
Kaggle Models 🌍 — Hosted models + notebooks.
Ollama Library 🌍 📱 — One-command local model pulls.

Code hosting · 代码托管

GitHub 🌍 — Where most of this list lives.
Gitee 码云 🇨🇳 — China's largest host; many domestic OSS mirrors.
GitLab 🌍 — Self-hostable CE, common behind the firewall.
AtomGit 🇨🇳 🛡️ — OpenAtom Foundation hosting, Xinchuang-aligned.

Managed AI clouds — overseas · 海外云

AWS Bedrock / SageMaker 🌍 ☁️
Azure AI Foundry 🌍 ☁️
Google Vertex AI 🌍 ☁️
NVIDIA NGC / NIM 🌍 ☁️ 🏠 — Containers, models, microservices.

Managed AI clouds — China · 国内云

Alibaba Cloud Model Studio 百炼 🇨🇳 ☁️
Volcengine Ark 火山方舟 🇨🇳 ☁️
Baidu Qianfan 千帆 🇨🇳 ☁️
Tencent Cloud TI / Hunyuan 腾讯云 🇨🇳 ☁️
Huawei Cloud ModelArts 华为云 🇨🇳 ☁️ 🛡️

Inference-as-a-service / API aggregators · 推理即服务

OpenRouter 🌍 — Many models behind one key.
Together / Fireworks / Groq / DeepInfra 🌍 — Hosted open-weight inference at speed.
SiliconFlow 硅基流动 🇨🇳 — Low-cost hosted open-weight inference.

Domestic compute stacks · 国产算力栈 (信创)

Huawei Ascend CANN 🇨🇳 🛡️ — The Ascend NPU compute architecture (the CUDA analog for Xinchuang).
MindSpore 🇨🇳 🛡️ — Huawei's AI framework.
Cambricon 寒武纪 · Moore Threads 摩尔线程 · Hygon 海光 · Biren 壁仞 🇨🇳 🛡️ — Domestic accelerators to evaluate for Xinchuang procurement.

15 · Orgs & People to Follow (open-source account index)

Adoption logic: follow the source, not the repo. Watching these official org accounts gets you the signal earlier than chasing single repos. Prioritize org accounts; keep personal accounts minimal. 引入逻辑：跟仓库不如跟「源头」。

Models & low-level — China: deepseek-ai · QwenLM · zai-org (GLM) · OpenBMB · ByteDanceSeed · MoonshotAI (Kimi) · MiniMax-AI · FlagOpen (BAAI) · InternLM · modelscope

Models & low-level — global: NVIDIA · microsoft · meta-llama · mistralai · huggingface · google-research

Inference / infrastructure: vllm-project · sgl-project · ray-project · InftyAI · llm-d · BerriAI (LiteLLM)

Agents / apps / RAG: langchain-ai · run-llama · langgenius (Dify) · infiniflow (RAGFlow) · HKUDS · opendatalab

Protocol / tools: modelcontextprotocol · ComposioHQ

People (OSS maintainers, follow as needed): karpathy (understand LLMs from zero) — otherwise track via the org accounts above.

16 · Other Awesome Lists (meta-index)

This list doesn't reinvent the wheel. Below are deeper, domain-specific lists — use them as drill-down entry points. 本清单不重复造轮子；以下是各细分领域更深的专门清单。

tensorchord/Awesome-LLMOps — Full-stack LLMOps; best-structured.
Not-Diamond/awesome-ai-model-routing — Model-routing focus.
xlite-dev/Awesome-LLM-Inference — Inference acceleration: papers + code.
deepseek-ai/open-infra-index — DeepSeek's official infra index.
Hannibal046/Awesome-LLM — The classic master LLM list.
EthicalML/awesome-production-machine-learning — Production ML / governance, the elder list.
punkpeye/awesome-mcp-servers · bh-rat/awesome-mcp-enterprise — MCP ecosystem.
Shubhamsaboo/awesome-llm-apps · e2b-dev/awesome-ai-agents — Apps & agents.

Contributing

PRs welcome — see CONTRIBUTING.md. The short version:

One project per PR, in the right section, sorted by maturity then name.
License + maturity tags are mandatory; add deployment/origin/compliance tags where you can.
Every entry must link to its first-party source.
Maintainer review before merge: link validity, license matches the code's own declaration, activity in the last ~6 months.
One sentence on which layer it fits and what problem it solves — no marketing fluff.
Closed/commercial products don't go in the body; if there's an OSS core, link the core repo.

🤖 A GitHub Action checks every link on each PR, and the scripts/audit.py helper cross-checks stars + license against the GitHub API.

Disclaimer

This list is a decision aid, not legal, compliance, or procurement advice. Final license, compliance, and security judgments rest with your company's legal, compliance, and security teams. Tags can go stale as projects evolve — always confirm against the project's current upstream state before adoption.

本清单为引入决策的辅助参考，不构成法律、合规或采购建议。引入前请以项目官方仓库的当前状态为准。

⭐ Star history

Maintained with ☕ by CAIO之家 · caiohome.com — the home for Chief AI Officers. Content licensed under CC BY 4.0. Attribution: "Awesome CAIO — caiohome.com".

_{If this saved you one bad procurement decision, give it a ⭐ and pass it to your AI lead.}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github		.github
assets		assets
scripts		scripts
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
awesome.md		awesome.md
lychee.toml		lychee.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧭 Awesome Enterprise AI — the CAIO List

Legend

The CAIO Decision Chain

Reference Starter Stack

Contents

01 · Foundation Models & Open Weights

02 · Training / Fine-tuning / Post-training (incl. RL)

03 · High-performance Kernels & Low-level Systems

04 · Inference Engines

05 · Compute Scheduling & Serving Orchestration

06 · Gateway / Routing / API & Cost Governance

07 · Context Engineering & Token Efficiency

08 · Orchestration Frameworks & Agents

09 · MCP / Tools / Skills

10 · RAG / Knowledge Base / Data Processing

11 · Evaluation / Observability / Guardrails / Governance

12 · Autonomous Research & Scientific Discovery

13 · Private / Xinchuang / Edge Deployment

14 · Platforms, Hubs & Registries — where CAIOs source & host

15 · Orgs & People to Follow (open-source account index)

16 · Other Awesome Lists (meta-index)

Contributing

Disclaimer

⭐ Star history

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧭 Awesome Enterprise AI — the CAIO List

Legend

The CAIO Decision Chain

Reference Starter Stack

Contents

01 · Foundation Models & Open Weights

02 · Training / Fine-tuning / Post-training (incl. RL)

03 · High-performance Kernels & Low-level Systems

04 · Inference Engines

05 · Compute Scheduling & Serving Orchestration

06 · Gateway / Routing / API & Cost Governance

07 · Context Engineering & Token Efficiency

08 · Orchestration Frameworks & Agents

09 · MCP / Tools / Skills

10 · RAG / Knowledge Base / Data Processing

11 · Evaluation / Observability / Guardrails / Governance

12 · Autonomous Research & Scientific Discovery

13 · Private / Xinchuang / Edge Deployment

14 · Platforms, Hubs & Registries — where CAIOs source & host

15 · Orgs & People to Follow (open-source account index)

16 · Other Awesome Lists (meta-index)

Contributing

Disclaimer

⭐ Star history

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages