Skip to content

caiohome/awesome-caio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

9 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Awesome Enterprise AI โ€” the CAIO List

๐Ÿงญ Awesome Enterprise AI โ€” the CAIO List

Awesome License: CC BY 4.0 PRs Welcome Link check Website Stars Last commit

An adoption-first index of open-source AI for the enterprise โ€” curated through the eyes of a CAIO.

้ขๅ‘ CAIO๏ผˆ้ฆ–ๅธญ AI ๅฎ˜ / AI ่ดŸ่ดฃไบบ๏ผ‰็š„ใ€Œๅฏๅผ•ๅ…ฅๆ€งใ€ๅผ€ๆบ็ดขๅผ•ใ€‚ Not the coolest projects โ€” the ones you can actually bring into a company.

๐ŸŒ caiohome.com ยท ๐Ÿท๏ธ Legend ยท ๐Ÿงฑ Starter Stack ยท ๐Ÿ—บ๏ธ Decision Chain ยท ๐Ÿ—‚๏ธ Contents ยท ๐Ÿค Contribute


The one question this list answers: "Can this open-source project be brought into my company โ€” to which layer, used by whom, under what license, and at how much compliance risk?"

ๆœฌๆธ…ๅ•ๅชๅ›ž็ญ”ไธ€ไปถไบ‹๏ผšใ€Œ่ฟ™ไธชๅผ€ๆบ้กน็›ฎ่ƒฝไธ่ƒฝๅผ•ๅ…ฅๅ…ฌๅธใ€ๅผ•ๅ…ฅๅˆฐๅ“ชไธ€ๅฑ‚ใ€่ฐๆฅ็”จใ€ไป€ไนˆ่ฎธๅฏ่ฏใ€ๅˆ่ง„้ฃŽ้™ฉๅคšๅคงใ€ใ€‚ Maintained by CAIOไน‹ๅฎถ ยท caiohome.com โ€” the home for Chief AI Officers.

Every other awesome list organizes by technical category or research topic, serving engineers and researchers. This one flips the axis: it is organized by the CAIO adoption decision chain, and every entry carries a set of enterprise metadata tags and a direct link to its source. Anyone can copy the entries โ€” nobody can copy the judgment. That judgment is the only reason this list exists.

๐Ÿ“– Why another list? ยท ไธบไป€ไนˆๅ†ๅšไธ€ไธชๆธ…ๅ•

Awesome-LLM, Awesome-LLMOps, awesome-mcp-servers and friends are excellent โ€” but they answer "what exists?". A CAIO needs to answer "what can I safely ship, and what will legal/security say?" The market is full of 50k-star projects that no enterprise can touch (wrong license, no air-gap, single-vendor lock-in), and 800-star projects that are perfect to adopt.

So here the organizing principle is the introduction decision itself. A ๐Ÿ”ด-license 50k-star repo can be worth less to your company than a ๐ŸŸข-license 800-star one. Stars are not an inclusion criterion.

ๅธ‚้ขไธŠ็š„ awesome ๆธ…ๅ•ๆŒ‰ๆŠ€ๆœฏ็ฑปๅˆซ็ป„็ป‡๏ผŒๆœๅŠกๅทฅ็จ‹ๅธˆไธŽ็ ”็ฉถ่€…ใ€‚ๆœฌๆธ…ๅ•ๆขไธ€ๆ น่ฝด โ€”โ€” ๆŒ‰ CAIO ็š„ๅผ•ๅ…ฅๅ†ณ็ญ–้“พ ๅˆ†ๅฑ‚๏ผŒๅนถ่ฆๆฑ‚ๆฏไธชๆก็›ฎๆŒ‚ไธ€ๅฅ—ไผไธšๅ…ƒๆ•ฐๆฎ + ไธ€ไธชๅฏ็‚นๅ‡ป็š„ๆบๅคด้“พๆŽฅใ€‚ๆŠ„ๆก็›ฎๅฎนๆ˜“๏ผŒๆŠ„ไธ่ตฐ่ฟ™ๅฅ—ใ€Œๅผ•ๅ…ฅๅˆคๆ–ญใ€ใ€‚


Legend

Tags are decision aids, not endorsements. Final adoption rests with your legal, compliance, and security review. ๆ ‡็ญพๆ˜ฏๅˆคๆ–ญ่พ…ๅŠฉ๏ผŒไธๆ˜ฏ่ƒŒไนฆใ€‚ๆœ€็ปˆๅผ•ๅ…ฅๅ†ณ็ญ–ไปฅๅ…ฌๅธๆณ•ๅŠกใ€ๅˆ่ง„ใ€ๅฎ‰ๅ…จ่ฏ„ๅฎกไธบๅ‡†ใ€‚

License ๐ŸŸข๐ŸŸก๐Ÿ”ด and Maturity โญ๐Ÿงช๐Ÿ‘€ are required on every entry. Other tags are best-effort. Every project name is a link to its first-party source.

Axis Values
License ยท ่ฎธๅฏ่ฏ ๐ŸŸข Permissive โ€” Apache-2.0 / MIT / BSD, commercial use out of the box ยท ๐ŸŸก Conditional โ€” MPL / LGPL / model "community" licenses, commercial with strings (read the terms) ยท ๐Ÿ”ด Restricted โ€” GPL / AGPL / RAIL / non-commercial / custom, legal review mandatory
Maturity ยท ๆˆ็†Ÿๅบฆ โญ Production โ€” proven at scale, de-facto standard ยท ๐Ÿงช Pilot โ€” engineering-complete, good for a 30โ€“90 day trial ยท ๐Ÿ‘€ Watch โ€” important direction, still moving, track before adopting
Deployment ยท ้ƒจ็ฝฒ ๐Ÿ  Self-hostable โ€” on-prem / air-gap, data never leaves ยท โ˜๏ธ Cloud-first โ€” private deploy is costly ยท ๐Ÿ“ฑ Edge โ€” runs on-device / edge box
Origin ยท ๆฅๆบ ๐Ÿ‡จ๐Ÿ‡ณ China-led ยท ๐ŸŒ Global / overseas-led
Compliance ยท ๅˆ่ง„ ๐Ÿ›ก๏ธ Xinchuang-ready โ€” adapted to Ascend / domestic silicon ยท โš ๏ธ Sensitive โ€” data-residency / privacy / needs medical-affairs or legal sign-off
โœ… Inclusion criteria ยท ๆ”ถๅฝ•ๆ ‡ๅ‡†

Included โ€” must satisfy all:

  1. Open source with a stated OSS license (and we note the license type).
  2. Enterprise-introducible โ€” real production or pilot value; not a demo / toy.
  3. Maps to a layer of the CAIO decision chain (see Contents).
  4. Active or de-facto standard โ€” meaningful update in the last ~6 months, or already the standard in its niche.
  5. Metadata complete โ€” at minimum license + maturity.
  6. First-party first โ€” official org accounts beat personal forks beat second-hand aggregators.

Not included / footnoted only:

  • Closed SaaS (unless it has an OSS core; commercial products appear only as "for comparison" notes).
  • Pure academic repros with no engineering, unmaintained and without replacement value.
  • Repos with unclear or contradictory licensing (until clarified).
  • Hype forks / mirrors / marketing repos.

The CAIO Decision Chain

The figure the original notes referred to, drawn for real. Read it topโ†’bottom: each layer is a place where a CAIO makes an introduce / don't-introduce call. The private/Xinchuang layer underpins everything when data residency is a hard constraint.

flowchart TB
    MOD["๐Ÿง  ยง01 Foundation<br/>open-weight models"]
    TRN["๐Ÿ—๏ธ ยง02โ€“03 Train ยท Tune<br/>fine-tune ยท RL ยท kernels"]
    SRV["โš™๏ธ ยง04โ€“05 Serve ยท Schedule<br/>inference engines ยท orchestration"]
    GW["๐Ÿšช ยง06โ€“07 Gateway ยท Context<br/>routing ยท keys ยท token efficiency"]
    APP["๐Ÿค– ยง08โ€“10 ยท ยง12 Apps ยท Agents<br/>agents ยท RAG ยท MCP ยท auto-research"]
    GOV["๐Ÿ›ก๏ธ ยง11 Govern ยท Observe<br/>eval ยท tracing ยท guardrails"]
    INF["๐Ÿ  ยง13 Private ยท Xinchuang ยท Edge<br/>air-gap ยท Ascend ยท on-device"]
    HUB["๐Ÿ›ฐ๏ธ ยง14 Source ยท Host<br/>hubs ยท clouds ยท registries"]

    MOD --> TRN --> SRV --> GW --> APP --> GOV
    INF -. underpins .-> MOD
    INF -. underpins .-> SRV
    HUB -. supplies .-> MOD
Loading

Reference Starter Stack

๐Ÿงฑ The list's strongest CAIO opinion: an assembled, coherent open-source stack you could actually pilot โ€” not 150 disconnected links. Swap freely, but this is a sane default for a regulated/enterprise pilot with a multi-source model policy.

Layer Default pick (OSS) Xinchuang / air-gap swap Why
Models Qwen + DeepSeek (+ Llama backup) same (all self-hostable) Multi-source: 2 China-led + 1 global backup
Inference vLLM vllm-ascend Throughput standard; Ascend backend exists
Orchestration Ray Serve / llm-d Ray Serve Autoscaling, multi-model, K8s-native
Gateway LiteLLM / One-API One-API (self-host) Keys ยท quota ยท billing ยท audit in one place
Agents / Apps LangGraph + Dify Dify (private) Controllable, auditable orchestration
RAG RAGFlow + Milvus + BGE + MinerU same (all ๐Ÿ ) Parsing quality is the real RAG bottleneck
Govern Langfuse + promptfoo + NeMo Guardrails Langfuse (self-host) Without this layer, nothing is board-defensible
Base MindSpore / CANN โ€” Domestic training/inference stack

Contents

Each section below gives representative anchor entries that demonstrate the tagging format; full coverage is community-driven (see Contributing).


01 ยท Foundation Models & Open Weights

Adoption logic: fix your base first. A "2 China-led + 1 global/open backup" multi-source policy hedges supply risk. Weight licenses frequently differ from the code license โ€” check each. ๅผ•ๅ…ฅ้€ป่พ‘๏ผšๅ…ˆๅฎšๅบ•ๅบงๆฅๆบ๏ผŒๆณจๆ„ๆƒ้‡่ฎธๅฏ่ฏๅธธไธŽไปฃ็ ่ฎธๅฏ่ฏไธๅŒใ€‚

  • DeepSeek (deepseek-ai) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” V/R-series open weights, mostly MIT. Strong reasoning & code.
  • Qwen (QwenLM) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” Full size range + multimodal, mostly Apache-2.0; the enterprise-friendly default.
  • GLM (zai-org / THUDM) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸก โ€” GLM series; some versions carry usage terms โ€” confirm before commercial use.
  • Kimi (moonshotai) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” Long-context & agentic models, open weights.
  • MiniMax (MiniMax-AI) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” Open-weight large models with long context.
  • MiniCPM (OpenBMB) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸก ๐Ÿ“ฑ โ€” Edge-friendly small models.
  • Llama (meta-llama) ๐ŸŒ โญ ๐ŸŸก โ€” Llama Community License (includes an MAU-threshold clause); not pure OSS.
  • Mistral / Gemma / Phi (mistralai / google / microsoft) ๐ŸŒ โญ ๐ŸŸข๐ŸŸก โ€” overseas open-weight reference points.
  • gpt-oss (openai) ๐ŸŒ โญ ๐ŸŸข โ€” OpenAI's first open-weight models (120B / 20B), Apache-2.0; enterprise-safe license, a Western open-weight anchor.
  • OLMo (allenai) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Fully open model (training data + code + weights); the reference when full reproducibility / auditability is the requirement.

โš ๏ธ Verify weight licenses one by one: Apache/MIT are commercial-safe; "community licenses" need legal to read the clauses (commercial caps, naming, acceptable-use).

02 ยท Training / Fine-tuning / Post-training (incl. RL)

Adoption logic: 99% of enterprises do not pretrain. The action is in fine-tuning (domain adaptation) and post-training / RL (alignment + reasoning gains). ๅผ•ๅ…ฅ้€ป่พ‘๏ผš้‡็‚นๆ˜ฏๅพฎ่ฐƒ๏ผˆ้ข†ๅŸŸ้€‚้…๏ผ‰ไธŽๅŽ่ฎญ็ปƒ/RL๏ผˆๅฏน้ฝไธŽๆŽจ็†ๅขžๅผบ๏ผ‰ใ€‚

Distributed pretraining / large-scale training

  • Megatron-LM (NVIDIA) ๐ŸŒ โญ ๐ŸŸข โ€” A de-facto standard for large-scale parallel training.
  • DeepSpeed (deepspeedai) ๐ŸŒ โญ ๐ŸŸข โ€” ZeRO optimization; memory & throughput.
  • NeMo (NVIDIA) ๐ŸŒ โญ ๐ŸŸข โ€” End-to-end training framework; ๐Ÿ›ก๏ธ evaluate on non-Ascend domestic chips.
  • ColossalAI (hpcaitech) ๐Ÿงช ๐ŸŸข โ€” Parallel-training toolbox.
  • TorchTitan (pytorch) ๐Ÿงช ๐ŸŸข โ€” PyTorch-native large-model training reference.

Fine-tuning (most common)

  • LLaMA-Factory (hiyouga) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” One-stop fine-tuning; the most widely deployed in China.
  • ms-swift (modelscope) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  ๐Ÿ›ก๏ธ โ€” ModelScope training/tuning suite; good domestic-ecosystem fit.
  • Unsloth (unslothai) ๐ŸŒ โญ ๐ŸŸข โ€” Efficient single-GPU fine-tuning; saves VRAM.
  • Axolotl (axolotl-ai-cloud) ๐ŸŒ โญ ๐ŸŸข โ€” Config-driven fine-tuning.
  • TRL (huggingface) ๐ŸŒ โญ ๐ŸŸข โ€” HF post-training library (SFT / DPO / GRPO).

RLHF / RL (the reasoning-boost hot zone)

  • verl (volcengine / verl-project) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” ByteDance Seed's production-grade RL framework (HybridFlow).
  • OpenRLHF (OpenRLHF) ๐ŸŒ โญ ๐ŸŸข โ€” Early-popular, approachable RLHF library.
  • ROLL (alibaba) ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿงช ๐ŸŸข โ€” Alibaba large-scale RL framework.
  • AReaL (inclusionAI / Ant) ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿงช ๐ŸŸข โ€” Asynchronous RL, throughput-focused.
  • slime (THUDM) ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿงช ๐ŸŸข โ€” Zhipu/Tsinghua-lineage RL scaling.
  • NeMo-RL (NVIDIA-NeMo) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” NVIDIA post-training RL.
  • DAPO (BytedTsinghua-SIA) ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿ‘€ ๐ŸŸข โ€” ByteDance ร— Tsinghua open RL system/algorithm + dataset (built on verl).

Learn from scratch (capability-building for your seed engineers)

03 ยท High-performance Kernels & Low-level Systems

Adoption logic: in-house teams need these; almost everyone else only uses them, never edits them. Understanding them is what lets you push inference/training cost down. ๅผ•ๅ…ฅ้€ป่พ‘๏ผš็ปๅคงๅคšๆ•ฐๅ…ฌๅธๅชใ€Œ็”จใ€ไธใ€Œๆ”นใ€๏ผŒไฝ†็†่งฃๅฎƒไปฌๅ†ณๅฎšไฝ ่ƒฝไธ่ƒฝๅŽ‹ไฝŽๆˆๆœฌใ€‚

  • DeepSeek open-infra-index (deepseek-ai) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” Index of FlashMLA (MLA decode kernel), DeepEP (MoE comm library), DeepGEMM (FP8 GEMM), DualPipe (pipeline parallel), 3FS (parallel filesystem). Production-validated, mostly MIT.
  • FlashAttention (Dao-AILab) ๐ŸŒ โญ ๐ŸŸข โ€” The attention-acceleration standard.
  • Triton (triton-lang) ๐ŸŒ โญ ๐ŸŸข โ€” Language for writing GPU kernels.
  • CUTLASS (NVIDIA) ๐ŸŒ โญ ๐ŸŸข โ€” CUDA matrix-op template library.
  • Liger-Kernel (linkedin) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Fused training kernels; saves VRAM.

04 ยท Inference Engines

Adoption logic: this is the main cost battleground. Engine choice directly sets per-GPU throughput and concurrency cost. ๅผ•ๅ…ฅ้€ป่พ‘๏ผš้™ๆœฌไธปๆˆ˜ๅœบ๏ผŒๅผ•ๆ“Ž้€‰ๅž‹็›ดๆŽฅๅ†ณๅฎšๅ•ๅกๅžๅไธŽๅนถๅ‘ๆˆๆœฌใ€‚

  • vLLM (vllm-project) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” High-throughput inference standard (PagedAttention); ๐Ÿ›ก๏ธ vllm-ascend Ascend fork exists.
  • SGLang (sgl-project) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” High-performance serving; common for RAG / structured output.
  • TensorRT-LLM (NVIDIA) ๐ŸŒ โญ ๐ŸŸข โ€” Peak optimization on NVIDIA GPUs.
  • LMDeploy (InternLM) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” InternLM team's deploy stack; domestic-ecosystem friendly.
  • llama.cpp (ggml-org) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  ๐Ÿ“ฑ โ€” CPU / edge / quantized deployment.
  • Xinference (xorbitsai) ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” Multi-model local inference server.

05 ยท Compute Scheduling & Serving Orchestration

Adoption logic: once you have multi-GPU / multi-node clusters, you need a layer above the engine for load balancing, autoscaling, and multi-tenancy. ๅผ•ๅ…ฅ้€ป่พ‘๏ผšๅผ•ๆ“Žไน‹ไธŠ้œ€่ฆ่ฐƒๅบฆไธŽ็ผ–ๆŽ’ๅš่ดŸ่ฝฝๅ‡่กกใ€ๆ‰ฉ็ผฉๅฎนใ€ๅคš็งŸๆˆทใ€‚

  • llm-d (llm-d) ๐ŸŒ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” vLLM/SGLang orchestration on K8s: smart routing, tiered KV-cache, prefill/decode disaggregation, SLO autoscaling.
  • llmaz (InftyAI) ๐ŸŒ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” Lightweight inference platform on K8s.
  • K8s scheduler-plugins (kubernetes-sigs) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” GPU scheduling plugins.
  • Ray / Ray Serve (ray-project) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Distributed serving & orchestration; elastic, multi-model.
  • KServe (kserve) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” The K8s model-serving standard.
  • NVIDIA Triton + Dynamo (triton-inference-server / ai-dynamo) ๐ŸŒ โญ ๐ŸŸข โ€” Official inference serving, enterprise support.
  • BentoML / OpenLLM (bentoml) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Packaging & deployment.
  • AIBrix (vllm-project) ๐ŸŒ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” Cost-efficient, pluggable K8s infra for LLM inference (ByteDance-origin): LLM-aware autoscaling, KV-cache offload, routing.
  • LMCache (LMCache) ๐ŸŒ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” KV-cache layer that accelerates serving via cross-request reuse / offload; pairs with vLLM and llm-d.

06 ยท Gateway / Routing / API & Cost Governance

Adoption logic: the first piece of enterprise AI infrastructure โ€” unify multi-source access, keys/quota/billing, audit, rate-limit, fallback. ๅผ•ๅ…ฅ้€ป่พ‘๏ผšไผไธš็บง AIใ€Œ็ฌฌไธ€ๅ—ๅŸบ็ก€่ฎพๆ–ฝใ€๏ผŒ็ปŸไธ€ๆŽฅๅ…ฅไธŽๆฒป็†ใ€‚

LLM gateways / API aggregation

  • LiteLLM (BerriAI) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Widest ecosystem, fastest to integrate, self-hostable.
  • Portkey Gateway (Portkey-AI) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Routing / caching / guardrails / observability / budget control.
  • Kong AI Gateway (Kong) ๐ŸŒ โญ ๐ŸŸข โ€” AI features on a mature API gateway; first choice if you already run Kong.
  • Helicone (Helicone) ๐ŸŒ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” Lightweight gateway/observability, easy to integrate.
  • One-API (songquanpeng, ๐ŸŸข MIT) / New-API (Calcium-Ion, ๐Ÿ”ด AGPL-3.0) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐Ÿ  โ€” Multi-tenant gateways with keys/quota/billing/audit; a top self-host choice in China. โš ๏ธ New-API is AGPL โ€” legal review before SaaS redistribution.
  • Higress (higress-group / Alibaba) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” AI-native API gateway (Envoy-based) with token rate-limiting, semantic cache and content-safety plugins; China-origin, good Xinchuang/medical-content-safety story.

Model routing (pick a model per prompt โ€” cut cost, raise quality)

  • RouteLLM (lm-sys) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Strong/weak model routing framework.
  • Semantic Router (aurelio-labs) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Routing on semantic embeddings.

โš ๏ธ The gateway is where keys and logs converge โ€” design security / audit / PII-redaction here; it's the cheapest place to do it.

07 ยท Context Engineering & Token Efficiency

Adoption logic: cuts the API/inference bill directly. As "Vibe Coding" scales across your dev team, token cost rises โ€” this layer is explicit ROI. ๅผ•ๅ…ฅ้€ป่พ‘๏ผš็›ดๆŽฅ็  API/ๆŽจ็†่ดฆๅ•๏ผŒๆ˜พๆ€ง ROIใ€‚

  • rtk (Rust Token Killer) (rtk-ai) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” CLI proxy that filters/compresses command output before it enters context; saves 60โ€“90% tokens; transparent hooks into Claude Code / Codex / Cursor / Gemini CLI.
  • LLMLingua (microsoft) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Prompt / context compression.
  • Repomix (yamadashy) ๐ŸŒ โญ ๐ŸŸข โ€” Pack a repo into a single file to feed a model.
  • files-to-prompt / code2prompt (simonw / mufeedvh) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Code-context assembly.

08 ยท Orchestration Frameworks & Agents

Adoption logic: the main vehicle for internal enablement (sales / support / docs / R&D). In regulated/medical settings, prefer controllable, auditable, deterministic frameworks. ๅผ•ๅ…ฅ้€ป่พ‘๏ผšๅ†…้ƒจ่ต‹่ƒฝ่ฝๅœฐ็š„ไธป่ฝฝไฝ“๏ผ›ๅˆ่ง„ๅœบๆ™ฏไผ˜ๅ…ˆๅฏๆŽงใ€ๅฏๅฎก่ฎกใ€็กฎๅฎšๆ€งๅผบ็š„ๆก†ๆžถใ€‚

  • LangGraph (langchain-ai) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Graph-based agent orchestration; production-ready, highly controllable.
  • LlamaIndex (run-llama) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” RAG / agent data framework.
  • AutoGen (microsoft) ๐ŸŒ โญ ๐ŸŸข โ€” Multi-agent orchestration.
  • DSPy (stanfordnlp) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Declarative prompt / program optimization.
  • DeerFlow (bytedance) ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” Deep-research multi-agent; production-base candidate.
  • Dify (langgenius) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” LLM app platform; low-code + self-hostable.
  • MetaGPT (FoundationAgents) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” Multi-agent "software team" paradigm.
  • OpenHands (All-Hands-AI) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Open-source coding agent.
  • n8n (n8n-io) ๐ŸŒ โญ ๐ŸŸก ๐Ÿ  โ€” Workflow automation (Sustainable Use License โ€” confirm terms).
  • CrewAI (crewAIInc) ๐ŸŒ โญ ๐ŸŸข โ€” Role-playing multi-agent orchestration; popular for collaborative agent teams.
  • Pydantic AI (pydantic) ๐ŸŒ โญ ๐ŸŸข โ€” Type-safe agent framework; strong fit for controllable, testable enterprise agents.
  • Google ADK (google) ๐ŸŒ โญ ๐ŸŸข โ€” Code-first Agent Development Kit: build, evaluate, deploy.
  • Agno (agno-agi) ๐ŸŒ โญ ๐ŸŸข โ€” High-performance framework to build and run agent platforms.
  • Bisheng (dataelement) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” Enterprise LLM DevOps platform: workflow + RAG + agents + fine-tune + observability, self-hostable.

โš ๏ธ Medical Class II/III: fully dynamic multi-agent systems conflict with NMPA "deterministic, auditable" requirements โ€” freeze a traceable chain at the orchestration layer.

09 ยท MCP / Tools / Skills

Adoption logic: the protocol + capability layer that lets agents safely touch enterprise systems. The enterprise focus is gatewayed MCP + audit + permissions. ๅผ•ๅ…ฅ้€ป่พ‘๏ผš่ฎฉ Agent ๅฎ‰ๅ…จๆŽฅๅ…ฅไผไธš็ณป็ปŸ็š„ๅ่ฎฎๅฑ‚ไธŽ่ƒฝๅŠ›ๅฑ‚ใ€‚

10 ยท RAG / Knowledge Base / Data Processing

Adoption logic: the base for internal-knowledge enablement and product RAG. Document-parsing quality is usually the real make-or-break of RAG. ๅผ•ๅ…ฅ้€ป่พ‘๏ผšๆ–‡ๆกฃ่งฃๆž่ดจ้‡ๅพ€ๅพ€ๆ˜ฏ RAG ๆˆ่ดฅ็š„็œŸๆญฃ็“ถ้ขˆใ€‚

Vector DB / retrieval

  • Milvus (milvus-io / Zilliz) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” Mainstream vector database.
  • Qdrant (qdrant) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Rust vector DB.
  • pgvector (pgvector) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Postgres extension; lowest ops overhead.
  • TurboVec (RyanCodrai) ๐ŸŒ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” Embedded vector index (FAISS-class, not a server DB) on Google Research's TurboQuant quantizer (ICLR 2026); 16ร— compression vs float32 (10Mโ†’4GB), recall on par with FAISS-PQ, modest speed edge (1โ€“20%). MIT. New & ~single-maintainer โ€” fit for local/private-RAG PoC; vet maturity before production.

๐Ÿงช = emerging/assess. Note: an embedded index (like FAISS/ScaNN), not a server-side vector DB (Milvus/Qdrant/pgvector) โ€” no distributed/clustering, filtering is id-allowlist only.

Embedding / rerank

  • BGE (FlagOpen / BAAI) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” BGE-M3 / BGE-Reranker; first choice for Chinese-language RAG retrieval.

RAG frameworks / app platforms

  • RAGFlow (infiniflow) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” RAG engine with deep document understanding.
  • LightRAG (HKUDS) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Graph-augmented RAG.
  • GraphRAG (microsoft) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Knowledge-graph RAG.
  • FastGPT (labring) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  โ€” Knowledge-base Q&A platform.

Document parsing (the real bottleneck)

  • MinerU (opendatalab) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸก ๐Ÿ  โ€” PDF / layout parsing; the domestic first choice. โš ๏ธ Apache-2.0 + additional terms โ€” confirm for large-scale commercial use.
  • Docling (docling-project / IBM) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Documents โ†’ structured data.
  • Unstructured (Unstructured-IO) ๐ŸŒ โญ ๐ŸŸก โ€” Multi-format parsing.
  • markitdown (microsoft) ๐ŸŒ โญ ๐ŸŸข โ€” Convert Office / PDF / HTML and more into clean Markdown for LLMs.

Web ingestion & memory

  • Crawl4AI (unclecode) ๐ŸŒ โญ ๐ŸŸข โ€” LLM-friendly web crawler / scraper for RAG data ingestion.
  • mem0 (mem0ai) ๐ŸŒ โญ ๐ŸŸข โ€” Memory layer for agents; persistent user / agent memory across sessions.

11 ยท Evaluation / Observability / Guardrails / Governance

Adoption logic: the CAIO's shield. Without this layer, everything above is unauditable and indefensible to the board. ๅผ•ๅ…ฅ้€ป่พ‘๏ผšCAIO ็š„ใ€Œ็›พใ€ใ€‚ๆฒกๆœ‰่ฟ™ไธ€ๅฑ‚๏ผŒๅ‰้ขๆ‰€ๆœ‰ๅผ•ๅ…ฅ้ƒฝไธๅฏๅฎก่ฎกใ€ไธๅฏๅ‘่‘ฃไบ‹ไผšไบคไปฃใ€‚

Evaluation

  • lm-evaluation-harness (EleutherAI) ๐ŸŒ โญ ๐ŸŸข โ€” The academic eval standard.
  • OpenCompass (open-compass) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข โ€” Comprehensive China-origin eval.
  • promptfoo (promptfoo) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Engineering-grade prompt/model eval & red-teaming.
  • Ragas (vibrantlabsai) ๐ŸŒ โญ ๐ŸŸข โ€” RAG evaluation toolkit (faithfulness, answer relevancy, context metrics).
  • DeepEval (confident-ai) ๐ŸŒ โญ ๐ŸŸข โ€” LLM evaluation / unit-testing framework with many built-in metrics.

Observability / tracing

  • Langfuse (langfuse) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Open-source LLM observability, self-hostable.
  • Phoenix (Arize-ai) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Tracing & evaluation.
  • OpenLLMetry (traceloop) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” OpenTelemetry semantic conventions for LLMs.

Guardrails / safety

  • NeMo Guardrails (NVIDIA) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โ€” Conversational guardrails.
  • Guardrails AI (guardrails-ai) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” Output validation.
  • Llama Guard / PurpleLlama (meta-llama) ๐ŸŒ โญ ๐ŸŸก โ€” Safety classification.
  • garak (NVIDIA) ๐ŸŒ ๐Ÿงช ๐ŸŸข โ€” LLM vulnerability / red-team scanner.
  • Presidio (microsoft) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  โš ๏ธ โ€” PII / PHI detection, redaction and anonymization; the gateway-side control for medical / financial data.

12 ยท Autonomous Research & Scientific Discovery

Adoption logic: the "research accelerator" for R&D / medical affairs. Mostly ๐Ÿ‘€ Watch for now โ€” mind the non-standard licenses and reproducibility. ๅผ•ๅ…ฅ้€ป่พ‘๏ผš็ ”ๅ‘/ๅŒปๅญฆไบ‹ๅŠก็š„ใ€Œ็ ”็ฉถๅŠ ้€Ÿๅ™จใ€๏ผŒๅฝ“ๅ‰ๅคšไธบ่ง‚ๅฏŸๆœŸใ€‚

  • AI Scientist / v2 (SakanaAI) ๐ŸŒ ๐Ÿ‘€ ๐Ÿ”ด โ€” Pioneering end-to-end autonomous research. โš ๏ธ Non-standard license (RAIL-derived, with a mandatory-disclosure clause) โ€” legal review mandatory before any enterprise use.
  • AI-Researcher (HKUDS) ๐ŸŒ ๐Ÿ‘€ ๐ŸŸข โ€” HKU Data Intelligence Lab; autonomous research innovation.
  • open-ai-co-scientist (llnl) ๐ŸŒ ๐Ÿ‘€ ๐ŸŸข โ€” Open reproduction of Google's AI co-scientist multi-agent system.
  • GPT-Researcher (assafelovic) ๐ŸŒ ๐Ÿงช ๐ŸŸข ๐Ÿ  โ€” Autonomous deep-research agent.
  • DeerFlow (see ยง08) ๐Ÿ‡จ๐Ÿ‡ณ โ€” Deep-research orchestration, self-hostable.

13 ยท Private / Xinchuang / Edge Deployment

Adoption logic: the hard constraints of medical / government / SOE โ€” data never leaves, domestic substitution, edge/on-device. This layer decides whether everything above can legally land. ๅผ•ๅ…ฅ้€ป่พ‘๏ผšๆ•ฐๆฎไธๅ‡บๅŸŸใ€ๅ›ฝไบงๅŒ–ๆ›ฟไปฃใ€่พน็ผ˜็ซฏไพง โ€”โ€” ๅ†ณๅฎšๅ‰้ขๆ‰€ๆœ‰้กน็›ฎ่ƒฝไธ่ƒฝๅˆ่ง„่ฝๅœฐใ€‚

  • awesome-private-ai (tdi) ๐ŸŒ โ€” On-prem / air-gap / self-hosted curated list.
  • MindSpore / CANN (mindspore-ai / Huawei Ascend) ๐Ÿ‡จ๐Ÿ‡ณ โญ ๐ŸŸข ๐Ÿ  ๐Ÿ›ก๏ธ โ€” Ascend training/inference stack.
  • vllm-ascend (vllm-project) ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿงช ๐ŸŸข ๐Ÿ  ๐Ÿ›ก๏ธ โ€” vLLM Ascend backend; the Xinchuang inference path.
  • LocalAI (mudler) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  ๐Ÿ“ฑ โ€” OpenAI-compatible local inference.
  • Jan (menloresearch) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  ๐Ÿ“ฑ โ€” Offline AI assistant.
  • Ollama (ollama) ๐ŸŒ โญ ๐ŸŸข ๐Ÿ  ๐Ÿ“ฑ โ€” Local model runtime (MIT; mind trademark & commercial positioning, not the license).

14 ยท Platforms, Hubs & Registries โ€” where CAIOs source & host

Adoption logic: the repos are only half the story. A CAIO also needs the sourcing & hosting map โ€” where models actually live, where you pull them behind the firewall, and which managed clouds and domestic silicon you can stand on. ๅผ•ๅ…ฅ้€ป่พ‘๏ผš็Ÿฅ้“ไป“ๅบ“่ฟ˜ไธๅคŸ๏ผŒ่ฟ˜่ฆ็Ÿฅ้“ใ€ŒๅŽปๅ“ชๅ–ๆจกๅž‹ใ€ๅœจๅ“ชๆ‰˜็ฎกใ€้ ๅ“ชไธชๅ›ฝไบงๆ ˆใ€ใ€‚

โš ๏ธ Managed clouds below are listed as sourcing/hosting venues, not as OSS entries โ€” included because that is where CAIOs source & host in practice.

Model hubs & registries ยท ๆจกๅž‹ๆžข็บฝ

Code hosting ยท ไปฃ็ ๆ‰˜็ฎก

  • GitHub ๐ŸŒ โ€” Where most of this list lives.
  • Gitee ็ ไบ‘ ๐Ÿ‡จ๐Ÿ‡ณ โ€” China's largest host; many domestic OSS mirrors.
  • GitLab ๐ŸŒ โ€” Self-hostable CE, common behind the firewall.
  • AtomGit ๐Ÿ‡จ๐Ÿ‡ณ ๐Ÿ›ก๏ธ โ€” OpenAtom Foundation hosting, Xinchuang-aligned.

Managed AI clouds โ€” overseas ยท ๆตทๅค–ไบ‘

Managed AI clouds โ€” China ยท ๅ›ฝๅ†…ไบ‘

Inference-as-a-service / API aggregators ยท ๆŽจ็†ๅณๆœๅŠก

Domestic compute stacks ยท ๅ›ฝไบง็ฎ—ๅŠ›ๆ ˆ (ไฟกๅˆ›)

15 ยท Orgs & People to Follow (open-source account index)

Adoption logic: follow the source, not the repo. Watching these official org accounts gets you the signal earlier than chasing single repos. Prioritize org accounts; keep personal accounts minimal. ๅผ•ๅ…ฅ้€ป่พ‘๏ผš่ทŸไป“ๅบ“ไธๅฆ‚่ทŸใ€Œๆบๅคดใ€ใ€‚

Models & low-level โ€” China: deepseek-ai ยท QwenLM ยท zai-org (GLM) ยท OpenBMB ยท ByteDanceSeed ยท MoonshotAI (Kimi) ยท MiniMax-AI ยท FlagOpen (BAAI) ยท InternLM ยท modelscope

Models & low-level โ€” global: NVIDIA ยท microsoft ยท meta-llama ยท mistralai ยท huggingface ยท google-research

Inference / infrastructure: vllm-project ยท sgl-project ยท ray-project ยท InftyAI ยท llm-d ยท BerriAI (LiteLLM)

Agents / apps / RAG: langchain-ai ยท run-llama ยท langgenius (Dify) ยท infiniflow (RAGFlow) ยท HKUDS ยท opendatalab

Protocol / tools: modelcontextprotocol ยท ComposioHQ

People (OSS maintainers, follow as needed): karpathy (understand LLMs from zero) โ€” otherwise track via the org accounts above.

16 ยท Other Awesome Lists (meta-index)

This list doesn't reinvent the wheel. Below are deeper, domain-specific lists โ€” use them as drill-down entry points. ๆœฌๆธ…ๅ•ไธ้‡ๅค้€ ่ฝฎๅญ๏ผ›ไปฅไธ‹ๆ˜ฏๅ„็ป†ๅˆ†้ข†ๅŸŸๆ›ดๆทฑ็š„ไธ“้—จๆธ…ๅ•ใ€‚


Contributing

PRs welcome โ€” see CONTRIBUTING.md. The short version:

  1. One project per PR, in the right section, sorted by maturity then name.
  2. License + maturity tags are mandatory; add deployment/origin/compliance tags where you can.
  3. Every entry must link to its first-party source.
  4. Maintainer review before merge: link validity, license matches the code's own declaration, activity in the last ~6 months.
  5. One sentence on which layer it fits and what problem it solves โ€” no marketing fluff.
  6. Closed/commercial products don't go in the body; if there's an OSS core, link the core repo.

๐Ÿค– A GitHub Action checks every link on each PR, and the scripts/audit.py helper cross-checks stars + license against the GitHub API.

Disclaimer

This list is a decision aid, not legal, compliance, or procurement advice. Final license, compliance, and security judgments rest with your company's legal, compliance, and security teams. Tags can go stale as projects evolve โ€” always confirm against the project's current upstream state before adoption.

ๆœฌๆธ…ๅ•ไธบๅผ•ๅ…ฅๅ†ณ็ญ–็š„่พ…ๅŠฉๅ‚่€ƒ๏ผŒไธๆž„ๆˆๆณ•ๅพ‹ใ€ๅˆ่ง„ๆˆ–้‡‡่ดญๅปบ่ฎฎใ€‚ๅผ•ๅ…ฅๅ‰่ฏทไปฅ้กน็›ฎๅฎ˜ๆ–นไป“ๅบ“็š„ๅฝ“ๅ‰็Šถๆ€ไธบๅ‡†ใ€‚


โญ Star history

Star History Chart

Maintained with โ˜• by CAIOไน‹ๅฎถ ยท caiohome.com โ€” the home for Chief AI Officers. Content licensed under CC BY 4.0. Attribution: "Awesome CAIO โ€” caiohome.com".

If this saved you one bad procurement decision, give it a โญ and pass it to your AI lead.

About

๐Ÿงญ Awesome Enterprise AI โ€” the CAIO list. An adoption-first index of open-source AI for the enterprise (license ยท maturity ยท deployment ยท compliance tags). by caiohome.com

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages