diff --git a/doc/JIEBA_SEGMENTATION_FEASIBILITY.md b/doc/JIEBA_SEGMENTATION_FEASIBILITY.md
new file mode 100644
index 00000000..c56c82aa
--- /dev/null
+++ b/doc/JIEBA_SEGMENTATION_FEASIBILITY.md
@@ -0,0 +1,559 @@
+# Jieba 分词算法集成可行性分析
+
+## 摘要
+
+本文档分析将 OpenCC 的分词算法从 mmseg（最大正向匹配）扩展支持 jieba 分词的可行性。重点探讨两种实现方案：
+1. 通过集成 cppjieba 实现 C++ 原生支持
+2. 通过 pybind11 嵌入 Python jieba 库
+
+两种方案均采用**实验性配置**的方式，不影响现有的 mmseg 配置和功能。
+
+---
+
+## 一、背景与动机
+
+### 1.1 OpenCC 当前分词机制
+
+OpenCC 目前使用 **最大正向匹配（Maximum Match Segmentation, mmseg）** 作为唯一的分词算法：
+
+- **实现位置**：`src/MaxMatchSegmentation.cpp`
+- **算法原理**：从左到右扫描文本，在词典中查找最长的前缀匹配
+- **特点**：
+  - 简单高效，时间复杂度 O(n)
+  - 完全依赖词典，无法处理未登录词（OOV）
+  - 对于歧义切分，仅依赖最长匹配原则，缺乏上下文理解
+
+**架构设计**：
+```cpp
+// 抽象接口 (src/Segmentation.hpp)
+class Segmentation {
+public:
+  virtual SegmentsPtr Segment(const std::string& text) const = 0;
+};
+
+// 当前唯一实现 (src/MaxMatchSegmentation.hpp)
+class MaxMatchSegmentation : public Segmentation {
+public:
+  MaxMatchSegmentation(const DictPtr _dict);
+  virtual SegmentsPtr Segment(const std::string& text) const override;
+private:
+  const DictPtr dict;
+};
+```
+
+**配置解析** (`src/Config.cpp:138-151`)：
+```cpp
+SegmentationPtr ParseSegmentation(const JSONValue& doc) {
+  std::string type = GetStringProperty(doc, "type");
+  if (type == "mmseg") {
+    DictPtr dict = ParseDict(GetObjectProperty(doc, "dict"));
+    segmentation = SegmentationPtr(new MaxMatchSegmentation(dict));
+  } else {
+    throw InvalidFormat("Unknown segmentation type: " + type);
+  }
+  return segmentation;
+}
+```
+
+### 1.2 为什么考虑 Jieba？
+
+**Jieba 的优势**：
+1. **智能分词**：结合词典、动态规划和 HMM，能更好地处理歧义和未登录词
+2. **多种模式**：精确模式、全模式、搜索引擎模式满足不同场景需求
+3. **自适应能力**：通过 HMM 模型识别新词和人名、地名等专有名词
+4. **成熟生态**：拥有 C++ (cppjieba) 和 Python 的成熟实现
+
+**应用场景**：
+- 处理包含大量网络新词、专有名词的现代文本
+- 需要更精确的语义理解的转换场景
+- 搜索引擎优化的分词需求
+
+---
+
+## 二、Jieba 分词算法原理
+
+### 2.1 核心算法
+
+Jieba 采用 **三层混合算法**：
+
+#### 1. 基于 Trie 树的词典匹配
+- 使用前缀树（Trie）高效存储词典
+- 构建有向无环图（DAG）记录所有可能的切分路径
+- 时间复杂度：O(n)
+
+#### 2. 动态规划求最大概率路径
+- 基于词频计算每个切分的概率
+- 使用动态规划找到概率最大的切分序列
+- 公式：`P(w1, w2, ..., wn) = P(w1) × P(w2|w1) × ... × P(wn|wn-1)`
+
+#### 3. HMM 处理未登录词
+- 使用隐马尔可夫模型（Hidden Markov Model）
+- 四状态标注：B（词首）、M（词中）、E（词尾）、S（单字词）
+- Viterbi 算法求解最优状态序列
+
+### 2.2 分词模式
+
+| 模式 | 说明 | 适用场景 |
+|------|------|----------|
+| **精确模式** | 最精确的切分，适合文本分析 | 简繁转换的主要场景 |
+| **全模式** | 扫描所有可能的词，速度快但会产生歧义 | 词汇提取 |
+| **搜索引擎模式** | 在精确模式基础上，对长词再次切分 | 搜索优化 |
+
+---
+
+## 三、方案一：集成 cppjieba（推荐）
+
+### 3.1 cppjieba 简介
+
+**项目地址**：[yanyiwu/cppjieba](https://github.com/yanyiwu/cppjieba)
+
+**特点**：
+- ✅ **Header-only 设计**：可通过 `libcppjieba` 以头文件方式集成
+- ✅ **零额外依赖**：仅需 C++ 标准库
+- ✅ **高性能**：纯 C++ 实现，线上环境验证
+- ✅ **完整功能**：支持所有 jieba 分词模式和自定义词典
+- ✅ **跨平台**：Linux / macOS / Windows
+- ✅ **UTF-8 原生支持**：与 OpenCC 一致的编码
+
+**API 接口**：
+```cpp
+class Jieba {
+public:
+  Jieba(const string& dict_path,
+        const string& model_path,
+        const string& user_dict_path = "",
+        const string& idf_path = "",
+        const string& stop_word_path = "");
+
+  // 混合分词（推荐用于 OpenCC）
+  void Cut(const string& sentence, vector<string>& words, bool hmm = true);
+
+  // 带位置信息的分词
+  void Cut(const string& sentence, vector<Word>& words, bool hmm = true);
+
+  // 其他模式
+  void CutAll(const string& sentence, vector<string>& words);
+  void CutForSearch(const string& sentence, vector<string>& words);
+};
+```
+
+### 3.2 集成方案设计
+
+#### 3.2.1 新增类 `JiebaSegmentation`
+
+**文件**：`src/JiebaSegmentation.hpp` / `.cpp`
+
+```cpp
+// src/JiebaSegmentation.hpp
+#pragma once
+
+#include "Common.hpp"
+#include "Segmentation.hpp"
+#include <memory>
+
+// 前向声明，避免在头文件中暴露 cppjieba
+namespace cppjieba {
+class Jieba;
+}
+
+namespace opencc {
+
+class OPENCC_EXPORT JiebaSegmentation : public Segmentation {
+public:
+  // 构造函数：接受 jieba 词典路径
+  JiebaSegmentation(const std::string& dict_path,
+                    const std::string& model_path,
+                    const std::string& user_dict_path = "");
+
+  virtual ~JiebaSegmentation();
+
+  virtual SegmentsPtr Segment(const std::string& text) const override;
+
+private:
+  // PIMPL 模式隐藏实现细节
+  std::unique_ptr<cppjieba::Jieba> jieba_;
+};
+
+} // namespace opencc
+```
+
+```cpp
+// src/JiebaSegmentation.cpp
+#include "JiebaSegmentation.hpp"
+#include "cppjieba/Jieba.hpp"
+
+using namespace opencc;
+
+JiebaSegmentation::JiebaSegmentation(
+    const std::string& dict_path,
+    const std::string& model_path,
+    const std::string& user_dict_path)
+    : jieba_(new cppjieba::Jieba(dict_path, model_path, user_dict_path)) {
+}
+
+JiebaSegmentation::~JiebaSegmentation() = default;
+
+SegmentsPtr JiebaSegmentation::Segment(const std::string& text) const {
+  SegmentsPtr segments(new Segments);
+  std::vector<std::string> words;
+
+  // 使用混合模式（词典 + HMM）
+  jieba_->Cut(text, words, true);
+
+  for (const auto& word : words) {
+    segments->AddSegment(word);
+  }
+
+  return segments;
+}
+```
+
+#### 3.2.2 修改配置解析
+
+**文件**：`src/Config.cpp`
+
+```cpp
+// 在 ParseSegmentation 函数中添加 jieba 分支
+SegmentationPtr ParseSegmentation(const JSONValue& doc) {
+  std::string type = GetStringProperty(doc, "type");
+
+  if (type == "mmseg") {
+    DictPtr dict = ParseDict(GetObjectProperty(doc, "dict"));
+    return SegmentationPtr(new MaxMatchSegmentation(dict));
+  }
+#ifdef ENABLE_JIEBA
+  else if (type == "jieba") {
+    // 读取 jieba 配置
+    std::string dict_path = GetStringProperty(doc, "dict_path");
+    std::string model_path = GetStringProperty(doc, "model_path");
+
+    std::string user_dict_path;
+    if (doc.HasMember("user_dict_path") && doc["user_dict_path"].IsString()) {
+      user_dict_path = doc["user_dict_path"].GetString();
+    }
+
+    return SegmentationPtr(new JiebaSegmentation(
+        dict_path, model_path, user_dict_path));
+  }
+#endif
+  else {
+    throw InvalidFormat("Unknown segmentation type: " + type);
+  }
+}
+```
+
+#### 3.2.3 实验性配置文件
+
+**文件**：`data/config/s2t_jieba.json`
+
+```json
+{
+  "name": "Simplified to Traditional (Jieba Segmentation - Experimental)",
+  "segmentation": {
+    "type": "jieba",
+    "dict_path": "jieba_dict/jieba.dict.utf8",
+    "model_path": "jieba_dict/hmm_model.utf8",
+    "user_dict_path": "jieba_dict/user.dict.utf8"
+  },
+  "conversion_chain": [{
+    "dict": {
+      "type": "group",
+      "dicts": [{
+        "type": "ocd2",
+        "file": "STPhrases.ocd2"
+      }, {
+        "type": "ocd2",
+        "file": "STCharacters.ocd2"
+      }]
+    }
+  }]
+}
+```
+
+#### 3.2.4 CMake 集成
+
+```cmake
+# CMakeLists.txt 添加选项
+option(ENABLE_JIEBA "Enable Jieba segmentation support" OFF)
+
+if(ENABLE_JIEBA)
+  # 添加 cppjieba 子模块或 FetchContent
+  add_subdirectory(deps/cppjieba EXCLUDE_FROM_ALL)
+
+  # 或使用 header-only 版本
+  # include_directories(deps/libcppjieba/include)
+
+  target_compile_definitions(libopencc PRIVATE ENABLE_JIEBA)
+  target_sources(libopencc PRIVATE src/JiebaSegmentation.cpp)
+  target_link_libraries(libopencc PRIVATE cppjieba)
+endif()
+```
+
+### 3.3 优势与劣势分析
+
+#### ✅ 优势
+1. **性能优秀**：纯 C++ 实现，无跨语言调用开销
+2. **部署简单**：编译时集成，运行时无额外依赖
+3. **维护成本低**：cppjieba 是成熟项目，社区活跃
+4. **一致性好**：与 OpenCC 同为 C++ 项目，编码和内存管理一致
+5. **可选编译**：通过 `ENABLE_JIEBA` 开关控制，不影响默认构建
+
+#### ❌ 劣势
+1. **词典额外空间**：jieba 词典（~20MB）需单独维护
+2. **编译时间增加**：增加了编译单元
+3. **初始化开销**：jieba 启动时需加载词典和 HMM 模型（约 100ms）
+
+### 3.4 实施建议
+
+1. **渐进式集成**：
+   - 第一阶段：添加 `JiebaSegmentation` 类，通过编译选项控制
+   - 第二阶段：提供示例配置文件和词典
+   - 第三阶段：性能测试和文档完善
+
+2. **词典管理**：
+   - 将 jieba 词典放在 `data/jieba_dict/` 目录
+   - 提供下载脚本，避免增大主仓库体积
+
+3. **测试策略**：
+   - 对比 mmseg 和 jieba 的分词差异
+   - 验证转换结果的准确性
+   - 性能基准测试
+
+---
+
+## 四、方案二：嵌入 Python jieba
+
+### 4.1 pybind11 简介
+
+**项目地址**：[pybind/pybind11](https://github.com/pybind/pybind11)
+
+**特点**：
+- ✅ 轻量级的 C++/Python 互操作库
+- ✅ 支持在 C++ 中嵌入 Python 解释器
+- ✅ Header-only 可选
+- ✅ 支持 Python 3.6+
+
+### 4.2 集成方案设计
+
+#### 4.2.1 新增类 `PythonJiebaSegmentation`
+
+```cpp
+// src/PythonJiebaSegmentation.hpp
+#pragma once
+
+#include "Common.hpp"
+#include "Segmentation.hpp"
+#include <pybind11/embed.h>
+
+namespace py = pybind11;
+
+namespace opencc {
+
+class OPENCC_EXPORT PythonJiebaSegmentation : public Segmentation {
+public:
+  PythonJiebaSegmentation();
+  virtual ~PythonJiebaSegmentation();
+
+  virtual SegmentsPtr Segment(const std::string& text) const override;
+
+private:
+  static bool python_initialized_;
+  py::object jieba_module_;
+  py::object cut_function_;
+};
+
+} // namespace opencc
+```
+
+```cpp
+// src/PythonJiebaSegmentation.cpp
+#include "PythonJiebaSegmentation.hpp"
+
+namespace opencc {
+
+bool PythonJiebaSegmentation::python_initialized_ = false;
+
+PythonJiebaSegmentation::PythonJiebaSegmentation() {
+  if (!python_initialized_) {
+    py::initialize_interpreter();
+    python_initialized_ = true;
+  }
+
+  // 导入 jieba 模块
+  jieba_module_ = py::module_::import("jieba");
+  cut_function_ = jieba_module_.attr("cut");
+}
+
+PythonJiebaSegmentation::~PythonJiebaSegmentation() {
+  // 注意：不应在这里 finalize，可能有多个实例
+}
+
+SegmentsPtr PythonJiebaSegmentation::Segment(const std::string& text) const {
+  SegmentsPtr segments(new Segments);
+
+  // 调用 jieba.cut(text, cut_all=False, HMM=True)
+  py::list words = cut_function_(text, false, true);
+
+  for (auto word : words) {
+    segments->AddSegment(word.cast<std::string>());
+  }
+
+  return segments;
+}
+
+} // namespace opencc
+```
+
+### 4.3 优势与劣势分析
+
+#### ✅ 优势
+1. **功能完整**：直接使用官方 Python jieba，功能最全
+2. **维护简单**：无需维护 C++ 移植版本
+3. **灵活性高**：可轻松切换 jieba 版本或使用其他 Python 分词库
+
+#### ❌ 劣势（严重）
+1. **性能开销巨大**：
+   - Python 解释器启动时间：~50-200ms
+   - 跨语言调用：每次分词增加 10-50% 开销
+   - GIL 锁限制：无法利用多线程
+
+2. **部署复杂**：
+   - 运行时依赖 Python 和 jieba 包
+   - 跨平台部署困难（Python 版本兼容性）
+   - Docker/容器化需额外配置
+
+3. **稳定性风险**：
+   - Python 异常处理复杂
+   - 内存管理冲突（C++ 和 Python 的 GC）
+   - 多线程环境下的 GIL 竞争
+
+4. **用户体验差**：
+   - 增加了对 Python 环境的强依赖
+   - 安装和配置复杂
+
+### 4.4 结论
+
+**不推荐此方案**，仅在以下极端场景考虑：
+- 仅用于原型验证或内部工具
+- 性能不是关键指标
+- 已有完整的 Python 运行环境
+
+---
+
+## 五、方案对比总结
+
+| 维度 | 方案一：cppjieba | 方案二：Python jieba |
+|------|------------------|----------------------|
+| **性能** | ⭐⭐⭐⭐⭐ 原生 C++，零开销 | ⭐⭐ 跨语言调用，10-50% 开销 |
+| **部署** | ⭐⭐⭐⭐⭐ 静态编译，零依赖 | ⭐⭐ 需 Python + jieba 包 |
+| **维护** | ⭐⭐⭐⭐ cppjieba 成熟稳定 | ⭐⭐⭐⭐⭐ 官方 jieba，无需移植 |
+| **集成难度** | ⭐⭐⭐⭐ 中等（CMake + 代码） | ⭐⭐⭐ 复杂（pybind11 + Python） |
+| **稳定性** | ⭐⭐⭐⭐⭐ 纯 C++，无异常风险 | ⭐⭐⭐ GIL + 异常传播 |
+| **用户体验** | ⭐⭐⭐⭐⭐ 透明集成 | ⭐⭐ 需安装 Python 依赖 |
+
+**推荐方案**：**方案一（cppjieba）**
+
+---
+
+## 六、实施路线图
+
+### 阶段 1：基础集成（1-2 周）
+- [ ] 添加 cppjieba 作为 Git 子模块或依赖
+- [ ] 实现 `JiebaSegmentation` 类
+- [ ] 修改 `Config.cpp` 支持 `type: "jieba"`
+- [ ] 添加编译选项 `ENABLE_JIEBA`
+
+### 阶段 2：词典和配置（1 周）
+- [ ] 准备 jieba 词典文件（`data/jieba_dict/`）
+- [ ] 创建示例配置文件（`s2t_jieba.json` 等）
+- [ ] 编写词典下载/更新脚本
+
+### 阶段 3：测试和优化（2 周）
+- [ ] 编写单元测试（`src/JiebaSegmentationTest.cpp`）
+- [ ] 对比测试：mmseg vs jieba 的分词差异
+- [ ] 性能基准测试（时间、内存）
+- [ ] 集成到 CI/CD
+
+### 阶段 4：文档和发布（1 周）
+- [ ] 更新 `README.md` 和 `CONTRIBUTING.md`
+- [ ] 编写用户指南（如何启用 jieba）
+- [ ] 发布 beta 版本，收集社区反馈
+
+---
+
+## 七、潜在问题与解决方案
+
+### 7.1 词典冲突
+**问题**：jieba 和 OpenCC 的词典可能对同一个词有不同的切分。
+
+**解决方案**：
+- 允许用户提供自定义 jieba 词典
+- 提供"简繁转换优化"的 jieba 词典版本
+- 在文档中说明差异和选择建议
+
+### 7.2 性能回退
+**问题**：jieba 的初始化和 HMM 计算可能比 mmseg 慢。
+
+**解决方案**：
+- 在转换器初始化时预加载 jieba（避免每次转换都初始化）
+- 提供性能模式开关（禁用 HMM，仅用词典）
+- 缓存分词结果（针对重复文本）
+
+### 7.3 跨平台兼容性
+**问题**：cppjieba 在 Windows 上可能有编码问题。
+
+**解决方案**：
+- 统一使用 UTF-8 编码
+- 添加 Windows 特定的测试用例
+- 在 CI 中覆盖 Linux / macOS / Windows
+
+### 7.4 版本兼容性
+**问题**：未来 cppjieba 更新可能破坏 API。
+
+**解决方案**：
+- 锁定 cppjieba 版本（Git submodule 指定 commit）
+- 使用 PIMPL 模式隔离实现细节
+- 提供适配层抽象
+
+---
+
+## 八、参考资料
+
+### 技术文档
+- [jieba 中文分词](https://github.com/fxsjy/jieba)
+- [cppjieba C++ 实现](https://github.com/yanyiwu/cppjieba)
+- [pybind11 文档](https://pybind11.readthedocs.io/)
+
+### 算法原理
+- [中文分词原理理解 + jieba 分词详解](https://zhuanlan.zhihu.com/p/66904318)
+- [jieba 结巴分词原理浅析与理解](https://cloud.tencent.com/developer/article/1831564)
+
+### 集成案例
+- [NodeJieba - Node.js 绑定](https://github.com/yanyiwu/nodejieba)
+- [GoJieba - Go 语言绑定](https://github.com/yanyiwu/gojieba)
+
+---
+
+## 九、结论与建议
+
+### 核心建议
+1. ✅ **推荐方案一（cppjieba）**：性能、部署、维护综合最优
+2. ⚠️ **实验性功能**：通过配置文件选择，不影响现有用户
+3. 📊 **数据驱动**：提供性能对比数据，让用户自主选择
+4. 📖 **文档完善**：清晰说明两种分词算法的差异和适用场景
+
+### 长期展望
+- 支持更多分词算法（如 HanLP、LTP）
+- 提供分词插件机制（动态加载）
+- 社区贡献优化词典
+
+### 风险评估
+- **低风险**：通过编译选项和独立配置隔离
+- **高收益**：显著提升现代文本的转换准确度
+- **可逆性**：随时可移除或禁用 jieba 支持
+
+---
+
+**文档版本**：v1.0
+**最后更新**：2026-01-16
+**作者**：Claude (Anthropic AI)