From a2e6f1603f53f3dfa33ff4dba6b65369bc4a375d Mon Sep 17 00:00:00 2001
From: leagames0221-sys <leagames0221@users.noreply.github.com>
Date: Mon, 18 May 2026 05:12:02 +0900
Subject: [PATCH] docs(data-analytics-demo): add Verified ML metrics section
 citing CI-enforced floors
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Documents the two CI-enforced floors (Churn ROC-AUC ≥ 0.70 via AC-3.2,
Upsell lift @ top-10% ≥ 1.5× via AC-3.7) with literal links to the
asserting test files and explanation of the seed-based reproduction
contract. External-facing portfolio copy can now cite this section
instead of presenting unsourced specific numbers.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 packages/data-analytics-demo/README.md | 18 ++++++++++++++++++
 1 file changed, 18 insertions(+)

diff --git a/packages/data-analytics-demo/README.md b/packages/data-analytics-demo/README.md
index 08d3afa..74684a9 100644
--- a/packages/data-analytics-demo/README.md
+++ b/packages/data-analytics-demo/README.md
@@ -41,6 +41,24 @@ open dashboard/build/index.html                       # (or your platform equiva
 | `tests/`                   | pytest suite — one file per layer plus an end-to-end test                                                                                                                                                                            |
 | `docs/architecture.md`     | Pipeline diagram + per-layer details                                                                                                                                                                                                 |
 
+## Verified ML metrics (CI-enforced floors)
+
+The ML layer ships with two acceptance criteria literally asserted by the test
+suite and re-checked on every push by [python-test.yml](../../.github/workflows/python-test.yml).
+Both floors are enforced at `make demo` time as well — the pipeline halts with a
+non-zero exit if either model regresses below floor (AC-α.2 + AC-3.2 + AC-3.7).
+
+| Metric                   | Floor  | Model selection                                                                                                        | Test gate                                                         |
+| ------------------------ | ------ | ---------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------- |
+| Churn — hold-out ROC-AUC | ≥ 0.70 | Higher of LogisticRegression baseline vs XGBoost; chosen model + score persisted to `ml/artifacts/churn_metadata.json` | [tests/test_ml_churn.py:82-91](tests/test_ml_churn.py) (AC-3.2)   |
+| Upsell — lift @ top-10%  | ≥ 1.5× | LogisticRegression with stratified train/test; lift report persisted to `ml/artifacts/upsell_lift_report.json`         | [tests/test_ml_upsell.py:74-86](tests/test_ml_upsell.py) (AC-3.7) |
+
+The reproduction command is `DEMO_RANDOM_SEED=42 make data && make dbt && make ml`;
+seed 42 yields byte-deterministic outputs (AC-1.5). The generated
+`ml/artifacts/*.json` files contain the literal achieved scores from that run —
+they are gitignored because the source of truth is the regeneration command, not
+a frozen snapshot.
+
 ## Architecture (one-line summary per layer)
 
 ```