Production-simulated RAG + agentic migration intelligence platform for version-safe developer change decisions.
flowchart TD
INPUT["🔍 User Query / Migration Goal\ne.g. migrate requests v2 → v3"] --> QP
subgraph QM["Query Mode"]
QP["Query Parser\nVersion extraction · complexity routing"]
RET["Version-Safe Retrieval\nHard version filter · wrong-version rate = 0.0\nHybrid Recall@5 = 0.94"]
CD["Conflict Detector\nStale · contradictory · deprecated\nConflict Macro F1 = 0.966"]
SYN["LLM-Last Synthesis\nOnly when evidence is grounded\nCitation assembly · fallback audit"]
REP["Migration Report\nSAFE / RISKY / BLOCKED"]
end
QP --> RET --> CD --> SYN --> REP
REP --> GP
subgraph GM["Goal Mode"]
GP["GoalParser + DependencyDeltaDetector"]
TP["TaskPlanner → TaskExecutor\nBounded retry · staged migration"]
RD["RecoveryDecider · Escalation"]
PS["PlanSummaryReporter"]
end
GP --> TP --> RD --> PS
PS --> RA
subgraph EXT["Extensions"]
RA["Repo-Aware Analysis\n10 risky callsites · DO_NOT_APPLY_WITHOUT_REVIEW"]
PP["Patch + PR Simulation\nDiff · test sim · triage · rollback plan"]
BT["37-Day Backtest\n2,479 queries · 70 evidence artifacts"]
end
RA --> PP --> BT
BT --> DASH["📊 Evidence Dashboard\nGitHub Pages · public"]
| Layer | What it proves |
|---|---|
| Version-safe RAG | Hard version filtering, zero wrong-version answer rate, citation-backed synthesis |
| Deterministic conflict detection | Stale, contradictory, deprecated, and cross-source conflict handling |
| Agentic Goal Mode | Bounded task planning, recovery, escalation, and final migration decisioning |
| Repo-aware analysis | Local repository scan, dependency usage mapping, risky callsite detection |
| Patch + PR simulation | Reviewer-safe patch diff, test simulation, triage, checklist, rollback plan |
| Evidence dashboard | Public visual dashboard summarizing 70 generated artifacts |
DevPulse helps engineering teams reason about dependency migrations by combining version-aware retrieval, deterministic conflict detection, LLM-last synthesis boundaries, agentic migration planning, repo-aware callsite analysis, patch proposal simulation, test/triage simulation, and a static evidence dashboard.
Open the visual showcase:
https://sidharthkriplani.github.io/devpulse_platform/
The dashboard gives a fast view of:
- PRD completion status
- Query Mode and Goal Mode flow
- RAG evaluation metrics
- repo-aware migration risk
- patch and PR simulation
- final validation artifacts
- truth boundary and evidence inventory
Dependency migrations often fail because teams rely on stale docs, ambiguous version references, contradictory changelogs, or incomplete migration notes.
DevPulse is designed to answer questions like:
- “How should I migrate this dependency from v2 to v3?”
- “Are these docs stale or contradictory?”
- “Which source-code callsites are risky?”
- “Can this migration proceed safely, or should it be blocked?”
- “What evidence supports the recommendation?”
The system intentionally avoids blind LLM output. It uses deterministic gates first, then allows synthesis only when evidence is sufficiently grounded.
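The "deterministic gates first, synthesis last" ordering described above can be sketched roughly as follows. This is a minimal illustration, not the repository's implementation: the `Evidence` fields, the `gate` and `answer` functions, and the exact mapping onto SAFE / RISKY / BLOCKED are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Evidence:
    source_id: str      # hypothetical citation key
    version: str        # version the source documents
    has_conflict: bool  # assumed to be set by an upstream conflict detector


def gate(evidence: list[Evidence], target_version: str) -> tuple[str, list[Evidence]]:
    """Deterministic gate: returns a verdict and the evidence that survived."""
    matching = [e for e in evidence if e.version == target_version]
    if not matching:
        return "BLOCKED", []  # nothing documents the requested version
    grounded = [e for e in matching if not e.has_conflict]
    verdict = "SAFE" if len(grounded) == len(matching) else "RISKY"
    return verdict, grounded


def answer(evidence: list[Evidence], target_version: str,
           synthesize: Callable[[list[Evidence]], str]) -> dict:
    """Call the synthesis step only when grounded evidence remains."""
    verdict, grounded = gate(evidence, target_version)
    text: Optional[str] = synthesize(grounded) if grounded else None
    return {
        "verdict": verdict,
        "answer": text,
        "citations": [e.source_id for e in grounded],  # assembled in code, not by the LLM
    }
```

In this sketch the `synthesize` callable stands in for the LLM call. It is never invoked when the gate returns no grounded evidence, which is the property the paragraph above describes.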
flowchart TD
INPUT["🔍 User Query / Migration Goal\ne.g. 'migrate requests v2 → v3'"] --> QM
subgraph QM["Query Mode src/query/"]
QP["Query Parser\nVersion extraction · complexity routing"]
RET["Version-Safe Retrieval\nHard version filter · wrong-version rate = 0.0\nHybrid Recall@5 = 0.94"]
CD["Conflict Detector\nStale · contradictory · deprecated · cross-source\nConflict Macro F1 = 0.966"]
SYN["LLM-Last Synthesis\nOnly when evidence is grounded\nCitation assembly · fallback audit"]
REP["Migration Report\nSAFE / RISKY / BLOCKED"]
end
QP --> RET --> CD --> SYN --> REP
REP --> GM
subgraph GM["Goal Mode src/goal/"]
GP["GoalParser"]
DD["DependencyDeltaDetector"]
TP["TaskPlanner → TaskExecutor\nBounded retry cap · staged migration"]
RD["RecoveryDecider\nEscalation on repeated failure"]
PS["PlanSummaryReporter"]
end
GP --> DD --> TP --> RD --> PS
PS --> RA
subgraph RA["Repo-Aware Extension src/repo/"]
SCAN["Local Repo Scanner\n10 callsites found"]
MAP["Dependency Usage Mapper"]
RISK["Risky Callsite Detector\n10/10 risky · DO_NOT_APPLY_WITHOUT_REVIEW"]
end
RA --> PP
subgraph PP["Patch + PR Simulation src/patch/"]
DIFF["Patch Diff Proposal"]
TEST["Before/After Test Simulation"]
TRIAGE["Failure Triage · Reviewer Checklist"]
ROLL["Rollback Plan"]
end
PP --> DASH["📊 Evidence Dashboard\nGitHub Pages · 70 artifacts\n37-day backtest · 2,479 queries"]
PP --> BT["37-Day Backtest · 2,479 queries\nRAG eval · 180 queries"]
| Area | Result |
|---|---|
| PRD v3.0 status | pass |
| Query Mode artifacts | 24 |
| Goal Mode artifacts | 8 |
| Core evidence artifacts (Query Mode + Goal Mode) | 32 |
| Failure/recovery scenarios | 19 |
| Wrong-version answer rate | 0.0 |
| RAG eval query count | 180 |
| Hybrid Recall@5 | 0.94 |
| Reranker simulated Recall@5 | 0.97 |
| Conflict Macro F1 | 0.966 |
| 37-day backtest queries | 2,479 |
| Repo-aware callsites found | 10 |
| Risky callsites found | 10 |
| Patch recommendation | DO_NOT_APPLY_WITHOUT_REVIEW |
| Dashboard artifact inventory | 70 artifacts |
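
For readers unfamiliar with the retrieval and classification metrics in the table, the sketch below shows one common way to compute Recall@k and macro-averaged F1. It is illustrative only: the evaluation scripts in this repository may define these metrics differently (for example, Recall@5 as a per-query hit rate), and the function names and sample labels are assumptions.

```python
def recall_at_k(ranked_ids, relevant_ids, k=5):
    """Fraction of the relevant documents that appear in the top-k results."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    return len(set(ranked_ids[:k]) & relevant) / len(relevant)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every label seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

Macro averaging weights every conflict class equally, so a rare class such as a deprecated-source conflict counts as much as a common one.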
DevPulse Query Mode includes:
- deterministic query parsing
- version extraction
- complexity routing
- version-filtered retrieval simulation
- deterministic conflict detection
- SAFE / RISKY / BLOCKED migration reports
- LLM-last synthesis boundary
- programmatic citation assembly
- fallback and audit artifacts
- 24 evidence artifacts
- 10 failure/recovery scenarios
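As a rough illustration of the version extraction and hard version filter listed above, the sketch below pulls the source and target versions out of a query and drops every chunk that is not tagged with the target version before any ranking happens. The regular expression, the function names, and the chunk dictionary layout are assumptions made for this example, not the project's actual parser or schema.

```python
import re

# Assumes versions are written with a leading "v", as in "v2" or "v3.1".
VERSION_RE = re.compile(r"\bv(\d+(?:\.\d+)*)\b", flags=re.IGNORECASE)


def extract_versions(query):
    """Return (source_version, target_version), or None if fewer than two are found."""
    found = VERSION_RE.findall(query)
    if len(found) < 2:
        return None
    return found[0], found[1]


def filter_by_version(chunks, target_version):
    """Hard filter: keep only chunks whose metadata matches the target version exactly."""
    return [c for c in chunks if c["version"] == target_version]


query = "migrate requests v2 → v3"
versions = extract_versions(query)

# Hypothetical corpus chunks with version metadata.
chunks = [
    {"id": "guide-a", "version": "2"},
    {"id": "guide-b", "version": "3"},
]
candidates = filter_by_version(chunks, versions[1])
```

Because the filter runs before ranking, a chunk for the wrong version cannot be returned, however similar its text is to the query.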
DevPulse Goal Mode includes:
- GoalParser
- DependencyDeltaDetector
- TaskPlanner
- TaskExecutor
- RecoveryDecider
- PlanSummaryReporter
- controlled dependency target registry
- bounded retry cap
- staged migration recommendation
- 8 evidence artifacts
- 9 failure/recovery scenarios
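The bounded retry cap and escalation behaviour listed above can be sketched as a loop that stops after a fixed number of attempts and hands the failure to a reviewer instead of retrying forever. The function name, the default cap of 3, and the status strings are illustrative assumptions. The real `TaskExecutor` and `RecoveryDecider` may be structured differently.

```python
def run_with_retry_cap(task, max_attempts=3):
    """Run `task` up to `max_attempts` times, then escalate instead of looping."""
    errors = []
    for attempt in range(1, max_attempts + 1):
        try:
            result = task()
            return {"status": "DONE", "attempts": attempt, "result": result}
        except Exception as exc:  # broad on purpose: any failure counts toward the cap
            errors.append(str(exc))
    # Cap reached: report every failure so a human can decide what to do next.
    return {"status": "ESCALATED", "attempts": max_attempts, "errors": errors}
```

The cap keeps a migration task from consuming unbounded retries, and the collected error messages give the escalation step something concrete to report.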
The repo-aware extension scans a local sample repository and maps dependency risk to source-code callsites.
Artifacts include:
outputs/repo_aware/repo_inspection_report.json
outputs/repo_aware/dependency_usage_map.json
outputs/repo_aware/risky_callsite_report.json
outputs/repo_aware/repo_aware_extension_summary.json
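To give a sense of what mapping a dependency to source-code callsites can involve, here is a minimal sketch built on Python's standard `ast` module. It is not the scanner shipped in this repository: the `find_callsites` function is hypothetical, and it only detects direct attribute calls on the module name (such as `requests.get(...)`), so aliased imports and `from ... import` usage are missed.

```python
import ast


def find_callsites(source, module="requests"):
    """Return sorted (line_number, call_name) pairs for calls like `module.func(...)`."""
    tree = ast.parse(source)
    hits = []
    for node in ast.walk(tree):
        if (
            isinstance(node, ast.Call)
            and isinstance(node.func, ast.Attribute)
            and isinstance(node.func.value, ast.Name)
            and node.func.value.id == module
        ):
            hits.append((node.lineno, f"{module}.{node.func.attr}"))
    return sorted(hits)


# The sample is parsed as text only; nothing is imported or executed.
sample = (
    "import requests\n"
    "r = requests.get('https://example.com')\n"
    "requests.post('https://example.com', data={})\n"
)
callsites = find_callsites(sample)
```

Each reported line number could then be checked against a list of APIs affected by the migration to decide which callsites to flag as risky.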
DevPulse generates reviewer-safe migration proposal artifacts:
outputs/patches/proposed_file_changes.json
outputs/patches/proposed_migration_patch.diff
outputs/reports/patch_risk_report.json
outputs/test_simulation/before_tests_report.json
outputs/test_simulation/after_patch_tests_report.json
outputs/test_simulation/test_failure_triage_report.json
outputs/pr_simulation/pr_body.md
outputs/pr_simulation/pr_diff.patch
outputs/pr_simulation/reviewer_checklist.md
outputs/pr_simulation/rollback_plan.md
DevPulse includes deeper RAG evaluation artifacts:
outputs/rag_eval/retrieval_ablation_report.json
outputs/rag_eval/reranker_simulation_report.json
outputs/rag_eval/conflict_confusion_matrix.json
outputs/rag_eval/traffic_backtest_37_day_report.json
outputs/rag_eval/corpus_perturbation_report.json
outputs/rag_eval/rag_eval_hardening_summary_v35.json
Run the full PRD validation bundle:
PYTHONPATH=. python3 scripts/run_devpulse_complete_v3.py
PYTHONPATH=. python3 scripts/show_final_demo_report.py
Run the v3.5 extensions:
PYTHONPATH=. python3 scripts/run_repo_aware_scan_v35.py
PYTHONPATH=. python3 scripts/validate_repo_aware_extension_v35.py
PYTHONPATH=. python3 scripts/run_patch_pr_simulation_v35.py
PYTHONPATH=. python3 scripts/validate_patch_pr_simulation_v35.py
PYTHONPATH=. python3 scripts/run_rag_eval_hardening_v35.py
PYTHONPATH=. python3 scripts/validate_rag_eval_hardening_v35.py
PYTHONPATH=. python3 scripts/build_dashboard_v35.py
PYTHONPATH=. python3 scripts/validate_dashboard_v35.py
open outputs/dashboard/index.html

| Artifact | Path |
|---|---|
| Final PRD completion report | outputs/reports/devpulse_prd_completion_report_v3.json |
| Final demo report | outputs/reports/devpulse_final_demo_report.txt |
| Query Mode demo | outputs/evidence/devpulse_demo_report.txt |
| Agentic demo | outputs/evidence/agentic_demo_report.txt |
| Goal plan summary | outputs/evidence/plan_summary_report.json |
| RAG hardening summary | outputs/rag_eval/rag_eval_hardening_summary_v35.json |
| Repo inspection report | outputs/repo_aware/repo_inspection_report.json |
| Risky callsite report | outputs/repo_aware/risky_callsite_report.json |
| Patch risk report | outputs/reports/patch_risk_report.json |
| PR simulation body | outputs/pr_simulation/pr_body.md |
| Static dashboard | docs/index.html |
DevPulse is a solo-built, non-production, production-simulated system.
It does not claim:
- real production SaaS usage
- real production traffic
- real users
- live npm/PyPI/Maven registry integration
- real GitHub PR creation
- real CI execution
- autonomous production code mutation
- autonomous merge safety
- production deployment
The project is intentionally evidence-backed and simulation-bounded. Every major claim is supported by executable scripts, generated artifacts, validation reports, and a dashboard.
Built DevPulse, a production-simulated RAG + agentic migration intelligence platform with version-aware retrieval, deterministic conflict detection, LLM-last grounded synthesis, SAFE/RISKY/BLOCKED decisioning, repo-aware callsite risk analysis, patch/PR simulation, 37-day RAG backtesting, and a static evidence dashboard covering 70 generated artifacts.
| Path | Contents |
|---|---|
| configs/ | controlled registries and scope config |
| src/devpulse/ | core Query Mode and Goal Mode modules |
| scripts/ | executable demo, validation, and artifact builders |
| sample_repos/ | controlled local repo used for repo-aware simulation |
| outputs/evidence/ | core evidence artifacts |
| outputs/validation/ | validation reports |
| outputs/repo_aware/ | repo-aware migration scan artifacts |
| outputs/patches/ | patch proposal artifacts |
| outputs/pr_simulation/ | PR-ready simulation package |
| outputs/rag_eval/ | RAG evaluation hardening artifacts |
| outputs/dashboard/ | local static dashboard |
| docs/ | public GitHub Pages dashboard and documentation |
DevPulse is complete at the production-simulated, repo-evidence level and can be showcased publicly through GitHub Pages.