feat(ai-project-generator): add DeepSeek as a selectable LLM provider by parth0025 · Pull Request #203 · aliansoftwareteam/AlianHub-Project-Management-System

parth0025 · 2026-06-01T11:09:33Z

Summary

Adds DeepSeek as a third LLM provider for the AI-Powered Project Creation feature, alongside the existing OpenAI and Anthropic providers. Selectable via LLM_PROVIDER="deepseek". Also fixes a latent bug in the AI task time estimator that affected all thinking models.

Changes

New provider — `llmProvider/deepseekProvider.js`

DeepSeek exposes an OpenAI-compatible Chat Completions API, so this mirrors openaiProvider.js (same body shape, Bearer auth, response_format: json_object, choices[0].message + usage.*).
Configurable base URL via DEEPSEEK_BASE_URL (default https://api.deepseek.com).
Omits temperature for deepseek-reasoner (thinking mode); sends it for deepseek-chat / deepseek-v4-flash / deepseek-v4-pro.
Full error mapping → friendly codes: 402 + quota-429 → LLM_QUOTA_EXCEEDED, 401/403 → LLM_AUTH_FAILED, 400 → LLM_BAD_REQUEST, 502/503 → LLM_UNAVAILABLE, timeout → LLM_TIMEOUT.
Output clamp 65536 — above the orchestrator's 32000 default, under DeepSeek V4's 384K ceiling — so plan requests pass through unclamped.

Registry — `llmProvider/index.js`

Registered deepseek in SUPPORTED, added to the fallback chain and isAnyProviderConfigured().

Config — `config.js` + `.env.example`

Exposes DEEPSEEK_API_KEY / DEEPSEEK_MODEL; documents model options, timeout, and base-URL overrides.

Estimator fix — `EstimatedTime/aiTaskEstimator.js`

Raised the estimate call's maxTokens 1024 → 8192 and timeout 60s → 120s.
Why: thinking models (deepseek-reasoner, deepseek-v4-pro, OpenAI o-series) spend their output budget on hidden reasoning before the visible JSON. At 1024 the answer came back empty (finish_reason="length") → parseMinutes returned null → no estimate was ever saved. This is why DeepSeek v4-pro plans had zero task estimates while GPT-4.1 worked.

Verification

Check	Result
Provider selection via `LLM_PROVIDER="deepseek"`	✅
Per-model `temperature` handling (reasoner omits, others send)	✅
Output clamp pass-through (32000 unclamped; >ceiling → 65536)	✅
Live DeepSeek auth (401) + quota (402) error mapping	✅
Estimator sends 8192 + persists end-to-end	✅
`gpt-4.1` unchanged — live call: 108 tokens, `finish=stop`, est=85	✅
Bonus: same fix repairs OpenAI o-series estimator path	✅

Existing functionality

OpenAI and Anthropic providers are untouched and verified unchanged (gpt-4.1 produces identical results — the higher maxTokens cap is never hit, so zero cost/behavior change).

Notes for testing

DeepSeek's deepseek-chat / deepseek-reasoner are aliases that now resolve to V4-Flash; deepseek-v4-flash / deepseek-v4-pro are the canonical names.
Thinking models (reasoner, v4-pro) are slower and bill reasoning tokens; deepseek-chat/v4-flash are the cheapest/fastest for estimates.

🤖 Generated with Claude Code

Adds DeepSeek (LLM_PROVIDER="deepseek") alongside the existing OpenAI and Anthropic providers for the AI-Powered Project Creation feature. New provider — Modules/AIProjectGenerator/llmProvider/deepseekProvider.js - OpenAI-compatible Chat Completions API (same body shape, Bearer auth, response_format json_object, choices[0].message + usage.*) - Configurable base URL (DEEPSEEK_BASE_URL), default https://api.deepseek.com - Omits temperature for deepseek-reasoner (thinking mode); sends it for deepseek-chat / deepseek-v4-flash / deepseek-v4-pro - Full error mapping: 402 + quota-429 -> LLM_QUOTA_EXCEEDED, 401/403 -> LLM_AUTH_FAILED, 400 -> LLM_BAD_REQUEST, 502/503 -> LLM_UNAVAILABLE, timeout -> LLM_TIMEOUT - Output clamp set to 65536 (well above the orchestrator's 32000 default, under DeepSeek V4's 384K ceiling) so plan requests pass through unclamped Registry — llmProvider/index.js - Registered "deepseek" in SUPPORTED; added to fallback chain and isAnyProviderConfigured() Config — config.js + .env.example - Exposes DEEPSEEK_API_KEY / DEEPSEEK_MODEL - Documents all DeepSeek vars (model options, timeout, base URL) Estimator fix — Modules/EstimatedTime/aiTaskEstimator.js - Raised the estimate call's maxTokens 1024 -> 8192 and timeout 60s -> 120s. Thinking models (deepseek-reasoner, deepseek-v4-pro, OpenAI o-series) spend their output budget on hidden reasoning before the visible JSON; at 1024 the answer came back EMPTY and no estimate was saved. Verified safe + unchanged for gpt-4.1 (non-thinking: stops at ~108 tokens, zero cost change) and now also fixes the same latent bug for OpenAI o-series. Verified: provider selection, per-model temperature handling, output clamp pass-through, and a live gpt-4.1 estimate call (108 tokens, finish=stop). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

parth0025 self-assigned this Jun 1, 2026

parth0025 merged commit 0085e1b into staging Jun 1, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ai-project-generator): add DeepSeek as a selectable LLM provider#203

feat(ai-project-generator): add DeepSeek as a selectable LLM provider#203
parth0025 merged 1 commit into
stagingfrom
feat/deepseek-llm-provider

parth0025 commented Jun 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

parth0025 commented Jun 1, 2026

Summary

Changes

New provider — llmProvider/deepseekProvider.js

Registry — llmProvider/index.js

Config — config.js + .env.example

Estimator fix — EstimatedTime/aiTaskEstimator.js

Verification

Existing functionality

Notes for testing

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

New provider — `llmProvider/deepseekProvider.js`

Registry — `llmProvider/index.js`

Config — `config.js` + `.env.example`

Estimator fix — `EstimatedTime/aiTaskEstimator.js`