feat: production-ready OpenAI Responses API support by tasoo-oos · Pull Request #1446 · kwaroran/Risuai

tasoo-oos · 2026-05-14T02:16:54Z

PR Checklist

Summary

Rewrites and significantly expands the OpenAI Responses API support. The previous implementation was a minimal stub: no streaming, no tool calls, no structured output, no reasoning, no NanoGPT wiring, and several correctness issues.

Related Issues

None.

Changes

Refactor

requestOpenAIResponseAPI was extracted from requests.ts into a new responses.ts module and re-exported.
LocalNetworkRequestOptions and getLocalNetworkRequestOptions were extracted into a new shared.ts used by both modules.
ResponseOutputItem.status type corrected from 'complete' to 'completed'.

Behavior added/changed

Streaming support for Responses API (SSE TransformStream parser).
Function/tool call execution with multi-turn continuation.
Sanitized continuation payloads for store: false: server IDs and reasoning items are stripped before re-submission.
Reasoning summaries and reasoning_text content extracted and wrapped in <Thoughts>...</Thoughts>.
Refusal-only responses surfaced as text.
JSON schema structured output via text.format.
reasoning_effort and verbosity model parameters.
Web search tool via web_search_preview.
Multimodal inputs (input_image, input_file).
developer role conversion for models with the DeveloperRole flag.
NanoGPT Responses endpoint, model selection, auth, and provider header.
- It is not exposed in the UI, so end users cannot select it.
Reverse proxy and custom provider endpoint autofill, additional parameters, and headers.
Local-network routing and streaming timeout support.
Null-safe aiModel?.startsWith('xcustom:::') in the Chat Completions path.
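
For reviewers unfamiliar with the streaming piece, the SSE TransformStream parser listed above can be pictured roughly as follows. This is a minimal sketch, not the code in responses.ts: `SseEvent`, `parseSseBlock`, `SseBuffer`, and `createSseParser` are hypothetical names, and it assumes events are delimited by a blank line and may be split at arbitrary chunk boundaries.

```typescript
// Sketch only: all names here are hypothetical, not the identifiers in responses.ts.
interface SseEvent {
  event: string | undefined; // value of the "event:" field, if present
  data: string;              // "data:" lines joined with "\n"
}

// Parse one blank-line-delimited SSE block into an event (null if it carries no data).
function parseSseBlock(block: string): SseEvent | null {
  let event: string | undefined;
  const data: string[] = [];
  for (const line of block.split(/\r?\n/)) {
    if (line.startsWith("event:")) {
      event = line.slice("event:".length).trim();
    } else if (line.startsWith("data:")) {
      // The SSE format drops a single leading space after the colon.
      data.push(line.slice("data:".length).replace(/^ /, ""));
    }
  }
  return data.length > 0 ? { event, data: data.join("\n") } : null;
}

// Accumulates text chunks and returns only events that have fully arrived.
class SseBuffer {
  private pending = "";

  push(chunk: string): SseEvent[] {
    this.pending += chunk;
    const blocks = this.pending.split(/\r?\n\r?\n/);
    // The last element is an incomplete event (or ""), so keep it for the next chunk.
    this.pending = blocks.pop() ?? "";
    return blocks
      .map((block) => parseSseBlock(block))
      .filter((e): e is SseEvent => e !== null);
  }

  flush(): SseEvent[] {
    const rest = parseSseBlock(this.pending);
    this.pending = "";
    return rest ? [rest] : [];
  }
}

// Wraps the buffer in a TransformStream so it can sit in a streaming pipeline.
function createSseParser(): TransformStream<string, SseEvent> {
  const buffer = new SseBuffer();
  return new TransformStream<string, SseEvent>({
    transform(chunk, controller) {
      for (const e of buffer.push(chunk)) controller.enqueue(e);
    },
    flush(controller) {
      for (const e of buffer.flush()) controller.enqueue(e);
    },
  });
}
```

In a fetch-based pipeline, something like this would typically follow a `TextDecoderStream`, so the parser receives decoded text rather than raw bytes. Keeping the buffering logic in a plain class also makes it testable without constructing a stream.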

Tests added

requests.responses.test.ts: covers request body construction, reasoning summaries, custom/reverse proxy params, NanoGPT, text extraction, incomplete/failed responses, tool continuation sanitization (non-streaming and streaming), SSE chunk parsing, streamed reasoning, and streaming error events.
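
To make the tool continuation sanitization concrete, here is a minimal sketch of the idea described under "Behavior added/changed": drop reasoning items and server-assigned IDs from the previous output, then append the tool results. `ResponseItem`, `ToolResult`, and `buildContinuationInput` are hypothetical names and the item shape is simplified; the actual implementation in responses.ts may differ.

```typescript
// Sketch only: hypothetical names and a simplified item shape.
interface ResponseItem {
  type: string;
  id?: string;
  [key: string]: unknown;
}

interface ToolResult {
  callId: string; // matches the call_id of the function_call item being answered
  output: string; // serialized tool result
}

function buildContinuationInput(
  previousInput: ResponseItem[],
  outputItems: ResponseItem[],
  toolResults: ToolResult[],
): ResponseItem[] {
  const sanitized = outputItems
    // Reasoning items are not re-submitted when store is false.
    .filter((item) => item.type !== "reasoning")
    .map((item) => {
      // Copy first so the original response object is left untouched.
      const copy: ResponseItem = { ...item };
      delete copy.id; // strip the server-assigned item ID
      return copy;
    });

  const results: ResponseItem[] = toolResults.map((r) => ({
    type: "function_call_output",
    call_id: r.callId,
    output: r.output,
  }));

  return [...previousInput, ...sanitized, ...results];
}
```

Note that `call_id` is kept while `id` is removed: the former is what links a `function_call_output` back to its `function_call`, whereas the latter refers to a server-side item that is presumably not retrievable when nothing is stored.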

Impact

The Chat Completions path (requestOpenAI) is unchanged.
All LLMFormat.OpenAIResponseAPI models now correctly support streaming, tool calls, structured output, and reasoning.
NanoGPT Responses format is properly wired (was dead code before).
- It is not exposed to the UI yet.
Reverse proxy and custom providers support the Responses format with the same additional-params/headers behavior as Chat Completions.
Streaming for Responses API is entirely new.
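
As a rough illustration of how structured output, reasoning, and verbosity map onto a Responses request body, the sketch below assembles one from a few options. `BodyOptions` and `buildResponsesBody` are hypothetical names, and always sending `store: false` and `summary: "auto"` are assumptions made for the example, not a description of what responses.ts does.

```typescript
// Sketch only: hypothetical helper; defaults are assumptions made for illustration.
interface BodyOptions {
  model: string;
  input: unknown[];
  stream?: boolean;
  reasoningEffort?: "minimal" | "low" | "medium" | "high";
  verbosity?: "low" | "medium" | "high";
  jsonSchema?: { name: string; schema: Record<string, unknown> };
}

function buildResponsesBody(opts: BodyOptions): Record<string, unknown> {
  const body: Record<string, unknown> = {
    model: opts.model,
    input: opts.input,
    stream: opts.stream ?? false,
    store: false, // assumption: stateless requests
  };

  if (opts.reasoningEffort) {
    // Assumption: a reasoning summary is requested whenever an effort is set.
    body["reasoning"] = { effort: opts.reasoningEffort, summary: "auto" };
  }

  // Verbosity and the structured-output format both live under "text".
  const text: Record<string, unknown> = {};
  if (opts.verbosity) text["verbosity"] = opts.verbosity;
  if (opts.jsonSchema) {
    text["format"] = {
      type: "json_schema",
      name: opts.jsonSchema.name,
      schema: opts.jsonSchema.schema,
      strict: true,
    };
  }
  if (Object.keys(text).length > 0) body["text"] = text;

  return body;
}
```

Optional sections are omitted entirely rather than sent as empty objects, which keeps the payload minimal for providers that may reject unknown or empty fields.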

Additional Notes

Manual E2E tests completed:

With OpenAI's official Responses API
- Plain Streaming / Non-streaming request
- Multimodal Input (inlay image file)
- Integrated tool use (web_search_preview via Chat bot -> Others -> Tools -> Search)
- External tool use (Risuai Access MCP / RisuAI Dice MCP) with Streaming / Non-streaming request
- Summarized Reasoning Inclusion with tool use
With OpenRouter Responses API (by Custom API + https://openrouter.ai/api/v1/responses endpoint)
- Plain Streaming / Non-streaming request
- Multimodal Input (inlay image file with hasImageInput custom flags)
- Tool use (RisuAI Dice MCP) with Streaming / Non-streaming request
- Reasoning Inclusion with tool use
With NanoGPT Responses API (by Custom API + https://nano-gpt.com/api/v1/responses endpoint)
- Plain Streaming / Non-streaming request
- Multimodal Input (inlay image file with hasImageInput custom flags)
- Tool use (RisuAI Dice MCP) with Streaming / Non-streaming request

Modifies the behavior of prompting, requesting, or handling responses from AI models.
Over 80% of the code is AI generated.

tasoo-oos added 13 commits May 12, 2026 15:45

feat: expand OpenAI responses API support

9c267ec

test: validate OpenAI responses API

28febee

fix: harden OpenAI responses API

2b3b2de

test: finalize responses API validation

c2bb76a

chore: remove unnecessary LOG.md

fac8264

fix: sanitize responses tool continuation

bc26fd8

fix: strip responses server ids from continuation

6041087

fix: parse responses reasoning content

5df1c71

feat: request responses reasoning summaries

5c16f13

use double newline for reasoning formatting

671997a

refactor: split OpenAI responses API module

0d65d16

fix: change ResponseOutput type and change thought wrapping

e749770

chore: remove LOG.md again

ec7ebc0

tasoo-oos force-pushed the feat-openai-response-api-overhaul branch from 57bafae to ec7ebc0 on May 16, 2026 16:02

fix: reasoning parsing change matched

729d851

tasoo-oos marked this pull request as ready for review May 16, 2026 17:23

refactor: type responses api items

50078ec

tasoo-oos wants to merge 15 commits into kwaroran:main from tasoo-oos:feat-openai-response-api-overhaul
