feat(inference): forward extra_body to remote VLM (DashScope enable_thinking)#65
Merged
Conversation
…hinking) Plumb a new extra_body kwarg from DescribeRequest through engine.analyze_frame and BaseBackend.generate to RemoteHTTPBackend, where it is passed verbatim to the OpenAI SDK (which threads it into the JSON payload). This lets cortex toggle DashScope-specific knobs like enable_thinking=False per-request without adding model-specific fields to the public schema. Local backends accept and ignore the kwarg. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
extra_bodykwarg fromDescribeRequestthroughengine.analyze_frameandBaseBackend.generatetoRemoteHTTPBackend, passed verbatim to the OpenAI SDK.enable_thinking=False) per-request without adding model-specific fields to the public schema.Test plan
/describerequest withextra_body={"enable_thinking": false}against a DashScope-backed remote VLM and confirm the field reaches the upstream payload.🤖 Generated with Claude Code