[Feature] Considering adding pruning thinking token

**Problem:**
When using reasoning models (e.g., Gemini 3 Pro/Flash, Claude Opus 4.5 with extended thinking), thinking tokens consume context window space but provide no utility once the final response is generated.

**Proposed Solution:**

Detect reasoning blocks across providers:
- OpenAI: reasoning field in response
- Anthropic: <thinking> tags
- Other providers as needed

Dynamically prune thinking tokens from context before subsequent requests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Considering adding pruning thinking token #258

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Feature] Considering adding pruning thinking token #258

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions