
fix: incremental context compression — correct token tracking, add early budget check, fix UI display #78

Open
competitiveNN wants to merge 3 commits into proxysoul:main from competitiveNN:fix/context-tracking

Conversation

@competitiveNN
Contributor

fix: incremental context compression — correct token tracking, add early budget check, fix UI display

  • Forge: cache actual system prompt size in instructionsCache and expose via getCachedInstructionsSize()
  • ContextManager: use cached size in getContextBreakdown() instead of hardcoded 1800; add getContextWindow() accessor
  • useChat: use API inputTokens for post-compact token accounting instead of char-estimation; fix contextTokens reset
  • ContextBar: remove subagentChars double-counting in API mode
  • step-utils: add early context budget check at 65% in prepareStep to force text-only before overflow
  • subagent-tools: add estimateMessageTokens utility and maybeCompact callback interface

competitiveNN requested a review from proxysoul as a code owner on May 14, 2026 at 17:49
competitiveNN and others added 2 commits May 15, 2026 14:33
fix: incremental context compression — correct token tracking, add early budget check, fix UI display

- Forge: cache actual system prompt size in instructionsCache and expose via getCachedInstructionsSize()
- ContextManager: use cached size in getContextBreakdown() instead of hardcoded 1800; add getContextWindow() accessor
- useChat: use API inputTokens for post-compact token accounting instead of char-estimation; fix contextTokens reset
- ContextBar: remove subagentChars double-counting in API mode
- step-utils: add early context budget check at 65% in prepareStep to force text-only before overflow
- subagent-tools: add estimateMessageTokens utility and maybeCompact callback interface
Fix indentation in step-utils.ts that caused TS1128 parse error.
Reorder imports in manager.ts to match Biome's expected order.
competitiveNN force-pushed the fix/context-tracking branch from f40191d to 2fc0705 on May 15, 2026 at 14:40

coderabbitai Bot commented May 15, 2026


📝 Walkthrough

⚠ Behavior-critical changes to agent core loop and token accounting:

Core/Agent loop:

  • src/core/agents/forge.ts:export — cache actual system prompt size in instructionsCache and export getCachedInstructionsSize() to provide accurate system-instructions sizing.
  • src/core/context/manager.ts:method — getContextWindow() accessor added; getContextBreakdown() now uses cached instructions size (fallback 1800) for more accurate context breakdown.
  • src/core/agents/step-utils.ts:logic — add EARLY_COMPACT_PCT (65%) early budget check in prepareStep; when exceeded (step ≥2) emit optional _nudge and force text-only (toolChoice="none") to avoid overflow.
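The early-budget behavior in the bullet above can be sketched as follows. This is a hypothetical reduction of the prepareStep logic; the constant name EARLY_COMPACT_PCT follows the summary, but the function signature and result shape are assumptions, not the actual step-utils.ts code.

```typescript
// Hypothetical sketch of the 65% early-budget check described above.
const EARLY_COMPACT_PCT = 0.65;

interface PrepareStepResult {
  toolChoice: "auto" | "none";
  activeTools?: string[];
  hint?: string;
}

function checkEarlyBudget(
  estimatedTokens: number,
  contextWindow: number,
  stepNumber: number,
): PrepareStepResult {
  // Only intervene from step 2 onward, once usage crosses 65% of the window.
  if (stepNumber >= 2 && estimatedTokens >= contextWindow * EARLY_COMPACT_PCT) {
    return {
      toolChoice: "none", // force a text-only step
      activeTools: [],    // no tools may run this step
      hint: "context budget reached",
    };
  }
  return { toolChoice: "auto" };
}
```

Forcing `toolChoice: "none"` before the window overflows gives the model one tool-free step in which compaction can run, rather than failing mid-tool-call.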

Tools:

  • src/core/agents/subagent-tools.ts:export — add estimateMessageTokens(messages) (chars→tokens heuristic) and maybeCompact callback interface to allow subagents to participate in compaction decisions.
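A minimal version of the chars-to-tokens heuristic named above might look like this; the message shape and the exact fields counted are assumptions — the real estimateMessageTokens may inspect richer payloads than a flat content string.

```typescript
// Hypothetical sketch of the chars/4 token heuristic described above.
interface ChatMessage {
  role: string;
  content: string;
}

function estimateMessageTokens(messages: ChatMessage[]): number {
  // Sum character counts across all message contents, then apply
  // the common ~4 characters-per-token approximation.
  const totalChars = messages.reduce((sum, m) => sum + m.content.length, 0);
  return Math.ceil(totalChars / 4);
}
```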

UI/Chat:

  • src/components/layout/ContextBar.tsx:display — stop double-counting subagentChars in API mode; use contextTokens as authoritative prompt size.
  • src/hooks/useChat.ts:logic — after compaction, use actual compact model compactUsage.inputTokens when available (fallback to char estimate), set contextTokens to estimatedTokens (not 0), and prefer compactUsage cacheRead/cacheWrite values for token accounting.
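The fallback described here (prefer the API-reported inputTokens, otherwise estimate from characters) can be sketched as below. The name postCompactTokens and the CompactUsage shape are illustrative only, and note that a review comment further down disputes whether inputTokens is a valid post-compaction baseline at all.

```typescript
// Hypothetical sketch of the post-compaction token accounting described above.
interface CompactUsage {
  inputTokens: number;
  cacheReadTokens?: number;
  cacheWriteTokens?: number;
}

function postCompactTokens(
  compactUsage: CompactUsage | undefined,
  afterChars: number,
  charsPerToken = 4,
): number {
  // Prefer the actual API-reported input size; fall back to char estimation.
  if (compactUsage && compactUsage.inputTokens > 0) {
    return compactUsage.inputTokens;
  }
  return Math.ceil(afterChars / charsPerToken);
}
```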
Author: competitiveNN (6 files changed, +98 / −20 lines)

⚠ Touches agent core (src/core/** and src/hooks/useChat.ts) — verify tests/docs updated

Walkthrough

src/core/agents/step-utils.ts:570-580 — early compaction threshold and token estimation added.

Adds early context compaction at 65% context-window usage to prevent overflow. Introduces token estimation from message payloads, caches instruction sizes, exposes context-window visibility, wires an optional parent-dispatched compaction nudge, and refines token tracking and post-compaction accounting.

Changes

Early Context Compaction with Token Tracking

  • Token estimation utility (src/core/agents/subagent-tools.ts, src/core/agents/step-utils.ts): estimateMessageTokens approximates token count from message payloads (chars/4). buildPrepareStep imports and uses it to measure full-message token totals used for early compaction decisions.
  • Early compaction detection & nudge (src/core/agents/step-utils.ts): buildPrepareStep computes an early-compact threshold (65% of context window, step ≥ 2). When triggered it may emit an _nudge subagent event, add a "context budget reached" hint, and force the next step into text-only mode (toolChoice: "none", activeTools: []).
  • Instruction cache sizing & context visibility (src/core/agents/forge.ts, src/core/context/manager.ts): Instructions cache entries now include size (cached char length), and an exported helper returns the cached size. ContextManager.getContextWindow() added; context breakdown uses the cached instruction size (fallback 1800) for "System prompt + tools".
  • UI token formula & compaction post-processing (src/components/layout/ContextBar.tsx, src/hooks/useChat.ts): ContextBar's API-mode token computation removes state.subagentChars to avoid double-counting. useChat post-compaction prefers compactUsage.inputTokens to compute post-compact contextTokens and updates cacheRead/cacheWrite from compactUsage when present.
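The instruction-cache sizing change can be sketched roughly as below, under the assumption that instructionsCache is a simple keyed map; the real cache key type and entry shape in forge.ts may differ.

```typescript
// Hypothetical sketch of the instructions cache with a cached size field.
interface InstructionsEntry {
  text: string;
  key: string;
  size: number; // cached character length of the built system prompt
}

const instructionsCache = new Map<string, InstructionsEntry>();

function cacheInstructions(cm: string, text: string, key: string): void {
  instructionsCache.set(cm, { text, key, size: text.length });
}

function getCachedInstructionsSize(cm: string): number | undefined {
  return instructionsCache.get(cm)?.size;
}

// Context breakdown falls back to the old hardcoded 1800 when nothing is cached.
function systemPromptSize(cm: string): number {
  return getCachedInstructionsSize(cm) ?? 1800;
}
```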

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Poem

Context nears the ceiling, a gentle tap,
at sixty-five percent the forge takes stock,
a nudge, a compaction, tools step back,
the prompt trimmed clean to let fresh thoughts walk,
code measures, counts, and keeps the engine on track.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

  • Docstring Coverage (⚠️ Warning): Docstring coverage is 28.57%, below the required threshold of 80.00%. Resolution: write docstrings for the functions missing them.
✅ Passed checks (4 passed)
  • Title check (✅ Passed): Title follows Conventional Commits format with fix type and clear scope. Describes the main focus: token tracking and early budget checks. No vague language or multiple unrelated changes.
  • Description check (✅ Passed): Description directly details the changeset across all touched files: Forge caching, ContextManager accessors, useChat token accounting, ContextBar double-counting fix, step-utils early budget logic, and subagent-tools utilities.
  • Linked Issues check (✅ Passed): Check skipped because no linked issues were found for this pull request.
  • Out of Scope Changes check (✅ Passed): Check skipped because no linked issues were found for this pull request.



Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 ESLint

If the error stems from missing dependencies, add them to the package.json file. For unrecoverable errors (e.g., due to private dependencies), disable the tool in the CodeRabbit configuration.

ESLint skipped: no ESLint configuration detected in root package.json. To enable, add eslint to devDependencies.





coderabbitai Bot left a comment


Actionable comments posted: 5

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/core/agents/step-utils.ts`:
- Around line 717-735: The early compaction branch (guarded by
shouldEarlyCompact and !nudgeFired) currently forces text-only output but does
not mark the nudge as fired; update this block (the section that calls
emitSubagentStep with toolName "_nudge", pushes into hints, and sets
result.toolChoice = "none" / result.activeTools = []) to set nudgeFired = true
after emitting the subagent step so that subsequent checks (and the later
token-budget nudge logic) won’t emit a duplicate nudge.
- Around line 569-589: The block computing totalMsgChars and totalMsgTokens
duplicates logic; replace it by importing and calling the shared utility
estimateMessageTokens (from subagent-tools) instead of reimplementing the
chars/4 heuristic: remove the reducer that builds totalMsgChars and set
totalMsgTokens = estimateMessageTokens(messages). Ensure you add the import for
estimateMessageTokens at the top and keep the rest of the surrounding logic that
uses totalMsgTokens unchanged.
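The one-shot guard the first comment asks for can be illustrated in isolation like this; maybeEmitNudge is a hypothetical helper, not a function in step-utils.ts, and the emit callback stands in for emitSubagentStep.

```typescript
// Hypothetical sketch of the nudgeFired guard the review requests:
// mark the nudge as fired after emitting so later checks skip the emit.
let nudgeFired = false;

function maybeEmitNudge(
  shouldEarlyCompact: boolean,
  emit: (toolName: string) => void,
): boolean {
  if (shouldEarlyCompact && !nudgeFired) {
    emit("_nudge");    // notify the parent agent once
    nudgeFired = true; // prevent a duplicate nudge on subsequent steps
    return true;
  }
  return false;
}
```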

In `@src/core/agents/subagent-tools.ts`:
- Around line 63-64: Remove the dead maybeCompact callback: delete the
maybeCompact?: () => Promise<void> declaration from the SubagentTools type in
subagent-tools.ts, remove the maybeCompact parameter from buildSubagentTools and
any related function signatures, and remove the forwarding of that argument in
forge.ts (where the value is passed into buildSubagentTools). Also update any
call sites and types that reference maybeCompact (including the parameter name
in forge.ts at the place that constructs/forwards subagent tools) so the code
compiles without the unused property.

In `@src/hooks/useChat.ts`:
- Around line 1207-1208: The post-compaction usage reset is incorrectly falling
back to previous cache totals (prev.cacheRead / prev.cacheWrite), which leaks
old counters; update the fallback to the zeroed baseline instead of prev by
using compactUsage?.cacheReadTokens ?? ZERO_USAGE.cacheRead and
compactUsage?.cacheWriteTokens ?? ZERO_USAGE.cacheWrite (referencing
compactUsage, prev, ZERO_USAGE, and the cacheRead/cacheWrite fields in
useChat.ts) so that when compaction omits cache details the counters reset to
zero.
- Around line 1175-1185: The code adds compactUsage.inputTokens (the compaction
request prompt size) plus a recentTokenEstimate, which inflates contextTokens;
instead use the post-compaction context size instead of inputTokens. Replace the
branch that sets estimatedTokens to compactUsage.inputTokens +
recentTokenEstimate with logic that uses compactUsage.outputTokens (or whatever
field represents the compacted/context token count) as the base; if no
post-compaction field exists, fallback to using recentTokenEstimate or
Math.ceil(afterChars / charsPerToken). Update the assignment to estimatedTokens
and keep setContextTokens(estimatedTokens) unchanged, referencing compactUsage,
inputTokens, outputTokens (or equivalent), recentTokenEstimate, newCoreChars,
charsPerToken, afterChars, and contextWindow to locate the change.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: ASSERTIVE

Plan: Pro Plus

Run ID: 7ced4137-42bc-4b2a-a0f3-9358bd6afc29

📥 Commits

Reviewing files that changed from the base of the PR and between e028952 and 2fc0705.

📒 Files selected for processing (6)
  • src/components/layout/ContextBar.tsx
  • src/core/agents/forge.ts
  • src/core/agents/step-utils.ts
  • src/core/agents/subagent-tools.ts
  • src/core/context/manager.ts
  • src/hooks/useChat.ts

Comment threads:
  • src/core/agents/step-utils.ts (outdated)
  • src/core/agents/step-utils.ts
  • src/core/agents/subagent-tools.ts (outdated)
  • src/hooks/useChat.ts
  • src/hooks/useChat.ts (outdated)
Comment on lines +1207 to +1208
cacheRead: compactUsage?.cacheReadTokens ?? prev.cacheRead,
cacheWrite: compactUsage?.cacheWriteTokens ?? prev.cacheWrite,

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

src/hooks/useChat.ts:1207 carries stale cache counters after reset.

Line 1207 and Line 1208 fall back to prev.cacheRead/cacheWrite even though this block resets usage with ...ZERO_USAGE. If compaction usage omits cache details, old cache totals leak into the new post-compaction baseline.

Proposed fix
-            cacheRead: compactUsage?.cacheReadTokens ?? prev.cacheRead,
-            cacheWrite: compactUsage?.cacheWriteTokens ?? prev.cacheWrite,
+            cacheRead: compactUsage?.cacheReadTokens ?? 0,
+            cacheWrite: compactUsage?.cacheWriteTokens ?? 0,


coderabbitai Bot left a comment


Actionable comments posted: 1

♻️ Duplicate comments (1)
src/hooks/useChat.ts (1)

1172-1183: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

compactUsage.inputTokens is being treated as post-compaction context, but it is compaction-call usage.

src/hooks/useChat.ts:1177-1179 uses compaction request inputTokens as the new contextTokens baseline. That distorts post-compaction context and can mis-trigger threshold logic.

Proposed fix
-        let estimatedTokens: number;
-        if (compactUsage && compactUsage.inputTokens > 0) {
-          estimatedTokens = compactUsage.inputTokens;
-        } else {
-          estimatedTokens = Math.ceil(afterChars / charsPerToken);
-        }
+        const estimatedTokens = Math.ceil(afterChars / charsPerToken);

Open question: In the Vercel AI SDK, what does `usage.inputTokens` from `generateText`/`streamText` represent exactly? Is it strictly the token count for that specific API call's input, or can it be used as the token count for a different subsequent prompt context?

Also applies to: 1185-1205

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/hooks/useChat.ts` around lines 1172 - 1183, compactUsage.inputTokens is
the compaction API call's input usage and should not be treated as the
post-compaction context size; using it directly to set estimatedTokens and call
setContextTokens (see compactUsage.inputTokens, estimatedTokens,
setContextTokens, contextWindow, afterChars, charsPerToken) can misrepresent the
actual prompt size and trigger thresholds incorrectly. Fix by computing
post-compaction context tokens from the compaction response that explicitly
reports post-compaction context (or if unavailable, continue to fall back to
char-based estimation using afterChars/charsPerToken) — only assign
setContextTokens with a value that is guaranteed to represent the new prompt
size (e.g., a dedicated postCompactionContextTokens field from the compaction
output), and do not reuse compactUsage.inputTokens as the context baseline
unless the API docs confirm it is the post-compaction context size.
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/core/agents/forge.ts`:
- Around line 546-547: In buildInstructions ensure the cache entry always
contains the size so getCachedInstructionsSize never returns undefined: when
computing text/key inside buildInstructions, write instructionsCache.set(cm, {
text, key, size: text.length }) unconditionally (not only inside the snapshot
branch) and apply the same unconditional size inclusion for the other cache
writes referenced around the 550-553 block; update any code paths that currently
only call instructionsCache.set(...) when snapshot is truthy to include the size
property so getCachedInstructionsSize can rely on it.


ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: ASSERTIVE

Plan: Pro Plus

Run ID: 7f154483-7397-49d6-9d5d-8a5d0a892b49

📥 Commits

Reviewing files that changed from the base of the PR and between 2fc0705 and b133eaa.

📒 Files selected for processing (4)
  • src/core/agents/forge.ts
  • src/core/agents/step-utils.ts
  • src/core/agents/subagent-tools.ts
  • src/hooks/useChat.ts

Comment thread src/core/agents/forge.ts
Comment on lines +546 to 547
if (snapshot) instructionsCache.set(cm, { text, key, size: text.length });
return text;

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Cache size unconditionally in buildInstructions.

src/core/agents/forge.ts:546 only writes cache when snapshot exists, so getCachedInstructionsSize can return undefined for valid instruction payloads and force fallback sizing.

Proposed fix
-  if (snapshot) instructionsCache.set(cm, { text, key, size: text.length });
+  instructionsCache.set(cm, { text, key, size: text.length });

Also applies to: 550-553


