feat: context engineering — selective tools + adaptive compaction (#69)
kienbui1995 merged 1 commit into main
Conversation
1. Selective tool loading by model tier:
   - Tier 1 (frontier): all 30 tools
   - Tier 2 (strong): 25 tools (no worktree/notebook/mcp)
   - Tier 3 (local): 8 core tools only
   - Tier 4 (Qwen): 10 tools (core + memory + codebase_search)

   Saves ~1500 tokens for small models.

2. Adaptive compaction thresholds:
   - Small context (<64K): compact at 70%, light at 55%
   - Large context (≥64K): compact at 90%, light at 80%

   Prevents Qwen 3.5 (32K) from running out of context.

274 tests, 0 fail.
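The tier gate described above can be sketched as a single predicate. This is a hypothetical reconstruction, not the PR's actual implementation: the tool names and the "core" set are illustrative stand-ins; only the tier semantics follow the description.

```rust
/// Sketch of per-tier tool filtering. Tool names and the core set are
/// assumed for illustration; only the tier rules follow the PR description.
fn tool_allowed_for_tier(name: &str, tier: u8) -> bool {
    // Minimal stand-in for the 8 "core" tools of tier 3.
    const CORE: &[&str] = &[
        "read_file", "write_file", "edit_file", "bash",
        "grep", "glob", "ls", "todo",
    ];
    match tier {
        // Frontier: every registered tool.
        1 => true,
        // Strong: drop the heavy worktree/notebook/mcp integrations.
        2 => !(name.starts_with("worktree")
            || name.starts_with("notebook")
            || name.starts_with("mcp")),
        // Local: core tools only.
        3 => CORE.contains(&name),
        // Qwen: core plus memory and codebase_search.
        4 => CORE.contains(&name) || name == "memory" || name == "codebase_search",
        // Unknown tier: fail open rather than silently dropping tools.
        _ => true,
    }
}
```

Filtering at spec-selection time (rather than hiding tools in the prompt) keeps the system prompt and the request's tool list consistent.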
📝 Walkthrough

This PR introduces model-based tool-tier restrictions to the conversation runtime.
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~20 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Code Review
This pull request introduces a tiered tool system and dynamic context compaction thresholds to optimize performance for models with smaller context windows. Key feedback includes the need to initialize and update the tool tier in all execution modes and model changes, and to use an enum for better type safety. Additionally, the compaction logic should be adjusted to prevent shadowing of micro-compaction, token estimation should account for filtered tools, and magic numbers should be replaced with constants.
```rust
let mut rt =
    mc_core::ConversationRuntime::new(model.to_string(), max_tokens, system.to_string());
rt.set_tool_registry(tool_registry);
rt.tool_tier = model_prompt_tier(&model);
```
The tool_tier is correctly initialized here for the TUI mode, but it is missing in the run_single function (around line 1593). This will cause models to use the default tier 1 (all tools) when running in CLI or pipe mode, which may exceed the context window or reasoning capabilities of smaller models like Qwen or Llama. Additionally, the system prompt (built at line 187) will be tiered, but the actual tools sent in the request will not be, leading to a discrepancy between the model's instructions and its available tools.
```rust
task_manager: crate::tasks::TaskManager,
hierarchical_instructions: Option<String>,
/// Model capability tier for tool filtering (1=frontier, 2=strong, 3=local, 4=qwen).
pub tool_tier: u8,
```
Using a raw u8 for tool_tier is error-prone and reduces readability. Consider defining a ModelTier enum to provide better type safety. Furthermore, note that tool_tier is currently not updated when the model is changed via set_model (line 238), which will result in incorrect tool filtering if a user switches between frontier and local models mid-session.
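A minimal sketch of the suggested typed tier. The variant names mirror the doc comment on `tool_tier`; the name-matching heuristics in `from_model` are assumptions for illustration, not the repository's actual `model_prompt_tier` logic.

```rust
/// Sketch of a typed tier replacing the raw u8. Variant names follow the
/// doc comment on `tool_tier`; the mapping below is assumed.
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
enum ModelTier {
    Frontier = 1,
    Strong = 2,
    Local = 3,
    Qwen = 4,
}

impl ModelTier {
    /// Hypothetical name-based mapping; the substring checks are
    /// illustrative stand-ins for the real tier-selection helper.
    fn from_model(model: &str) -> Self {
        let m = model.to_lowercase();
        if m.contains("qwen") {
            ModelTier::Qwen
        } else if m.contains("llama") {
            ModelTier::Local
        } else {
            ModelTier::Frontier
        }
    }
}
```

Calling a constructor like `ModelTier::from_model` inside `set_model` would also address the second point: the tier stays in sync when the user switches between frontier and local models mid-session.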
```diff
- if usage_pct > 90 {
+ // Small context models (<64K) compact earlier
+ let (full_threshold, light_threshold) = if ctx_window < 65_536 {
      crate::compact::compact_session(&mut self.session, preserve);
  }
- } else if usage_pct > 80 {
+ } else if usage_pct > light_threshold {
```
For small context models, light_threshold is set to 55. This else if block will match for any usage_pct above 55, which effectively shadows the subsequent else if usage_pct > 60 check (line 1383) that performs micro_compact. As a result, micro_compact (which trims long outputs for all tools) will never be executed for small models when they are under context pressure. Consider reordering the checks or making them independent.
```rust
.tool_registry
.all_specs()
.iter()
.filter(|s| tool_allowed_for_tier(s.name.as_str(), self.tool_tier))
```
While tools are correctly filtered here before being sent to the model, the tool_schema_tokens calculation at line 1465 still iterates over all tools in the registry. This leads to an overestimation of the context used by tool definitions for lower-tier models, resulting in a smaller effective_max_tokens budget than actually available. The filter should also be applied during the token estimation at line 1465.
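The fix amounts to reusing the same predicate in the estimate. A sketch under assumed types: `ToolSpec`, the function signature, and the chars-per-token heuristic are stand-ins for the real runtime code, not its actual API.

```rust
/// Assumed stand-in for the registry's tool spec type.
struct ToolSpec {
    name: String,
    schema_json: String,
}

/// Estimate schema tokens over *filtered* tools only, so lower-tier
/// models get their full effective token budget back.
fn tool_schema_tokens(specs: &[ToolSpec], tier: u8, allowed: fn(&str, u8) -> bool) -> usize {
    specs
        .iter()
        .filter(|s| allowed(&s.name, tier)) // same predicate as the request path
        .map(|s| s.schema_json.len() / 4)   // crude ~4 chars per token
        .sum()
}

/// Example predicate: tier 3 only allows "read_file" (illustrative only).
fn demo_allowed(name: &str, tier: u8) -> bool {
    tier != 3 || name == "read_file"
}
```

Using one shared predicate for both the request payload and the budget estimate keeps the two from drifting apart again.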
Actionable comments posted: 1
🧹 Nitpick comments (1)
mc/crates/mc-core/src/runtime.rs (1)
1362-1386: Micro-compact branch is unreachable for small context models.

For models with `ctx_window < 65_536`, the thresholds are `(70, 55)`. The `else if usage_pct > 60` check at line 1383 will never trigger because any value above 60 is already caught by `usage_pct > light_threshold` (55).

If this is intentional (small models skip micro-compact and jump to light compact earlier), consider adding a comment to clarify. Otherwise, the micro-compact threshold should also be adaptive:
♻️ Proposed fix to make micro-compact threshold adaptive

```diff
  // Small context models (<64K) compact earlier
  let (full_threshold, light_threshold) = if ctx_window < 65_536 {
      (70, 55) // compact at 70%, light at 55%
  } else {
      (90, 80)
  };
+ let micro_threshold = if ctx_window < 65_536 { 40 } else { 60 };
  if usage_pct > full_threshold {
      // Full smart compact ...
  } else if usage_pct > light_threshold {
      // Collapse reads + snip thinking ...
- } else if usage_pct > 60 {
+ } else if usage_pct > micro_threshold {
      // Micro-compact only
      crate::compact::micro_compact(&mut self.session);
  }
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@mc/crates/mc-core/src/runtime.rs` around lines 1362 - 1386, The micro-compact branch is currently unreachable for small models because light_threshold is 55 while the hardcoded micro threshold is 60; define an adaptive micro_threshold (e.g., let micro_threshold = if ctx_window < 65_536 { 50 } else { 60 }) and replace the final else-if (usage_pct > 60) with else if usage_pct > micro_threshold so micro_compact(&mut self.session) can be reached for small and large ctx_window values; reference variables/functions: ctx_window, full_threshold, light_threshold, usage_pct, micro_threshold, crate::compact::micro_compact, crate::compact::collapse_reads, crate::compact::snip_thinking.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@mc/crates/mc-cli/src/main.rs`:
- Line 421: In run_single, after creating the ConversationRuntime via
ConversationRuntime::new (in function run_single), set runtime.tool_tier =
model_prompt_tier(&model) so the same model-based tool tier selection done in
run_tui is applied to pipe/non-interactive runs; locate the runtime variable in
run_single and assign tool_tier using the model_prompt_tier(&model) helper
before the runtime is used.
---
Nitpick comments:
In `@mc/crates/mc-core/src/runtime.rs`:
- Around line 1362-1386: The micro-compact branch is currently unreachable for
small models because light_threshold is 55 while the hardcoded micro threshold
is 60; define an adaptive micro_threshold (e.g., let micro_threshold = if
ctx_window < 65_536 { 50 } else { 60 }) and replace the final else-if (usage_pct
> 60) with else if usage_pct > micro_threshold so micro_compact(&mut
self.session) can be reached for small and large ctx_window values; reference
variables/functions: ctx_window, full_threshold, light_threshold, usage_pct,
micro_threshold, crate::compact::micro_compact, crate::compact::collapse_reads,
crate::compact::snip_thinking.
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 07f7191e-dff9-4ef1-9099-2fa4191ad50c
📒 Files selected for processing (2)
- mc/crates/mc-cli/src/main.rs
- mc/crates/mc-core/src/runtime.rs
```rust
let mut rt =
    mc_core::ConversationRuntime::new(model.to_string(), max_tokens, system.to_string());
rt.set_tool_registry(tool_registry);
rt.tool_tier = model_prompt_tier(&model);
```
LGTM for TUI path, but run_single is missing the same assignment.
The tier assignment in run_tui is correct. However, the run_single function (lines 1569-1593) creates a ConversationRuntime without setting tool_tier:
```rust
// Line 1569-1570
let mut runtime =
    mc_core::ConversationRuntime::new(model.to_string(), max_tokens, system.to_string());
// ... tool_tier is never set, defaults to 1
```

This means pipe/non-interactive mode (`magic-code --pipe "fix bug" --model qwen3.5`) will always send all 30 tools regardless of model tier, defeating the context engineering optimization.
🐛 Proposed fix to add tier assignment in run_single

```diff
  async fn run_single(
      model: &str,
      ...
  ) -> Result<()> {
      let cancel = CancellationToken::new();
      let mut runtime =
          mc_core::ConversationRuntime::new(model.to_string(), max_tokens, system.to_string());
+     runtime.tool_tier = model_prompt_tier(model);
      let mut tool_registry = mc_tools::ToolRegistry::new()
          .with_workspace_root(std::env::current_dir().unwrap_or_default());
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@mc/crates/mc-cli/src/main.rs` at line 421, In run_single, after creating the
ConversationRuntime via ConversationRuntime::new (in function run_single), set
runtime.tool_tier = model_prompt_tier(&model) so the same model-based tool tier
selection done in run_tui is applied to pipe/non-interactive runs; locate the
runtime variable in run_single and assign tool_tier using the
model_prompt_tier(&model) helper before the runtime is used.
Context Engineering for Small Models
Selective Tool Loading
Adaptive Compaction
Critical for Qwen 3.5 9B self-hosted (32K context).
274 tests, 0 fail.
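The adaptive thresholds summarized above reduce to a small pure function. The 64K boundary and the percentages come from the PR description; the function name and shape are a sketch, not the runtime's actual code.

```rust
/// Returns (full_compact_pct, light_compact_pct) for a context window.
/// Numbers follow the PR description; the function itself is assumed.
fn compaction_thresholds(ctx_window: usize) -> (u8, u8) {
    if ctx_window < 65_536 {
        (70, 55) // small context (<64K): compact early to avoid overflow
    } else {
        (90, 80) // large context: let the session grow further first
    }
}
```

For a 32K model like the self-hosted Qwen 3.5 9B, this triggers a full compact at ~22.9K tokens instead of ~29.5K, leaving headroom for the next turn's output.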