Box AI Units

Box AI usage is gated by AI Units: a monthly budget, consumed per call, whose size depends on your Box plan. This page captures plan allocations, plugin opt-in controls, and what happens when units run out.

Full reference at references/box-ai-units.md.

Per-plan allocation

Plan	AI Units / month	What's included
Free Developer	1,000	Ask, Extract, Text Gen; multi-doc Ask yes; Hubs Q&A no; AI Studio no
Business / Business Plus	Purchase as needed	Ask, Extract, Text Gen (per-unit cost); multi-doc Ask no; Hubs Q&A no
Enterprise	1,000 included	Ask, Extract, Text Gen; Hubs Q&A yes (Enterprise feature); multi-doc Ask no; AI Studio no
Enterprise Plus	2,000 included	Above + multi-doc Ask yes
Enterprise Advanced	20,000 included	Above + AI Studio yes

Box does not publish per-call unit consumption rates and directs prospects to sales for rate cards. Plan around your monthly allocation rather than trying to count units per call.

Per-skill consumption

Skill	Operation	Units / call	Frequency
`box-ai-recall`	`/2.0/ai/ask`	1+ (scales with input file count + prompt)	Per query
`box-ai-extract`	`/2.0/ai/extract_structured`	1+ (scales with file size + field count)	Per binary processed
`box-ai-agent invoke`	`/2.0/ai_agents`	1+	Per agent query
`box-companion` (AI path)	Same as `box-ai-extract`	Same as `box-ai-extract`	Per binary, when generating companions for AI-supported formats
All other skills	None	0	N/A

Opt-in controls

The plugin defaults all AI-consuming behavior to OFF. The flags live under the settings key of _box-memory.json:

settings:
  ai_recall_enabled: false       # set true to allow /box-ai-recall
  ai_extract_enabled: false      # set true for /box-companion AI path + /box-ai-extract
  ai_studio_agent_enabled: false # set true for /box-ai-agent
  ai_model: null                 # let Box pick (default GPT-5 mini); override with a specific model

Users explicitly opt in. The plugin never silently consumes AI Units.
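
A minimal sketch of how a skill might check its opt-in flag before making any AI call. It assumes _box-memory.json is a JSON file with a top-level settings object holding the flags above. The helper names (ai_feature_enabled, load_settings) and the SKILL_FLAGS mapping are hypothetical illustrations, not the plugin's actual code.

```python
import json
from pathlib import Path

# Hypothetical mapping from skill name to the settings flag that gates it.
SKILL_FLAGS = {
    "box-ai-recall": "ai_recall_enabled",
    "box-ai-extract": "ai_extract_enabled",
    "box-companion": "ai_extract_enabled",  # AI path only
    "box-ai-agent": "ai_studio_agent_enabled",
}


def ai_feature_enabled(settings: dict, skill: str) -> bool:
    """Return True only if the skill's flag is explicitly set to true."""
    flag = SKILL_FLAGS.get(skill)
    if flag is None:
        return False  # Skill has no AI path, so there is nothing to enable.
    # Missing or non-boolean values count as OFF, matching the default.
    return settings.get(flag) is True


def load_settings(path: Path) -> dict:
    """Read the settings object from _box-memory.json (assumed layout)."""
    data = json.loads(path.read_text(encoding="utf-8"))
    return data.get("settings", {})
```

Treating anything other than a literal true as OFF keeps a malformed or partially written settings file from enabling AI calls by accident.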

Workspace-level cost bounds (v0.2+ planned)

Coming in a future version:

settings:
  ai_budget:
    monthly_unit_cap: 500        # plugin tracks; refuses calls above this
    per_session_cap: 50          # also enforce per-session limit
    warn_at_pct: 80              # log warning when 80% of cap reached

Not in v0.1.0. Current behavior: opt-in flags only, no cap enforcement.
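
Since cap enforcement does not exist yet, the following is only a sketch of how the planned ai_budget settings could behave. The AiBudget class, its counters, and the idea of passing an estimated unit cost are all assumptions; because Box does not publish per-call rates, any estimate passed in would be the caller's own guess.

```python
from dataclasses import dataclass


@dataclass
class AiBudget:
    """Hypothetical tracker mirroring the planned ai_budget settings."""

    monthly_unit_cap: int = 500
    per_session_cap: int = 50
    warn_at_pct: int = 80
    used_this_month: int = 0
    used_this_session: int = 0

    def check(self, estimated_units: int = 1) -> str:
        """Return 'refuse', 'warn', or 'ok' for a call costing estimated_units."""
        month_after = self.used_this_month + estimated_units
        session_after = self.used_this_session + estimated_units
        # Refuse any call that would push either counter above its cap.
        if month_after > self.monthly_unit_cap or session_after > self.per_session_cap:
            return "refuse"
        # Warn once monthly usage reaches warn_at_pct percent of the cap.
        if month_after * 100 >= self.monthly_unit_cap * self.warn_at_pct:
            return "warn"
        return "ok"

    def record(self, units: int) -> None:
        """Add units actually spent to both counters."""
        self.used_this_month += units
        self.used_this_session += units
```

Checking before the call and recording after it keeps the two concerns separate, so a failed call does not have to be rolled back.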

When you exhaust units

Box returns HTTP 402 (Payment Required) or a quota error when the monthly allocation runs out.

Plugin behavior:

Surface the error: "Box AI Unit quota reached. Falling back to non-AI paths. Quota refreshes at the start of next billing period."
Set ai_recall_enabled, ai_extract_enabled, and ai_studio_agent_enabled to false for the session
Continue providing service via non-AI paths (index recall, sparse companions)
On next session, re-detect tier and AI Units availability via /box-tier-detect --refresh
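
The fallback steps above can be sketched as follows. This is an illustration under assumptions, not the plugin's implementation: handle_quota_exhaustion and the in-memory session_settings dict are hypothetical, and only the HTTP 402 case is handled because the shape of the other quota error is not specified here.

```python
from typing import Optional

# The three opt-in flags from the settings block.
AI_FLAGS = ("ai_recall_enabled", "ai_extract_enabled", "ai_studio_agent_enabled")

QUOTA_MESSAGE = (
    "Box AI Unit quota reached. Falling back to non-AI paths. "
    "Quota refreshes at the start of next billing period."
)


def handle_quota_exhaustion(status_code: int, session_settings: dict) -> Optional[str]:
    """On HTTP 402, turn off all AI flags for this session and return the message to surface.

    Returns None for any other status code and leaves the settings unchanged.
    """
    if status_code != 402:
        return None
    for flag in AI_FLAGS:
        # Mutates only the in-memory session copy; the saved file is untouched,
        # so the next session can re-detect availability.
        session_settings[flag] = False
    return QUOTA_MESSAGE
```

Because the flags are flipped only in the session copy, the user's saved opt-in choices survive until the next tier re-detection.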

Bounding cost in practice

Default to off. Users explicitly opt in.
Suggest box-recall first for any query that could plausibly be exact-matched.
Cap input file sets at the 25-file Box limit (you can't accidentally query 200 files).
Use --check for companion freshness before regenerating (no AI cost for the check).
Watch for 402. Falling back fast is better than retrying and burning units.
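
As one illustration of the file-set cap, a caller could trim its input list before building a query. The function name cap_file_set is hypothetical, and keeping the first 25 unique IDs is an assumed policy; a real caller would probably rank files by relevance first.

```python
MAX_FILES_PER_QUERY = 25  # the Box limit cited above


def cap_file_set(file_ids: list[str], limit: int = MAX_FILES_PER_QUERY) -> tuple[list[str], int]:
    """Return (file IDs to send, number dropped), de-duplicating while keeping order."""
    unique = list(dict.fromkeys(file_ids))  # dict preserves first-seen order
    kept = unique[:limit]
    return kept, len(unique) - len(kept)
```

Returning the dropped count lets the caller tell the user that the query was narrowed instead of silently ignoring files.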

Trade-off vs the on-prem variant

The on-prem variant consumes zero AI Units because it makes no Box API calls. That comes with a different set of trade-offs:

Cloud (this plugin)	On-prem
Uses AI Units for AI-powered skills	Zero AI Units (no Box AI access at all)
Box AI Extract handles OCR automatically	Companion falls back to "couldn't parse" for non-text formats
Semantic Q&A via `box-ai-recall`	Local index match only (no semantic)
AI Studio agents (Enterprise Advanced)	Not available

Both are valid choices.
