
Summarize unclaimed rows in table_splitter.ambiguous suggestion #90

Merged
jcushman merged 1 commit into main from worktree-fix-table-splitter-ambiguous
Jun 3, 2026

Conversation

jcushman commented Jun 3, 2026

The binoc.table_splitter.ambiguous suggestion joined every unclaimed row index into a single sentence, so on large or irregular tables one suggestion ballooned to a multi-hundred-thousand-character line.

This change replaces the full enumeration with a count plus a small capped sample (up to 5 indices, then … and N more), e.g.:

The CSV has stacked table-like regions, but 10000 rows (e.g. 1, 2, 3, 4, 5, … and 9995 more) fall outside any clear rectangle; leaving it as one table.

Small lists (≤5) are still enumerated in full. Added a unit test asserting the summary stays short regardless of row count.
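For illustration, the count-plus-capped-sample idea can be sketched roughly as follows. This is a minimal sketch, not the code from the merged commit: the function name `summarize_rows`, the `max_sample` parameter, and the choice of Python are all assumptions made for the example.

```python
from typing import Iterable


def summarize_rows(rows: Iterable[int], max_sample: int = 5) -> str:
    """Return a short summary of row indices (hypothetical helper).

    Lists of up to `max_sample` indices are enumerated in full; longer
    lists show the first `max_sample` indices followed by "… and N more".
    """
    rows = list(rows)
    shown = ", ".join(str(r) for r in rows[:max_sample])
    remaining = len(rows) - max_sample
    if remaining <= 0:
        # Small list: nothing was cut off, so no "and N more" suffix.
        return shown
    return f"{shown}, … and {remaining} more"


if __name__ == "__main__":
    print(summarize_rows(range(1, 10001)))  # 1, 2, 3, 4, 5, … and 9995 more
    print(summarize_rows([3, 7]))           # 3, 7
```

A length check along the lines of `len(summarize_rows(range(1_000_000))) < 100` is one way the "stays short regardless of row count" property could be asserted in a unit test.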

Closes #88

(Auto generated pull request)

Instead of enumerating every unclaimed row index into one sentence
(which produced multi-hundred-thousand-character lines on large or
irregular tables), report the count plus a small capped sample.

Closes #88
jcushman merged commit 1ee32fa into main Jun 3, 2026
10 checks passed
jcushman deleted the worktree-fix-table-splitter-ambiguous branch June 3, 2026 19:02

Development

Successfully merging this pull request may close these issues.

table_splitter.ambiguous suggestion enumerates every unclaimed row, bloating changelogs
