perf(inlet): speed up pull_chunk with bulk buffer slicing by sappelhoff · Pull Request #111 · labstreaminglayer/pylsl

sappelhoff · 2026-06-15T20:14:23Z

(PR and content generated with the help of Claude)

What

Speed up StreamInlet.pull_chunk — the main data-receive hot path — without changing its behavior or output.

Why

The current implementation reads the ctypes data/timestamp buffers one element at a time inside a nested list comprehension. Each data_buff[i] access crosses the ctypes boundary individually, which is slow. A single bulk slice (data_buff[:n]) converts the whole buffer in one C-level pass, and we then split it into per-sample lists in Python.

It also replaces num_elements / num_channels (float) + repeated int(...) truncation with integer floor division.

Impact

Measured on the extraction step alone (the changed code), output verified identical via assert old() == new():

shape	old	new	speedup
8ch x 1024	0.71 ms	0.23 ms	3.1x
32ch x 1024	2.43 ms	0.68 ms	3.6x
64ch x 512	2.27 ms	0.56 ms	4.0x
256ch x 256	4.54 ms	1.13 ms	4.0x
1ch x 4096	0.92 ms	0.62 ms	1.5x

~3-4x for typical multi-channel chunks; smaller for single-channel streams. This is the Python-side extraction only — end-to-end gain depends on how extraction-bound the receive loop is. The cf_string path still pays per-element .decode(), so its relative gain is smaller.

Behavior

Byte-identical output: same list[list] of values, same timestamp list type, unchanged cf_string decoding and free_char_p_array_memory call, unchanged dest_obj path. Verified with a live localhost round-trip across the numeric, string, and dest_obj paths; existing test suite passes.

cboulay · 2026-06-16T03:13:13Z

        if dest_obj is None:
+            # Convert the whole ctypes buffer to a Python list in a single
+            # bulk slice (far faster than indexing the array element by
+            # element), then split it into one list per sample.


No need to include the verbose comment. The reason for the change is in the git history and not needed in the code.

makes sense. I removed it

Convert the ctypes data and timestamp buffers to Python lists with a single bulk slice instead of indexing element-by-element inside a nested comprehension. This is ~3-4x faster at extracting multi-channel chunks (measured on the extraction step alone) and produces byte-identical output. Also use integer floor division for the sample count instead of float division plus repeated int() truncation.

The rationale lives in the commit history; the code itself does not need it. Addresses review feedback on labstreaminglayer#111.

Cover the two paths the bulk-slice extraction must preserve: a multi-channel numeric chunk and a variable-length string chunk. Pushes a known chunk and pulls it back, asserting identical values, list[list] shape, and timestamp list type. The string case (empty and multi-byte values) exercises the cf_string decode path that previously lacked coverage.

.claude/ and CLAUDE.md are local tooling artifacts that should not be tracked.

cboulay reviewed Jun 16, 2026

View reviewed changes

sappelhoff added 4 commits June 16, 2026 10:33

perf(inlet): drop the explanatory comment on the bulk slice

4528b80

The rationale lives in the commit history; the code itself does not need it. Addresses review feedback on labstreaminglayer#111.

chore: ignore local Claude Code config

7948d36

.claude/ and CLAUDE.md are local tooling artifacts that should not be tracked.

sappelhoff force-pushed the perf/pull-chunk-bulk-slice branch from 665a42f to 7948d36 Compare June 16, 2026 08:34

sappelhoff requested a review from cboulay June 16, 2026 08:37

cboulay approved these changes Jun 17, 2026

View reviewed changes

cboulay merged commit e146d7d into labstreaminglayer:main Jun 17, 2026
19 checks passed

sappelhoff deleted the perf/pull-chunk-bulk-slice branch June 17, 2026 07:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf(inlet): speed up pull_chunk with bulk buffer slicing#111

perf(inlet): speed up pull_chunk with bulk buffer slicing#111
cboulay merged 4 commits into
labstreaminglayer:mainfrom
sappelhoff:perf/pull-chunk-bulk-slice

sappelhoff commented Jun 15, 2026

Uh oh!

cboulay Jun 16, 2026

Uh oh!

sappelhoff Jun 16, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

sappelhoff commented Jun 15, 2026

What

Why

Impact

Behavior

Uh oh!

cboulay Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

sappelhoff Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sappelhoff Jun 16, 2026 •

edited

Loading