KAFKA-20579: Initial producer incremental allocation for uncompressed data by lianetm · Pull Request #22654 · apache/kafka

lianetm · 2026-06-23T18:34:14Z

Initial partial implementation for KIP-1332 (producer incremental
allocation strategy).

This PR includes:

- new producer config for allocation strategy (incremental/full)
- initial implementation of the incremental strategy: supports
  uncompressed data only (no growth support), extra-copy on send (linked
  chunks are flattened into a new buffer)
- unit and integration tests (running existing Producer integration
  tests with the new strategy + new ones)

Support for compressed data and network layer improvements will come in
follow-up PRs.
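
To make the "extra-copy on send" step concrete, here is a minimal sketch of flattening linked chunks into one contiguous buffer. The class and method names (`ChunkFlattener`, `flatten`) are hypothetical and are not taken from this PR; the sketch assumes each chunk is a `ByteBuffer` already in read mode.

```java
import java.nio.ByteBuffer;
import java.util.List;

// Hypothetical helper, not the PR's code: copies linked chunks into one contiguous buffer.
final class ChunkFlattener {
    private ChunkFlattener() { }

    // Assumes each chunk is in read mode (position..limit holds the written bytes).
    static ByteBuffer flatten(List<ByteBuffer> chunks) {
        int total = 0;
        for (ByteBuffer chunk : chunks) {
            // addExact fails loudly instead of silently overflowing past Integer.MAX_VALUE.
            total = Math.addExact(total, chunk.remaining());
        }
        ByteBuffer flat = ByteBuffer.allocate(total);
        for (ByteBuffer chunk : chunks) {
            // duplicate() shares content but has its own position, so the source chunk is not consumed.
            flat.put(chunk.duplicate());
        }
        flat.flip(); // switch the result to read mode
        return flat;
    }
}
```

The copy is what the description calls the extra copy; the follow-up network layer work mentioned above is presumably where it would be avoided.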

Reviewers: Jun Rao <junrao@gmail.com>

junrao

@lianetm : Thanks for the PR. Made a pass of the non-testing files. Left a few comments.

junrao · 2026-06-29T20:48:51Z

                                        Importance.MEDIUM,
                                        CommonClientConfigs.CLIENT_DNS_LOOKUP_DOC)
                                .define(BUFFER_MEMORY_CONFIG, Type.LONG, 32 * 1024 * 1024L, atLeast(0L), Importance.HIGH, BUFFER_MEMORY_DOC)
+                                .define(BUFFER_MEMORY_ALLOCATION_STRATEGY_CONFIG,


Should we mark this as internal to prevent it from leaking into 4.4 before the feature is fully implemented?

junrao · 2026-06-30T00:10:36Z

+                    // pool memory by completing batches). On exhaustion, close the batch (making it
+                    // drainable) and fall through to the blocking first-record path next iteration.
+                    try {
+                        extensionChunks = chunkedFree.allocateChunks(extensionBytes, 0L);


Why do we need to do a special non-blocking call? In the existing logic, if an allocation request is blocked, all existing ProducerBatches become immediately drainable.

junrao · 2026-06-30T00:12:57Z

+                                last.closeForRecordAppends();
+                            }
+                        }
+                        continue;


It may take a bit of time for the closed batches to be drained. If we continue here, it seems that the client will just busy-loop until some batches are drained and some free space becomes available in buffer pool?
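
To illustrate the concern, a minimal sketch of waiting on a condition with a deadline rather than retrying in a loop. `WaitingPool` and its methods are hypothetical and heavily simplified; this is not the PR's buffer pool, only the general pattern of parking the caller until memory is released.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical, simplified pool: the caller parks on a condition instead of spinning.
final class WaitingPool {
    private final ReentrantLock lock = new ReentrantLock();
    private final Condition memoryFreed = lock.newCondition();
    private long available;

    WaitingPool(long capacity) {
        this.available = capacity;
    }

    // Waits up to maxWaitMs for `bytes` to become available; returns false on timeout or interrupt.
    boolean allocate(long bytes, long maxWaitMs) {
        long remainingNs = TimeUnit.MILLISECONDS.toNanos(maxWaitMs);
        lock.lock();
        try {
            while (available < bytes) {
                if (remainingNs <= 0) {
                    return false;
                }
                // awaitNanos releases the lock while parked and returns the time left.
                remainingNs = memoryFreed.awaitNanos(remainingNs);
            }
            available -= bytes;
            return true;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt for the caller
            return false;
        } finally {
            lock.unlock();
        }
    }

    // Returns memory to the pool and wakes any waiting allocators.
    void release(long bytes) {
        lock.lock();
        try {
            available += bytes;
            memoryFreed.signalAll();
        } finally {
            lock.unlock();
        }
    }
}
```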

junrao · 2026-06-30T20:49:43Z

+     * them, and retries. Otherwise defers to the parent.
+     */
+    @Override
+    protected RecordAppendResult tryAppend(long timestamp, byte[] key, byte[] value, Header[] headers,


It's a bit awkward to have a return value of null and RecordAppendResult.needsExtension. Could we introduce a non-null value to indicate the batch is full?
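
One possible shape for this, as a sketch only: a result type whose status is always non-null and names each case explicitly. `AppendOutcome`, its `Status` values, and the decision rule in `decide` are all hypothetical and do not reflect the actual `RecordAppendResult` API.

```java
// Hypothetical result type; names are illustrative and not the PR's RecordAppendResult API.
final class AppendOutcome {
    enum Status { APPENDED, NEEDS_EXTENSION, BATCH_FULL }

    final Status status;
    final int extensionBytes; // meaningful only when status == NEEDS_EXTENSION

    private AppendOutcome(Status status, int extensionBytes) {
        this.status = status;
        this.extensionBytes = extensionBytes;
    }

    // Assumed decision rule for illustration: the record fits in the current chunks,
    // fits after growing within the remaining batch budget, or does not fit at all.
    static AppendOutcome decide(int recordBytes, int writableBytes, int remainingBatchBytes) {
        if (recordBytes <= writableBytes) {
            return new AppendOutcome(Status.APPENDED, 0);
        }
        if (recordBytes <= remainingBatchBytes) {
            return new AppendOutcome(Status.NEEDS_EXTENSION, recordBytes - writableBytes);
        }
        return new AppendOutcome(Status.BATCH_FULL, 0);
    }
}
```

With a shape like this the caller can switch on `status` and never needs a null check.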

junrao · 2026-06-30T20:57:38Z

+                        // ProducerBatch), which can't take extension chunks. Only attach to a
+                        // writable chunked batch; otherwise refund the chunks and re-evaluate.
+                        if (last instanceof ChunkedProducerBatch && last.isWritable()) {
+                            ((ChunkedProducerBatch) last).addBuffers(extensionChunks);


I guess two concurrent clients could add buffers exceeding the batch size? Those buffers won't be used, but can only be freed after the batch is drained.
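
As a sketch of one possible guard (an assumption, not something this PR implements): re-check the batch size cap under the batch's own lock before attaching, so that the losing thread can refund its chunks immediately. `CappedBatch` and `tryAddBuffers` are hypothetical names.

```java
// Hypothetical batch stand-in: tracks attached bytes and enforces the cap atomically.
final class CappedBatch {
    private final long batchSize;
    private long attachedBytes;

    CappedBatch(long batchSize) {
        this.batchSize = batchSize;
    }

    // Returns false if attaching would exceed the batch size; the caller then
    // returns its chunks to the pool instead of leaving them idle until drain.
    synchronized boolean tryAddBuffers(long bytes) {
        if (attachedBytes + bytes > batchSize) {
            return false;
        }
        attachedBytes += bytes;
        return true;
    }

    synchronized long attachedBytes() {
        return attachedBytes;
    }
}
```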

junrao · 2026-06-30T23:01:33Z

+                long remainingBytes = memoryRequired - (long) pooled.size() * chunkSize;
+                if (remainingBytes > 0) {
+                    // remainingBytes <= memoryRequired <= totalMemory (validated above), so the int cast is safe.
+                    freeUp((int) remainingBytes);


This is a no-op since all pooled chunks have been used if we reach here.

junrao · 2026-07-01T00:48:52Z

+                    pooled.add(free.pollFirst());
+                long remainingBytes = memoryRequired - (long) pooled.size() * chunkSize;
+                if (remainingBytes > 0) {
+                    // remainingBytes <= memoryRequired <= totalMemory (validated above), so the int cast is safe.


Why is remainingBytes guaranteed to be an int? memoryRequired could be larger than int and pooled.size() initially could be 0.
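
For reference, a small sketch of why an unchecked narrowing cast is risky here. A plain `(int)` cast silently wraps when the value exceeds `Integer.MAX_VALUE`, while `Math.toIntExact` throws. `SafeCast` is a hypothetical helper used only for this illustration.

```java
// Hypothetical helper contrasting an unchecked narrowing cast with a checked one.
final class SafeCast {
    private SafeCast() { }

    // Keeps only the low 32 bits, so large values wrap silently.
    static int uncheckedCast(long value) {
        return (int) value;
    }

    // Throws ArithmeticException if the value does not fit in an int.
    static int checkedCast(long value) {
        return Math.toIntExact(value);
    }
}
```

For example, `uncheckedCast(3_000_000_000L)` yields `-1294967296`, whereas `checkedCast(3_000_000_000L)` throws `ArithmeticException`.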

junrao · 2026-07-01T16:11:40Z

+                        }
+                        long stillNeeded = memoryRequired - (long) pooled.size() * chunkSize - accumulated;
+                        if (stillNeeded > 0) {
+                            freeUp((int) stillNeeded);


This may be OK, but it's a bit weird to free up the chunks only to reallocate them again. Here is an alternative that doesn't require a freeUp() call.

    // Reuse pooled chunks first. If a reused chunk covers a slot we already reserved as
    // raw bytes in an earlier iteration, hand that raw reservation back to the pool.
    while (pooled.size() < numChunks && !free.isEmpty()) {
        pooled.add(free.pollFirst());
        if (accumulated >= chunkSize) { // accumulated is always chunk-aligned here
            accumulated -= chunkSize;
            this.nonPooledAvailableMemory += chunkSize; // refund → available to other waiters
        }
    }
    // Reserve raw memory for any still-uncovered chunks, in whole chunks.
    while (pooled.size() + (int)(accumulated / chunkSize) < numChunks
            && this.nonPooledAvailableMemory >= chunkSize) {
        this.nonPooledAvailableMemory -= chunkSize;
        accumulated += chunkSize;
    }
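
To trace the suggested loops, here is a self-contained sketch that wraps them in a hypothetical `ChunkReservation` class. Integer ids stand in for pooled `ByteBuffer` chunks, and the fields mirror the names used in the suggestion; locking and waiting are left out, so this only shows the bookkeeping of a single pass.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical stand-in for the pool state; field names follow the review suggestion.
final class ChunkReservation {
    final Deque<Integer> free = new ArrayDeque<>(); // ids standing in for pooled chunks
    final List<Integer> pooled = new ArrayList<>(); // chunks taken for this request
    final int chunkSize;
    long nonPooledAvailableMemory;
    long accumulated = 0; // raw (non-pooled) bytes reserved so far, chunk-aligned

    ChunkReservation(int chunkSize, long nonPooledAvailableMemory) {
        this.chunkSize = chunkSize;
        this.nonPooledAvailableMemory = nonPooledAvailableMemory;
    }

    // One pass of the suggested logic; returns true once numChunks are covered.
    boolean tryReserve(int numChunks) {
        // Reuse pooled chunks first, refunding one raw reservation per reused chunk.
        while (pooled.size() < numChunks && !free.isEmpty()) {
            pooled.add(free.pollFirst());
            if (accumulated >= chunkSize) {
                accumulated -= chunkSize;
                nonPooledAvailableMemory += chunkSize;
            }
        }
        // Reserve raw memory, in whole chunks, for any slots still uncovered.
        while (pooled.size() + (int) (accumulated / chunkSize) < numChunks
                && nonPooledAvailableMemory >= chunkSize) {
            nonPooledAvailableMemory -= chunkSize;
            accumulated += chunkSize;
        }
        return pooled.size() + (int) (accumulated / chunkSize) >= numChunks;
    }
}
```

With `chunkSize = 4`, 4 raw bytes available, and a request for 3 chunks, the first pass reserves one raw chunk and reports the request as incomplete. Once two chunks are returned to `free`, the second pass takes both, refunds and then re-reserves the raw chunk, and completes with 2 pooled chunks plus 4 raw bytes.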

junrao · 2026-07-01T16:33:06Z

+
+                        // Take pool chunks first, then reserve non-pool bytes for the remainder.
+                        while (pooled.size() < numChunks
+                                && (long) (pooled.size() + 1) * chunkSize + accumulated <= memoryRequired


The first condition is redundant, given the second one.

(pooled+1)*chunkSize + accumulated ≤ memoryRequired
⇒ (pooled+1)*chunkSize ≤ memoryRequired − accumulated ≤ memoryRequired = numChunks*chunkSize
⇒ pooled+1 ≤ numChunks
⇒ pooled < numChunks
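
The implication can also be checked by brute force over small values. The sketch below assumes, as in the derivation, that `accumulated ≥ 0` and `memoryRequired = numChunks * chunkSize`; `RedundancyCheck` is a hypothetical name used only for this check.

```java
// Hypothetical brute-force check that the second loop condition implies the first.
final class RedundancyCheck {
    private RedundancyCheck() { }

    // Returns true if no counterexample exists for any parameter up to `max`.
    static boolean secondImpliesFirst(int max) {
        for (int chunkSize = 1; chunkSize <= max; chunkSize++) {
            for (int numChunks = 0; numChunks <= max; numChunks++) {
                long memoryRequired = (long) numChunks * chunkSize;
                for (int pooled = 0; pooled <= max; pooled++) {
                    for (long accumulated = 0; accumulated <= memoryRequired; accumulated++) {
                        boolean first = pooled < numChunks;
                        boolean second =
                            (long) (pooled + 1) * chunkSize + accumulated <= memoryRequired;
                        if (second && !first) {
                            return false; // counterexample found
                        }
                    }
                }
            }
        }
        return true;
    }
}
```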

junrao · 2026-07-01T17:19:09Z

+                    // On failure (timeout / close / interrupt), refund the non-pool bytes taken.
+                    // Pool chunks already in `pooled` are returned to `free` separately by the
+                    // outer catch.
+                    this.nonPooledAvailableMemory += accumulated;


Could we return accumulated and pooled chunks in the same place? For example, we can set a flag like allocationCompleted to replace accumulated = 0. Then we can free both accumulated and pooled chunks if the flag is not set.
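
A minimal sketch of the single cleanup point described above, using an `allocationCompleted` flag and one `finally` block. `SingleCleanupPool`, its fields, and the `failMidway` switch (which simulates a timeout, close, or interrupt) are hypothetical and only illustrate the control flow.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical pool stand-in: both kinds of reserved memory are refunded in one place.
final class SingleCleanupPool {
    final Deque<Integer> free = new ArrayDeque<>(); // ids standing in for pooled chunks
    long nonPooledAvailableMemory;

    // `failMidway` simulates a timeout / close / interrupt after resources were taken.
    List<Integer> allocate(int chunksFromPool, long rawBytes, boolean failMidway) {
        List<Integer> pooled = new ArrayList<>();
        long accumulated = 0;
        boolean allocationCompleted = false;
        try {
            for (int i = 0; i < chunksFromPool && !free.isEmpty(); i++) {
                pooled.add(free.pollFirst());
            }
            long take = Math.min(rawBytes, nonPooledAvailableMemory);
            nonPooledAvailableMemory -= take;
            accumulated += take;
            if (failMidway) {
                throw new IllegalStateException("simulated timeout");
            }
            allocationCompleted = true;
            return pooled;
        } finally {
            if (!allocationCompleted) {
                // Single refund point for both raw bytes and pooled chunks.
                nonPooledAvailableMemory += accumulated;
                free.addAll(pooled);
            }
        }
    }
}
```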

lianetm added 9 commits June 22, 2026 09:52

support dyn for uncompressed

4ec8d04

tests

203a410

config & ratio-aware chunk sizing

64147f6

integration tests

8c82cd2

refactor validation & upd docs

b71d5b7

update config

bdc14e9

fix for partition change & misc

4e91499

test updates & docs

9ccd47d

fixes

860bde4

lianetm requested a review from junrao June 23, 2026 18:34

github-actions Bot added the core, producer, build, and clients labels Jun 23, 2026

lianetm added 3 commits June 23, 2026 16:27

checkstyle

179e675

Merge branch 'trunk' into lm-producer-dyn-1

2bafe00

fix test

9b45abc

github-actions Bot added the tools label Jun 23, 2026

AndrewJSchofield added the ci-approved label Jun 24, 2026

junrao reviewed Jul 1, 2026

lianetm wants to merge 12 commits into apache:trunk from lianetm:lm-producer-dyn-1


3 participants
