Skip to content

Implement aggregate rendezvous lowering for uniform multicast PipeNets #620

@brnorris03

Description

@brnorris03

Parent: #619

Problem

Uniform multicast PipeNets currently contribute to the hardware semaphore pressure tracked in #619. The current lowering can allocate rendezvous resources per receiver or per copy site, even when all receivers publish an equivalent destination DFB address/offset.

Proposed lowering

For multicast cases where all receivers publish an equivalent destination DFB address/offset, lower the protocol like tt-metal multicast patterns:

  • receivers reserve the destination DFB slot;
  • receivers increment one sender-ready counter;
  • sender waits for num_dests;
  • sender multicasts data to the uniform destination address;
  • sender multicasts one receiver-valid/completion signal.

This should make semaphore usage proportional to multicast streams/epochs rather than receiver count or copy-site count.

References

Non-goals

This does not solve non-uniform destination DFB addresses. Those still need mailbox/table-based address publication.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions