Cache modifiers: remove local-only guards, fix tests, update docs#469
Cache modifiers: remove local-only guards, fix tests, update docs#469mawad-amd merged 11 commits intomuhaawad/cache-modifiersfrom
Conversation
Co-authored-by: Ahmet Yildirim <ayildiri@amd.com>
Co-authored-by: jonathou-amd <jonathou@amd.com> Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
…#459) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mawad-amd <112003944+mawad-amd@users.noreply.github.com>
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mawad-amd <112003944+mawad-amd@users.noreply.github.com>
…get_device_context()` (#468) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mawad-amd <112003944+mawad-amd@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…ifier changes Co-authored-by: mawad-amd <112003944+mawad-amd@users.noreply.github.com> Agent-Logs-Url: https://github.com/ROCm/iris/sessions/7da6d66a-7d64-4555-a986-30038945659f
Co-authored-by: mawad-amd <112003944+mawad-amd@users.noreply.github.com> Agent-Logs-Url: https://github.com/ROCm/iris/sessions/7da6d66a-7d64-4555-a986-30038945659f
|
@copilot CI won't trigger on this PR because it targets
You are running on a self-hosted runner with 8 AMD GPUs. Run everything locally. |
Co-authored-by: mawad-amd <112003944+mawad-amd@users.noreply.github.com> Agent-Logs-Url: https://github.com/ROCm/iris/sessions/aa9585a5-d8e8-4538-9dd4-eed0e639bf02
Full test suite run completed across all rank counts on this self-hosted runner (8x AMD GPUs). Commit Results Summary:
Only failures are Bug fixed ( |
Cache modifiers were silently dropped for remote operations (
store,copy,put) via runtimeif from_rank == to_rankguards. Assembly inspection confirmed all modifiers emit correct CDNA ISA flags (.cg→sc0 nt,.cv/.wt→sc0 sc1) and remote ops with any modifier produce correct results.Core change
Removed the rank-equality guards — cache modifiers now pass through unconditionally to
tl.store()/tl.load()in all ops. It is the caller's responsibility to choose appropriate modifiers.Before:
After:
Docstrings
Each function (
load,store,copy,get,put) now explicitly identifies which side is local vs. remote and states that modifier selection is the caller's responsibility.Tests
test_store_cache_modifiers: split into_local/_remotevariants; added remote store coverage for all modifierstest_put_cache_modifiers: same split; remote put tests addedtest_copy_cache_modifiers: expanded remote-write parametrization to all store modifiers (previously restricted to[None, ""])test_get_cache_modifiers/test_load_cache_modifiers: removed redundantNone-branching in kernelstest_get_other_triton/test_put_other_triton: fixed wrong expected values —tl.storewithmask=Falseis a no-op, so masked-out positions stay at0.0, notother_valuetest_load_other_triton: fixed data init (fill_value=source_rankinstead ofpartner) so loaded remote values match expectations💬 Send tasks to Copilot coding agent from Slack and Teams to turn conversations into code. Copilot posts an update in your thread when it's finished.