Skip to content

Version 3: Investigate poor performance on Zen. #15

@Mysticial

Description

@Mysticial

The 128-bit AVX multiply+add benchmark fails to achieve the theoretical 4 instructions/cycle on AMD Zen when running with one thread. With two threads on the core, it's possible.

See if it's possible to get 4/cycle with just one thread without the help of SMT.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions