Skip to content

Added Kokkos GPU implementation of ISAI approximate inverse and lAIR …#223

Open
stevendargaville wants to merge 2 commits intomainfrom
sai_lair_gpu
Open

Added Kokkos GPU implementation of ISAI approximate inverse and lAIR …#223
stevendargaville wants to merge 2 commits intomainfrom
sai_lair_gpu

Conversation

@stevendargaville
Copy link
Collaborator

…z type. SAI types are not yet implemented. Also not yet well optimised - lAIR is difficult on a GPU as the size of the dense inverses required changes at every row so they can't be batched. The dense matrices also require lots of temporary memory, which limits the threads/teams/warps that can be launched at any one time

…z type. SAI types are not yet implemented. Also not yet well optimised - lAIR is difficult on a GPU as the size of the dense inverses required changes at every row so they can't be batched. The dense matrices also require lots of temporary memory, which limits the threads/teams/warps that can be launched at any one time
…os debug comparisons, as the approx solve is converged to a bigger tolerance
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant