Hi Folks (especially AMD GPUs' trainer),
While installing/building mamba-ssm from repository, if you have faced issues with cannot determine amdgcn architecture and the solution suggested being consider passing it via '--offload-arch'. Then here is the solution.
[NOTE]: This have been tried with the rocm/pytorch-training docker image.
First of all, try to checkout to a particular commit of mamba repo to avoid possible errors like NameError: name 'bare_metal_version' is not defined and then export a variable that overrides the possibility of setting --offload_arch=native as done in setup.py of mamba repo.
Steps:
git clone https://github.com/state-spaces/mamba.git mamba_ssm
cd mamba_ssm
git checkout 014c094
export HIP_ARCHITECTURES="gfx942" # For MI300 only. Replace it with your architecture(s)
pip install --no-cache-dir --verbose .
I tried searching for the HIP_ARCHITECTURES variable by loading docker image and os.getenv(), but couldn't find any.
Hi Folks (especially AMD GPUs' trainer),
While installing/building
mamba-ssmfrom repository, if you have faced issues withcannot determine amdgcn architectureand the solution suggested beingconsider passing it via '--offload-arch'. Then here is the solution.First of all, try to checkout to a particular commit of mamba repo to avoid possible errors like
NameError: name 'bare_metal_version' is not definedand then export a variable that overrides the possibility of setting--offload_arch=nativeas done insetup.pyofmambarepo.Steps: