关于多卡训练卡死的问题

对代码进行了修改以训练自己数据，发现多卡训练总是不能进行下去，单卡是没问题。调试发现是卡在了下面这一步。
`msg = model.load_state_dict(checkpoint, strict=False)`
![image](https://user-images.githubusercontent.com/73874553/222131421-29783075-76d7-4fa0-bd15-0badda282c63.png)
请问是为什么呢？
训练命令为：
`CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.run --nproc_per_node 2 --master_port 12345 main.py`