采用指令 torchrun --standalone --nnodes=4 --nproc-per-node=8 train_pretrain_stage0.py 报错:torch.distributed.elastic.rendezvous.api.RendezvousTimeoutError
采用指令 torchrun --standalone --nnodes=4 --nproc-per-node=8 train_pretrain_stage0.py
报错:torch.distributed.elastic.rendezvous.api.RendezvousTimeoutError