How can I get the IP of the master node? #48

@WilliamZhaoz

When I try to read it from the environment variable AZ_BATCHAI_MPI_MASTER_NODE, I get the following error:

  File "src/scripts/submit_job/distributed.py", line 273, in set_environment_variables_for_nccl_backend
    os.environ['MASTER_ADDR'] = os.environ['AZ_BATCHAI_MPI_MASTER_NODE']
  File "/opt/conda/lib/python3.6/os.py", line 669, in __getitem__
    raise KeyError(key) from None
KeyError: 'AZ_BATCHAI_MPI_MASTER_NODE'
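
The KeyError above means the variable is simply not present in this process's environment, so indexing os.environ directly fails. A minimal sketch of a more defensive lookup follows. The helper name resolve_master_addr, its parameters, and the idea of passing a fallback address are all hypothetical and not part of the original script; which variables the cluster actually sets still has to be checked on the node itself (for example by printing os.environ).

```python
import os


def resolve_master_addr(environ=os.environ,
                        candidates=("AZ_BATCHAI_MPI_MASTER_NODE",),
                        default=None):
    """Return the first non-empty candidate variable, else `default`.

    `candidates` is a hypothetical, caller-supplied list of variable names;
    only AZ_BATCHAI_MPI_MASTER_NODE comes from the original script.
    """
    for name in candidates:
        value = environ.get(name)  # .get() returns None instead of raising KeyError
        if value:
            return value
    if default is not None:
        return default
    # Fail with a message that names what was tried, rather than a bare KeyError.
    raise RuntimeError(
        "None of these variables are set: " + ", ".join(candidates)
    )


if __name__ == "__main__":
    # Assumption: falling back to localhost is only sensible for a single-node run.
    os.environ["MASTER_ADDR"] = resolve_master_addr(default="127.0.0.1")
    print(os.environ["MASTER_ADDR"])
```

Passing the environment as a parameter keeps the lookup testable without touching the real process environment.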