Skip to content

Latest commit

 

History

History
57 lines (50 loc) · 1.82 KB

IB_test.md

File metadata and controls

57 lines (50 loc) · 1.82 KB

Intel MPI

OpenMPI

  • ompi_info --display-map to display information of the current openmpi

IB command

  • ibstat

Nvidia Peer Memory

Status of IB and RDMA

NCCL test

sudo lspci | grep PLX
sudo lspci -vvv | grep ACSCtl

sudo setpci -s 03:00.0 f2a.w=0000
sudo setpci -s 04:08.0 f2a.w=0000
  • https://github.com/NVIDIA/nccl-tests
    • Setup NCC_HOME, MPI_HOME and build using make
    • ./build/all_reduce_perf -b 8 -e 128M -f 2 -g 4
    • mpirun -n 1 ./build/all_reduce_perf -b 8 -e 128M -f 2 -g 4
      • This uses 4 gpus per node

find ip address

  • ifconfig is deprecated. Use ip a
  • IB's ip is different than ethernet ip