Setting UCX_NET_DEVICES to target only TCP devices when RoCE is available seems to be ignored in favour of some fallback.
I'm running a 2-node IMB_MPI PingPong to benchmark RoCE against regular TCP Ethernet.
Setting UCX_NET_DEVICES=all or mlx5_0:1 gives the optimal performance and uses RDMA as expected.
Setting UCX_NET_DEVICES=eth0, eth1, or anything else still appears to use RoCE, with only slightly higher latency.
HW information from ibstat or ibv_devinfo -vv command :
How ompi is configured from ompi_info | grep Configure :
Configured architecture: x86_64-pc-linux-gnu
Configured by: abuild
Configured on: Thu Aug 3 14:25:15 UTC 2023
Configure command line: '--prefix=/opt/ohpc/pub/mpi/openmpi4-gnu12/4.1.5'
'--disable-static' '--enable-builtin-atomics'
'--with-sge' '--enable-mpi-cxx'
'--with-hwloc=/opt/ohpc/pub/libs/hwloc'
'--with-libfabric=/opt/ohpc/pub/mpi/libfabric/1.18.0'
'--with-ucx=/opt/ohpc/pub/mpi/ucx-ohpc/1.14.0'
'--without-verbs' '--with-tm=/opt/pbs/'
Following the advice from here, this is apparently due to the higher priority of Open MPI's btl/openib component, but I don't think that can be the cause: Open MPI was built with --without-verbs, and openib does not appear in the output of ompi_info | grep btl.
As suggested in the UCX issue, adding -mca pml_ucx_tls any -mca pml_ucx_devices any to my mpirun command fixed the problem, but I would like to know what precisely in the MCA causes this behaviour.
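For anyone reproducing this, the workaround can be sketched as a small dry-run helper that only prints the mpirun command line for a given device, so the two cases can be compared side by side. This is a sketch under assumptions, not my actual batch script: the BENCH path and the process count are placeholders, and -mca pml ucx is an extra I've added here so that a job fails loudly instead of silently falling back if the UCX PML cannot be used.

```shell
#!/bin/sh
# Dry-run helper: prints the mpirun command line instead of executing it.
# BENCH is a placeholder path (assumption); point it at your IMB binary.
BENCH="${BENCH:-./IMB-MPI1}"

# build_cmd DEVICE
# Prints an mpirun command that restricts UCX to DEVICE and relaxes the
# UCX PML's transport/device filters (the two pml_ucx_* parameters above).
build_cmd() {
    printf '%s\n' "mpirun -np 2 -x UCX_NET_DEVICES=$1 -mca pml ucx -mca pml_ucx_tls any -mca pml_ucx_devices any $BENCH PingPong"
}

build_cmd eth0       # TCP over eth0
build_cmd mlx5_0:1   # RoCE over the mlx5 port
```

Copying one of the printed lines into the job script runs that case; dropping the two pml_ucx_* options from it should bring back the behaviour described above.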