Looks like the LD library path is bad#1031
Conversation
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
| h-all HPC_OMPI_BIN="/apps/mpi/cuda/12.8.1/nvhpc/25.3/openmpi/5.0.7/bin" | ||
| h-all OMPI_MCA_pml=ob1 OMPI_MCA_coll_hcoll_enable=0 | ||
| h-gpu PATH="/apps/mpi/cuda/12.8.1/nvhpc/25.3/openmpi/5.0.7/bin:${PATH}" | ||
| h-all LD_LIBRARY_PATH=/apps/compilers/cuda/12.8.1/lib64:$LD_LIBRARY_PATH |
There was a problem hiding this comment.
Suggestion: To prevent potential runtime errors on non-GPU systems, scope the CUDA LD_LIBRARY_PATH setting to GPU hosts only by changing h-all to h-gpu. [possible issue, importance: 8]
| h-all LD_LIBRARY_PATH=/apps/compilers/cuda/12.8.1/lib64:$LD_LIBRARY_PATH | |
| h-gpu LD_LIBRARY_PATH=/apps/compilers/cuda/12.8.1/lib64:$LD_LIBRARY_PATH |
PR Compliance Guide 🔍Below is a summary of compliance checks for this PR:
Compliance status legend🟢 - Fully Compliant🟡 - Partial Compliant 🔴 - Not Compliant ⚪ - Requires Further Human Verification 🏷️ - Compliance label |
||||||||||||||||||||||||
User description
Description
Fixes an issue when calling MPI with multiple ranks on hipervator
PR Type
Bug fix
Description
Fixes LD_LIBRARY_PATH configuration for MPI multi-rank execution
Adds CUDA library path to environment on HiperGator cluster
Ensures proper library resolution for OpenMPI with NVHPC
Diagram Walkthrough
File Walkthrough
modules
Add CUDA library path to environment configurationtoolchain/modules
LD_LIBRARY_PATHenvironment variable configuration forHiperGator cluster
library resolution