diff --git a/docs.it4i/cs/guides/grace.md b/docs.it4i/cs/guides/grace.md
index 570b8b1a0dcd5b479cdcc8950ff364f30e153358..ac560fb2de766fca37e4ebf6f0ad2d39dc6327cd 100644
--- a/docs.it4i/cs/guides/grace.md
+++ b/docs.it4i/cs/guides/grace.md
@@ -86,6 +86,21 @@ The [NVPL](https://developer.nvidia.com/nvpl) package includes more extensive se
 
 This package should be compatible with all availiable toolchains and includes CMake module files for easy integration into CMake-based projects. For further documentation see also [NVPL](https://docs.nvidia.com/nvpl).
 
+### Recommended BLAS Library
+
+We recommend using the multi-threaded BLAS library from the NVPL package.
+
+!!! note
+    It is important to pin the OpenMP threads using **OMP_PROC_BIND=spread**.
+
+Example:
+
+```console
+$ ml NVHPC
+$ nvc -O3 -march=native myprog.c -o myprog -lnvpl_blas_lp64_gomp
+$ OMP_PROC_BIND=spread ./myprog
+```
+
 ## Basic Communication Libraries
 
 The OpenMPI 4 implementation is included with NVHPC toolchain and is exposed as a module (`ml OpenMPI`). The following example