diff --git a/docs.it4i/lumi/pytorch.md b/docs.it4i/lumi/pytorch.md index 718d80ac24d1321fb463f33a6b41488d004b8b35..9f3711b0354243e3b011954a1eca6efb98e19ad4 100644 --- a/docs.it4i/lumi/pytorch.md +++ b/docs.it4i/lumi/pytorch.md @@ -9,6 +9,12 @@ * Official LUMI page: [https://lumi-supercomputer.github.io/LUMI-EasyBuild-docs/p/PyTorch/][3] * **Warning:** be careful where the SIF image is installed or copied ($HOME is not recommended for quota reasons). For EasyBuild you must specify the installation path: `export EBU_USER_PREFIX=/project/project_XXXX/EasyBuild`. +## CSC installed software collection + +- [https://docs.csc.fi/support/tutorials/ml-multi/][8] +- [https://docs.lumi-supercomputer.eu/software/local/csc/][9] +- [https://docs.csc.fi/apps/pytorch/][10] + ## PyTorch Install ### Base Environment @@ -182,8 +188,11 @@ export NCCL_DEBUG_SUBSYS=INIT ## References * Samuel Antao (AMD), LUMI Courses -* [https://lumi-supercomputer.github.io/LUMI-training-materials/4day-20230530/extra_4_10_Best_Practices_GPU_Optimization/][5] -* [https://lumi-supercomputer.github.io/LUMI-training-materials/4day-20231003/extra_4_10_Best_Practices_GPU_Optimization/][6] + * [https://lumi-supercomputer.github.io/LUMI-training-materials/4day-20230530/extra_4_10_Best_Practices_GPU_Optimization/][5] + * [https://lumi-supercomputer.github.io/LUMI-training-materials/4day-20231003/extra_4_10_Best_Practices_GPU_Optimization/][6] + +* Multi-GPU and multi-node machine learning by CSC + * [https://docs.csc.fi/support/tutorials/ml-multi/][11] [1]: https://pytorch.org/ [2]: https://github.com/pytorch/pytorch @@ -192,3 +201,7 @@ export NCCL_DEBUG_SUBSYS=INIT [5]: https://lumi-supercomputer.github.io/LUMI-training-materials/4day-20230530/extra_4_10_Best_Practices_GPU_Optimization/ [6]: https://lumi-supercomputer.github.io/LUMI-training-materials/4day-20231003/extra_4_10_Best_Practices_GPU_Optimization/ [7]: https://github.com/ROCmSoftwarePlatform/aws-ofi-rccl +[8]: https://docs.csc.fi/support/tutorials/ml-multi +[9]: https://docs.lumi-supercomputer.eu/software/local/csc/ +[10]: https://docs.csc.fi/apps/pytorch/ +[11]: https://docs.csc.fi/support/tutorials/ml-multi/