Conversation
|
Although we don't have the symlinks yet, I can actually already test this in the container - it will just create the symlinks in What I did: Works as intended. After implementing the variant symlinks, we should retest, try to use the |
|
Tested in the container using EESSI 2025.06 and without having configured the variant symlinks: With the variant symlink reconfigured as Wiping that dir and doing it again using Also checked the symlinks, and the pointed to the expected locations. |
| # Do some checks on existence of links and that we don't end up at /dev/null (the default), so we can print some informative information | ||
| # One downside is that we can't explicitely check if something is a variant symlink, so we'll just assume that if it's a link AND it | ||
| # lives in our CVMFS repository, it must be a variant symlink | ||
| nvidia_trusted_dir="${EESSI_EPREFIX}/lib/nvidia" |
There was a problem hiding this comment.
Does this mean that the script will no longer work for 2023.06?
There was a problem hiding this comment.
Hm, yeah, that's annoying, this script is in an unversioned prefix. I mean, if we deploy this only for 2025.06, we keep the old version for 2023.06. But then if we want to update that, we have to revert all changes, etc. Maybe we should just duplicate the script? I.e. create something like scripts/gpu_support/nvidia/2023.06/link_nvidia_host_libraries.sh? Or should it be at higher level 2023.06/scripts/gpu_support...?
Co-authored-by: Bob Dröge <b.e.droge@rug.nl>
Co-authored-by: Bob Dröge <b.e.droge@rug.nl>
We'll need the following variant symlinks to be in place before this script can work as intended:
And then:
This can then be quite easily tested from within the container:
This should error out stating that the variant symlink resolves to
/dev/null. Then, you can change/etc/cvmfs/default.localto set e.g.EESSI_NVIDIA_OVERRIDE_DEFAULT(e.g. to/opt/eessi/nvidia) and run the linking script again - this should the install the symlinks.