[NLPL Task Force (A)] [rt.uio.no #3396801] newer CUDA versions on Abel

Stephan Oepen via RT hpc-drift at usit.uio.no
Mon May 13 21:18:33 UTC 2019


many thanks, ole!  i tried installing TensorFlow against the new CUDA
module, but unfortunately TensorFlow and i appear a little difficult
to please:

ImportError: libcublas.so.10.0: cannot open shared object file: No
such file or directory

they do in fact specify CUDA 10.0 as the requirement for the latest
TensorFlow release (1.13), so i guess they are just being cautious and
stubborn.

could you bring yourself to also putting the CUDA 10.0 libraries into
yet another module?

with thanks in advance, oe

On Mon, May 13, 2019 at 9:26 AM Ole Saastad via RT
<hpc-drift at usit.uio.no> wrote:
>
> On Sun, 2019-05-12 at 13:17 +0200, Stephan Oepen via RT wrote:
> > >
> > so, is it actually not so easy to just provision the CUDA 10
> > libraries
> > as another module, without touching the drivers?
>
> Done (10.1, not tested), this is just installing some software in a
> directory. Updating the driver is a bit more intrusive. It require
> downtime and reboot of the gpu compute nodes.
>
> > or are we running up
> > against a matter of principle here, no more software updates on Abel?
> >
> Yes, you are right, with only months left og it's lifetime we spend
> most of our effort on installing and setting up the new systems.
> Plans was to shut down Abel more than a year ago.
>
> Regards,
> Ole
>
>
> > cheers, oe
> >
> >
> --
> Ole W. Saastad, Dr.Scient.
> UiO/USIT/UVA/ITF/FI
> Besøk: Kristen Nygaards hus - Rom 2315
> Post: Gaustadalléen 23A, 0349 Oslo
> USIT, Postboks 1059 Blindern, 0316 Oslo
> Tel: +47-22840752
>
>
>





More information about the infrastructure mailing list