[NLPL Task Force (A)] [uninett.no #196965] Tensorflow issues, pt. 2

Vinit Ravishankar vinitr at ifi.uio.no
Fri Oct 25 07:39:40 UTC 2019


This is when I test it on the accel nodes unfortunately, though the error vanishes in 1.14.0.

On 25 Oct 2019 09:13, "Henrik R. Nagel via RT" <support at metacenter.no> wrote:
Hi,

> Unfortunately, when I downgrade (again, using conda), I can no longer
> import tensorflow, because it can’t find libcuda.so.1.

The libcuda.so.1 is located in /usr/lib64, but only on the accel compute nodes and not on the login nodes. If you tested the downgraded version of TensorFlow on a login node, then that is the reason why it didn't work. You must run the downgraded version of TensorFlow as a batch job on the "accel" partition or request an interactive compute node there.

Best regards,

Henrik
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.nlpl.eu/archives/infrastructure/attachments/20191025/687c44c6/attachment.htm>


More information about the infrastructure mailing list