[NLPL Task Force (A)] Issues with TensorFlow on Saga

Vinit Ravishankar vinitr at ifi.uio.no
Sun Oct 20 11:43:49 UTC 2019


Hi all,

I’ve been having some issues getting (other people’s) projects in TensorFlow to run on GPU. There’s two scenarios here:

1. My own anaconda environment with TensorFlow installed manually (this works fine for PyTorch, and, indeed, is my normal workflow): ImportError: /lib64/libm.so.6: version `GLIBC_2.23' not found

2. Using the tensorflow module: ImportError: libcuda.so.1: cannot open shared object file: No such file or directory

(2) is despite CUDA being loaded by the module (as far as I can tell, anyway).

How do I solve this? Additionally, it would also be cool to get multi-GPU support with Horovod (https://github.com/horovod/horovod), something I don’t believe works at the moment.

Thanks!

– Vinit





More information about the infrastructure mailing list