[NLPL Task Force (A)] PyTorch on CPU
Martin Matthiesen
martin.matthiesen at csc.fi
Tue Sep 25 07:47:25 UTC 2018
Hi Yves,
Thanks very much for the write-up, this was useful to me.
Martin
--
Martin Matthiesen
CSC - Tieteen tietotekniikan keskus
CSC - IT Center for Science
PL 405, 02101 Espoo, Finland
+358 9 457 2376, martin.matthiesen at csc.fi
Public key : https://pgp.mit.edu/pks/lookup?op=get&search=0x74B12876FD890704
Fingerprint: AA25 6F56 5C9A 8B42 009F BA70 74B1 2876 FD89 0704
> From: "Yves Scherrer" <yves.scherrer at helsinki.fi>
> To: "Martin Matthiesen" <martin.matthiesen at csc.fi>, "infrastructure"
> <infrastructure at nlpl.eu>
> Cc: "Markus Koskela" <markus.koskela at csc.fi>
> Sent: Tuesday, 25 September, 2018 10:07:41
> Subject: RE: PyTorch on CPU
> Hi,
> I see basically three use cases for PyTorch (and similar libraries) on CPU:
> * Rapid prototyping and testing (without having to use SLURM scripts). Using a
> combination of direct running on login nodes (I know, you CSC guys don’t like
> that…) and gputest queue should do the trick here.
> * Teaching: it would be nice if students could run PyTorch directly from
> taito-shell, as they are used to for other things. This is low-priority as we
> don’t use PyTorch for teaching at the moment, but may do so in the near future.
> * The most important point, though, is that running PyTorch on CPU allows us to
> benefit from the shorter queues on these nodes and from the “cheaper” billing.
> In machine translation for example, training new models requires GPU, but using
> these models to translate text usually doesn’t benefit that much from GPU. In
> these cases, we have often switched to CPU mode. These are typically very long
> runs for which the current GPU billing scheme is a bit prohibitive. (I have
> even been told by one of the CSC guys once that I wasn’t efficiently using GPU
> and should therefore switch to CPU instead. This was not with PyTorch, but in a
> similar setting with Theano.)
> Best,
> Yves
> From: Martin Matthiesen <martin.matthiesen at csc.fi>
> Sent: Friday, September 21, 2018 6:01:06 PM
> To: infrastructure
> Cc: Scherrer, Yves; Markus Koskela
> Subject: PyTorch on CPU
> Hello,
> I discussed CPU support for PyTorch with Markus today and he told me that
> PyTorch is working on a CPU-only environment. Case in point is the
> taito-gpu.csc.fi login node which does not have GPU hardware. It does not work
> on taito.csc.fi and it is a bit unclear to us, why. There is a missing symbol,
> but ldd shows that all libraries (and the same ones as on taito-gpu) are found.
> I would like to still ask, why cpu-only support is important. I understood it is
> for rapid-prototyping, but would Taito-gpu's gputest queue[1] effectively
> achieve the same? The queue is meant for very short runs only (max 15 mins).
> Have a nice weekend!
> Martin
> [1] [ https://research.csc.fi/taito-gpu-running?inheritRedirect=true |
> https://research.csc.fi/taito-gpu-running?inheritRedirect=true ]
> --
> Martin Matthiesen
> CSC - Tieteen tietotekniikan keskus
> CSC - IT Center for Science
> PL 405, 02101 Espoo, Finland
> +358 9 457 2376, martin.matthiesen at csc.fi
> Public key : [ https://pgp.mit.edu/pks/lookup?op=get&search=0x74B12876FD890704 |
> https://pgp.mit.edu/pks/lookup?op=get&search=0x74B12876FD890704 ]
> Fingerprint: AA25 6F56 5C9A 8B42 009F BA70 74B1 2876 FD89 0704
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.nlpl.eu/archives/infrastructure/attachments/20180925/a89c32b5/attachment.htm>
More information about the infrastructure
mailing list