[NLPL Task Force (A)] [rt.uio.no #3406027] gpu usage on Abel
Stephan Oepen via RT
hpc-drift at usit.uio.no
Wed May 15 22:46:07 UTC 2019
dear colleagues,
> User notified.
i am reopening this ticket as it may look as over-allocation of gpus
by this user continues:
[oe at login-0-0 ~]$ squeue --partition=accel | grep michaelm
26981933 accel pe_con michaelm PD 0:00 1 (Priority)
26981931 accel pe_con michaelm R 1-10:45:37 1 c19-15
26981930 accel pe_con michaelm R 1-11:25:20 1 c19-16
26981928 accel st_con michaelm R 1-11:53:02 1 c19-5
26981929 accel pe_con michaelm R 1-11:53:02 1 c19-11
26981926 accel st_con michaelm R 1-11:53:49 1 c19-3
26981927 accel st_con michaelm R 1-11:53:49 1 c19-8
26981924 accel st_con michaelm R 1-11:54:36 1 c19-14
[oe at login-0-0 ~]$ for i in 3 5 8 11 14 15 16; do ssh c19-${i}
nvidia-smi | grep Default; done
| N/A 31C P0 86W / 235W | 673MiB / 5699MiB | 80% Default |
| N/A 19C P8 18W / 235W | 11MiB / 5699MiB | 0% Default |
| N/A 30C P0 85W / 235W | 673MiB / 5699MiB | 82% Default |
| N/A 18C P8 18W / 235W | 11MiB / 5699MiB | 0% Default |
| N/A 31C P0 88W / 235W | 673MiB / 5699MiB | 83% Default |
| N/A 18C P8 18W / 235W | 11MiB / 5699MiB | 0% Default |
| N/A 46C P0 96W / 235W | 1128MiB / 5699MiB | 82% Default |
| N/A 26C P8 17W / 235W | 11MiB / 5699MiB | 0% Default |
| N/A 34C P0 90W / 235W | 673MiB / 5699MiB | 82% Default |
| N/A 20C P8 18W / 235W | 11MiB / 5699MiB | 0% Default |
| N/A 35C P0 96W / 235W | 1128MiB / 5699MiB | 86% Default |
| N/A 20C P8 17W / 235W | 11MiB / 5699MiB | 0% Default |
| N/A 33C P0 90W / 235W | 1128MiB / 5699MiB | 84% Default |
| N/A 20C P8 18W / 235W | 11MiB / 5699MiB | 0% Default |
i do not want to police other abel users. but unless i mis-read the
above, this usage pattern unnecessarily blocks close to half of one of
the currently most precious resources available.
best wishes (from copenhagen), oe
More information about the infrastructure
mailing list