<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=utf-8"><meta name=Generator content="Microsoft Word 15 (filtered medium)"><style></style></head><body lang=FI link=blue vlink="#954F72"><div class=WordSection1>I think this is a good move. While you could probably fit enough examples per batch by using 2 or 3 times the cards, the half-precision performance of the V100s is a good deal better. Plus Puhti is a system we know we can run this on.<o:p></o:p><o:p> </o:p>-Antti<o:p> </o:p><div style='mso-element:para-border-div;border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm'>From: <a href="mailto:oe@ifi.uio.no">Stephan Oepen</a> Sent: perjantai 31. tammikuuta 2020 19.53 To: <a href="mailto:sajvir@utu.fi">Antti Virtanen</a> Cc: <a href="mailto:andreku@ifi.uio.no">Andrei Kutuzov</a>; <a href="mailto:infrastructure@nlpl.eu">infrastructure</a>; <a href="mailto:outreach@nlpl.eu">outreach@nlpl.eu</a> Subject: Re: rolling your own BERT (and maybe ELMo) on Saga</div><o:p> </o:p>belatedly, thanks for the link, antti!<o:p> </o:p>> Here's a (quick and dirty) repo for the code we used to train FinBERT: https://github.com/haamis/DeepLearningExamples_FinBERT/tree/master/TensorFlow/LanguageModeling/BERT_nonscaling. This one has the sbatch files used: https://github.com/haamis/BERT-pretraining<o:p> </o:p>the README claims that V100 gpus are required, whereas Saga only hasP100 cards. so i just gave up on the prospect of getting this to runin norway and rather requested an allocation of billing units on Puhti:-). i think for your presentation next week, you can just assumethat Saga is not a relevant target system for this work.<o:p> </o:p>looking forward to the tutorial! oe<o:p> </o:p></div></body></html>