[NLPL Task Force (A)] big array job

Asad Sayeed asad.sayeed at gu.se
Sat Feb 16 12:41:31 UTC 2019


Hi,

The abel documentation says users are allowed to run up to 400 jobs
simultaneously.  If I run arrayrun 4x on different segments of the
corpus, will I get myself into trouble with the authorities or
something?  Running 400 at a time would be a significant time saving
for me, obviously (about 2.5 days for the whole thing).
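
As a rough check on these estimates, here is a small Python sketch of
the arithmetic.  It uses the figures from the quoted message below
(3500 tasks, about 7 hours each) and assumes that tasks run in full
waves of equal length with no queue wait, which is a simplification.
The function name wall_time_days is made up for illustration.

```python
import math


def wall_time_days(n_tasks, concurrent, hours_per_task):
    """Estimate wall time in days.

    Assumption: tasks run in full waves of `concurrent` jobs, every
    task takes the same time, and there is no queue wait between waves.
    """
    waves = math.ceil(n_tasks / concurrent)
    return waves * hours_per_task / 24


# Figures from this thread: 3500 tasks at about 7 hours each on abel,
# with either 100 or 400 jobs running at once.
for concurrent in (100, 400):
    days = wall_time_days(3500, concurrent, 7)
    print(f"{concurrent} concurrent jobs: about {days:.1f} days")
```

Under these assumptions, 100 concurrent jobs comes out at a little over
10 days and 400 at a little over 2.5 days.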

Thanks.

Yours,
--Asad.


On 2019-02-16 01:29 PM, Asad Sayeed wrote:
> Hi Stephan,
>
> I am now trying to scale up my SRL task "for real" over 70M sentences,
> divided up into 3500 segments/tasks, each taking about 12G of memory
> and about 7 hours.  I am trying to run my script with arrayrun on
> abel.  However, it seems like arrayrun will only activate 100 jobs
> at a time.  At that rate the entire job will take about 10 days, which
> is slower than the much smaller cluster I was running it on elsewhere
> (where I can run about 300 at a time at about 14 hours each, for about
> 7 days in total).  I was hoping for a significantly faster turnaround
> for experimentation on abel.  Is there any way to get more concurrent
> jobs on abel, or is that a hard limit?
>
> Thanks.
>
> Yours,
> --Asad.
>



