[NLPL Task Force (A)] OPUS copy on Abel

Stephan Oepen oe at ifi.uio.no
Mon Feb 11 19:08:20 UTC 2019


hi joerg,

our NLPL partition on Abel has hit the disk quota limit (two
terabytes), which means we cannot install software updates.  i am
afraid i would like to propose that we further restrict the OPUS
mirror on Abel, as it accounts for by far the biggest ‘chunk’ of NLPL
data (715 gigabytes currently on Abel).  would it make sense to just
keep the XML variants of the data (i am guessing the fairly bulky
‘moses’ and ‘raw’ variants are derived)?

more generally, i was planning to suggest that we move to automated
mirroring of the most important parts of OPUS from Taito to Abel, as
we do for most of the other data sub-directories now.  could you (a)
suggest an rsync(1) command to selectively copy from Taito to Abel and
(b) temporarily ‘\rm -rf /projects/nlpl/data/OPUS’ on Abel?
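
for illustration only, a selective copy could look roughly like the
sketch below.  it assumes that the derived variants live in
sub-directories literally named ‘moses’ and ‘raw’ below each corpus,
and the source path on Taito and the Abel host name are placeholders
that would need checking; opus_mirror is just a hypothetical helper
name.

```shell
#!/bin/sh
# sketch only: the source path, the host name, and the per-corpus
# sub-directory names ('moses', 'raw') are assumptions, not verified.
SRC="/proj/nlpl/data/OPUS/"                   # assumed OPUS location on Taito
DEST="abel.uio.no:/projects/nlpl/data/OPUS/"  # assumed host; path as used on Abel

opus_mirror() {
  # -a                  recurse and preserve times, permissions, symlinks
  # --exclude='NAME/'   skip any directory called NAME at any depth
  # --prune-empty-dirs  do not create directories that end up empty
  # extra options (e.g. --dry-run) can be passed as arguments
  rsync -a --prune-empty-dirs \
    --exclude='moses/' --exclude='raw/' \
    "$@" "$SRC" "$DEST"
}
```

calling ‘opus_mirror --dry-run -v’ first would list what rsync(1)
intends to transfer without copying anything.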

i could then include the rsync(1) in my nightly cron(5) job on Taito,
such that for the selected parts at least the two copies would remain
synchronized (because the cron(5) job runs in my account, i will have
to be the owner of the rsync(1) target directory on Abel).
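
as a rough sketch of the nightly entry, a crontab(5) line on Taito
might look like the following; the script name, the log file, and the
time of day are all hypothetical, and the job would also need
password-less ssh(1) access from Taito to Abel.

```
# minute hour day-of-month month day-of-week command
15 3 * * * $HOME/bin/opus-mirror.sh >>$HOME/opus-mirror.log 2>&1
```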

best wishes, oe



