[NLPL Task Force (A)] automated synchronization of NLPL vectors repository

Filip Ginter figint at utu.fi
Wed Apr 11 09:48:33 UTC 2018


Hi guys

Before, when I needed English vectors from CoNLL17, I read the file
vectors/CoNLL17/en.vectors.xz on Taito. Stephan was unhappy about this and
asked me to delete the vectors from /proj/nlpl. Now, to achieve the same, I
need to parse a JSON file helpfully named 11.json, which eventually tells
me the vectors are in the file 11/40.zip, which I then need to unzip to
get my vectors. That is not what I would call an improvement over
vectors/CoNLL17/en.vectors.xz. :D  Not to complain or anything: I will
grab my own copy of these vectors from CoNLL and ignore the /proj/nlpl
version, but you may want to be aware that this scheme of nested,
numbered files does not suit a script-driven workflow. :)
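
For anyone scripting against the new layout, the lookup described above
might be sketched roughly as follows in Python. The schema of 11.json is
not given in this message, so the "models", "id", "language" and "corpus"
keys are assumptions made purely for illustration, as are both function
names and the guess that the archive directory is named after the
catalogue file (11.json -> 11/).

```python
import json
import zipfile
from pathlib import Path


def find_archive(catalogue, directory, language, corpus):
    """Return the relative path of the first matching model archive.

    ASSUMPTION: the catalogue is a dict with a "models" list whose entries
    carry "id", "language" and "corpus" keys. The real schema may differ.
    """
    for model in catalogue["models"]:
        if model["language"] == language and model["corpus"] == corpus:
            return f"{directory}/{model['id']}.zip"
    raise KeyError(f"no model for {language!r} / {corpus!r}")


def extract_vectors(root, catalogue_name, language, corpus, dest):
    """Resolve a model through the catalogue, unzip it, list its members."""
    root = Path(root)
    catalogue = json.loads((root / catalogue_name).read_text(encoding="utf-8"))
    # ASSUMPTION: archives live in a directory named after the catalogue
    # file, e.g. 11.json -> 11/40.zip, as in the example from this message.
    directory = Path(catalogue_name).stem
    archive = root / find_archive(catalogue, directory, language, corpus)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest)
        return zf.namelist()
```

A call such as extract_vectors("/proj/nlpl/data/vectors", "11.json",
"eng", "CoNLL17", "out") would then stand in for the single path that used
to be enough; the language and corpus labels in that call are likewise
placeholders.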

Cheers

F


On Thu, Mar 8, 2018 at 10:06 AM, Filip Ginter <figint at utu.fi> wrote:

> Hi Stephan
>
> Wiped.
>
> I have my own copy to run my scripts, which pays off now. :D
>
> Cheers
>
> F
>
>
> On Tue, Mar 6, 2018 at 4:07 PM, Stephan Oepen <oe at ifi.uio.no> wrote:
>
>> hi filip,
>>
>> andrey and i have a new release candidate of the NLPL vectors
>> repository ready for public announcement; we presented the emerging
>> structure at the NLPL walk-through during the winter school.  for some
>> general information, please see:
>>
>>   http://wiki.nlpl.eu/index.php/Vectors/home
>>
>> to actually look at the current set of files, you will have to log in
>> to Abel.  but i would like to change that and make
>> ‘/proj/nlpl/data/vectors/’ a replica of the corresponding Abel
>> directory, using rsync(1).
>>
>> to accomplish that, could i ask you to remove the contents of the
>> current ‘/proj/nlpl/data/vectors/’ on Taito please?  we have included
>> the CoNLL 2017 embeddings in version 1.1 of the NLPL repository, so
>> putting a copy of the NLPL vectors repository on Taito will still make
>> those models available (albeit using a different naming scheme, of
>> course).
>>
>> with thanks in advance, oe
>>
>
>