# Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT, Chronopoulou et al., 2020

Paper, Code, Tags: #nlp, #machine-translation

A pretrained high-resource monolingual LM is reused: its vocabulary is extended with tokens of the new language, it is fine-tuned on monolingual data of both languages, and it is then used to initialize a UNMT model. The authors call this approach RE-LM. It outperforms a competitive cross-lingual pretraining baseline (XLM) and also improves translation in a low-resource supervised setup.
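
A minimal sketch of the vocabulary-extension step, assuming HuggingFace `transformers` (not the authors' code; GPT-2 stands in for the pretrained high-resource LM, and the added tokens are hypothetical placeholders):

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Stand-in for the pretrained high-resource monolingual LM.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Hypothetical subword tokens learned on the low-resource language's corpus.
new_language_tokens = ["kalimera", "kosme"]
tokenizer.add_tokens(new_language_tokens)

# Grow the embedding (and tied output) matrix: rows for the new tokens are
# freshly initialized, while all pretrained rows keep their weights.
model.resize_token_embeddings(len(tokenizer))

# Next (not shown): fine-tune `model` on monolingual data of both languages,
# then use its weights to initialize the UNMT encoder and decoder.
```

The design choice this illustrates is that only the new embedding rows start from scratch; everything learned by the high-resource LM is preserved and adapted, which is what makes the approach viable when the new language has limited corpora.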