A universal cross-language representation leads to better multilingual translation performance. For English-centric directions, mCOLT achieves competitive or even better performance than the strong pre-trained model mBART. For non-English directions, the model achieves an average improvement of 10+ BLEU over the multilingual baseline.

Author(s): Xiao Pan, Mingxuan Wang, Liwei Wu, Lei Li

