diff --git a/docs/source/model_summary.rst b/docs/source/model_summary.rst index 79bc495865..4a675c797a 100644 --- a/docs/source/model_summary.rst +++ b/docs/source/model_summary.rst @@ -552,6 +552,31 @@ input becomes “My very .” and the target input becomes “ dog is The library provides a version of this model for conditional generation. +MBart +---------------------------------------------- + +.. raw:: html + + + Models + + + Doc + + +`Multilingual Denoising Pre-training for Neural Machine Translation `_ by Yinhan Liu, Jiatao Gu, Naman Goyal, Xian Li, Sergey Edunov +Marjan Ghazvininejad, Mike Lewis, Luke Zettlemoyer. + +The model architecture and pre-training objective is same as BART, but MBart is trained on 25 languages +and is intended for supervised and unsupervised machine translation. MBart is one of the first methods +for pre-training a complete sequence-to-sequence model by denoising full texts in multiple languages, + +The library provides a version of this model for conditional generation. + +The `mbart-large-en-ro checkpoint `_ can be used for english -> romanian translation. + +The `mbart-large-cc25 `_ checkpoint can be finetuned for other translation and summarization tasks, using code in ```examples/seq2seq/``` , but is not very useful without finetuning. + .. _multimodal-models: Multimodal models