From c88f05163d5a90f3d18ef567e1e71d25541f48d3 Mon Sep 17 00:00:00 2001
From: thomwolf
Date: Mon, 16 Sep 2019 13:42:20 +0200
Subject: [PATCH] fix typo in XLM models

---
 docs/source/pretrained_models.rst | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/docs/source/pretrained_models.rst b/docs/source/pretrained_models.rst
index 4222ee32cf..d6e273797f 100644
--- a/docs/source/pretrained_models.rst
+++ b/docs/source/pretrained_models.rst
@@ -79,10 +79,10 @@ Here is the full list of the currently provided pretrained models together with
 |                   |                                                            | | XLM English model                                                                                                                   |
 |                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``xlm-mlm-ende-1024``                                      | | 6-layer, 1024-hidden, 8-heads                                                                                                       |
-|                   |                                                            | | XLM English-German Multi-language model                                                                                             |
+|                   |                                                            | | XLM English-German model trained on the concatenation of English and German wikipedia                                               |
 |                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``xlm-mlm-enfr-1024``                                      | | 6-layer, 1024-hidden, 8-heads                                                                                                       |
-|                   |                                                            | | XLM English-French Multi-language model                                                                                             |
+|                   |                                                            | | XLM English-French model trained on the concatenation of English and French wikipedia                                               |
 |                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``xlm-mlm-enro-1024``                                      | | 6-layer, 1024-hidden, 8-heads                                                                                                       |
 |                   |                                                            | | XLM English-Romanian Multi-language model                                                                                           |
@@ -93,11 +93,11 @@ Here is the full list of the currently provided pretrained models together with
 |                   | ``xlm-mlm-tlm-xnli15-1024``                                | | 12-layer, 1024-hidden, 8-heads                                                                                                      |
 |                   |                                                            | | XLM Model pre-trained with MLM + TLM on the `15 XNLI languages `__.                                                                 |
 |                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
-|                   | ``xlm-clm-enfr-1024``                                      | | 12-layer, 1024-hidden, 8-heads                                                                                                      |
-|                   |                                                            | | XLM English model trained with CLM (Causal Language Modeling)                                                                       |
+|                   | ``xlm-clm-enfr-1024``                                      | | 6-layer, 1024-hidden, 8-heads                                                                                                       |
+|                   |                                                            | | XLM English-French model trained with CLM (Causal Language Modeling) on the concatenation of English and French wikipedia           |
 |                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``xlm-clm-ende-1024``                                      | | 6-layer, 1024-hidden, 8-heads                                                                                                       |
-|                   |                                                            | | XLM English-German Multi-language model trained with CLM (Causal Language Modeling)                                                 |
+|                   |                                                            | | XLM English-German model trained with CLM (Causal Language Modeling) on the concatenation of English and German wikipedia           |
 +-------------------+------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 | RoBERTa           | ``roberta-base``                                           | | 12-layer, 768-hidden, 12-heads, 125M parameters                                                                                     |
 |                   |                                                            | | RoBERTa using the BERT-base architecture                                                                                            |