From 63e539459d5d241eab8c523be027a9bdfc7df3b4 Mon Sep 17 00:00:00 2001 From: Patrick von Platen Date: Thu, 10 Sep 2020 16:34:28 +0200 Subject: [PATCH] Update README.md --- model_cards/google/roberta2roberta_L-24_gigaword/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model_cards/google/roberta2roberta_L-24_gigaword/README.md b/model_cards/google/roberta2roberta_L-24_gigaword/README.md index 84cdce380f..948ee80bcf 100644 --- a/model_cards/google/roberta2roberta_L-24_gigaword/README.md +++ b/model_cards/google/roberta2roberta_L-24_gigaword/README.md @@ -8,7 +8,7 @@ datasets: # Roberta2Roberta_L-24_gigaword EncoderDecoder model The model was introduced in -[this paper](https://arxiv.org/abs/1907.12461) by Sascha Rothe, Shashi Narayan, Aliaksei Severyn and first released in [this repository](https://tfhub.dev/google/bertseq2seq/roberta24_bbc/1). +[this paper](https://arxiv.org/abs/1907.12461) by Sascha Rothe, Shashi Narayan, Aliaksei Severyn and first released in [this repository](https://tfhub.dev/google/bertseq2seq/roberta24_gigaword/1). The model is an encoder-decoder model that was initialized on the `roberta-large` checkpoints for both the encoder and decoder and fine-tuned on headline generation using the Gigaword dataset, which is linked above.