From 331d8d2936e7a140225cf60301ba6469930fd216 Mon Sep 17 00:00:00 2001 From: Kevin Canwen Xu Date: Tue, 30 Jun 2020 18:11:11 +0800 Subject: [PATCH] Upload DistilBART artwork (#5394) --- examples/seq2seq/README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/examples/seq2seq/README.md b/examples/seq2seq/README.md index 8cfaf82b5e..29866c09a9 100644 --- a/examples/seq2seq/README.md +++ b/examples/seq2seq/README.md @@ -1,3 +1,5 @@ +## Sequence to Sequence + This directory contains examples for finetuning and evaluating transformers on summarization and translation tasks. Summarization support is more mature than translation support. Please tag @sshleifer with any issues/unexpected behaviors, or send a PR! @@ -168,6 +170,7 @@ python run_eval.py sshleifer/distilbart-cnn-12-6 $DATA_DIR/val.source dbart_val_ ### DistilBART +![DBART](https://huggingface.co/front/thumbnails/distilbart_large.png) For the CNN/DailyMail dataset, (relatively longer, more extractive summaries), we found a simple technique that works: you just copy alternating layers from `bart-large-cnn` and finetune more on the same data.