Upload DistilBART artwork (#5394)
This commit is contained in:
@@ -1,3 +1,5 @@
|
|||||||
|
## Sequence to Sequence
|
||||||
|
|
||||||
This directory contains examples for finetuning and evaluating transformers on summarization and translation tasks.
|
This directory contains examples for finetuning and evaluating transformers on summarization and translation tasks.
|
||||||
Summarization support is more mature than translation support.
|
Summarization support is more mature than translation support.
|
||||||
Please tag @sshleifer with any issues/unexpected behaviors, or send a PR!
|
Please tag @sshleifer with any issues/unexpected behaviors, or send a PR!
|
||||||
@@ -168,6 +170,7 @@ python run_eval.py sshleifer/distilbart-cnn-12-6 $DATA_DIR/val.source dbart_val_
|
|||||||
|
|
||||||
|
|
||||||
### DistilBART
|
### DistilBART
|
||||||
|

|
||||||
|
|
||||||
For the CNN/DailyMail dataset, (relatively longer, more extractive summaries), we found a simple technique that works:
|
For the CNN/DailyMail dataset, (relatively longer, more extractive summaries), we found a simple technique that works:
|
||||||
you just copy alternating layers from `bart-large-cnn` and finetune more on the same data.
|
you just copy alternating layers from `bart-large-cnn` and finetune more on the same data.
|
||||||
|
|||||||
Reference in New Issue
Block a user