Thomas Wolf
|
6ffe03a0a1
|
Merge pull request #3137 from tomhosking/bart-refactor
Refactor BartModel so that input checks are handled within enc/dec
|
2020-03-06 13:06:34 +01:00 |
|
Sam Shleifer
|
857e0a0d3b
|
Rename BartForMaskedLM -> BartForConditionalGeneration (#3114)
* improved documentation
|
2020-03-05 17:41:18 -05:00 |
|
sshleifer
|
14d40584b2
|
remove newline
|
2020-03-05 13:06:35 -05:00 |
|
sshleifer
|
1360dacaa3
|
cleanup deltas
|
2020-03-05 12:57:42 -05:00 |
|
sshleifer
|
810079de1f
|
no ipdb
|
2020-03-05 12:48:14 -05:00 |
|
sshleifer
|
c36fdc88d4
|
tests pass
|
2020-03-05 12:33:08 -05:00 |
|
Tom Hosking
|
06a6cb6f36
|
Refactor BartModel so that input checks are handled within BartEncoder and BartDecoder
|
2020-03-05 13:45:41 +00:00 |
|
Sam Shleifer
|
e9e6efdc45
|
BartForSequenceClassification: fix num_labels, add test (#3110)
|
2020-03-03 15:54:29 -05:00 |
|
Sam Shleifer
|
5c5af879b6
|
[Bart] dont call .forward (#3094)
|
2020-03-03 15:14:12 -05:00 |
|
Sam Shleifer
|
b54ef78d0c
|
Bart-CNN (#3059)
`generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.
|
2020-03-02 10:35:53 -05:00 |
|
Julien Chaumond
|
9cda3620b6
|
Fix (non-slow) tests on GPU (torch) (#3024)
* Fix tests on GPU (torch)
* Fix bart slow tests
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-02-26 11:59:25 -05:00 |
|
Sam Shleifer
|
92487a1dc0
|
Bart: fix layerdrop and cached decoder_input_ids for generation (#2969)
|
2020-02-22 16:25:04 -05:00 |
|
Sam Shleifer
|
53ce3854a1
|
New BartModel (#2745)
* Results same as fairseq
* Wrote a ton of tests
* Struggled with api signatures
* added some docs
|
2020-02-20 18:11:13 -05:00 |
|