HuggingFace_transformer/docs/source
Yossi Synett bc0d26d1de [All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071)
* Output cross-attention with decoder attention output

* Update src/transformers/modeling_bert.py

* add cross-attention for t5 and bart as well

* fix tests

* correct typo in docs

* add sylvains and sams comments

* correct typo

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-11-06 19:34:48 +01:00