Files
HuggingFace_transformer/docs/source/main_classes
Yossi Synett bc0d26d1de [All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071)
* Output cross-attention with decoder attention output

* Update src/transformers/modeling_bert.py

* add cross-attention for t5 and bart as well

* fix tests

* correct typo in docs

* add sylvains and sams comments

* correct typo

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-11-06 19:34:48 +01:00
..
2020-09-23 13:20:45 -04:00
2020-10-26 18:26:02 -04:00
2020-09-23 13:20:45 -04:00
2020-10-26 18:26:02 -04:00
2020-10-26 18:26:02 -04:00
2020-10-26 18:26:02 -04:00
2020-10-26 18:26:02 -04:00