HuggingFace_transformer/tests
Yih-Dar 4cdb67caba Use cross_attention_hidden_size in Encoder-Decoder models (#14378)
* add cross_attention_hidden_size to text-2-text encoder-decoder models (PT/Flax)

* for TFEncoderDecoderModel

* add equivalence test for TFEncoderDecoderModel

* fix

* fix failed equivalence tests

* remove unused import

* add detailed comment

* Fix check_equivalence_tf_to_pt by using encoder/decoder

* cleaning

* Use cross_attention_hidden_size in speech-to-text

* clean fast init logging msg in encoder decoder models

* increase tol from 1e-5 to 1e-3 for tf test

* style

* style

* make sure projection layer can run

* remove type conversion + add check

* fix conflict (config.output_hidden_size)

* Remove TF -> PT in check_pt_tf_equivalence for TFEncoderDecoderModel

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2021-12-07 00:27:32 +01:00