Files
HuggingFace_transformer/tests
Oleksiy Syvokon ee5de0ba44 BERT decoder: Fix causal mask dtype.
PyTorch < 1.3 requires multiplication operands to be of the same type.
This was violated when using default attention mask (i.e.,
attention_mask=None in arguments) given BERT in the decoder mode.

In particular, this was breaking Model2Model and made tutorial
from the quickstart failing.
2020-02-11 15:19:22 -05:00
..
2020-02-04 16:38:52 -05:00
2020-02-04 18:05:35 -05:00
2020-02-04 18:05:35 -05:00
2020-02-07 15:28:13 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
2020-01-29 16:29:20 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
2020-01-11 03:43:57 +00:00