Oleksiy Syvokon
ee5de0ba44
BERT decoder: Fix causal mask dtype.
PyTorch < 1.3 requires both operands of a multiplication to have the
same dtype. This was violated when using the default attention mask
(i.e., attention_mask=None in the arguments) with BERT in decoder mode.
In particular, this broke Model2Model and caused the quickstart
tutorial to fail.
2020-02-11 15:19:22 -05:00
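
The dtype mismatch described in the commit message can be illustrated with a minimal sketch. This is not the actual diff from this commit: the helper names `causal_mask` and `combine_with_padding_mask` and the default `torch.float32` dtype are assumptions made for illustration. The idea is to build the lower-triangular causal mask from a comparison (which yields a boolean/byte tensor) and cast both it and the padding mask to one common dtype before multiplying them.

```python
import torch


def causal_mask(batch_size, seq_length, dtype=torch.float32, device=None):
    """Hypothetical helper: lower-triangular causal mask of shape
    (batch_size, seq_length, seq_length), cast to an explicit dtype."""
    seq_ids = torch.arange(seq_length, device=device)
    # mask[b, i, j] is True when key position j is not after query position i.
    mask = seq_ids[None, None, :].repeat(batch_size, seq_length, 1) <= seq_ids[None, :, None]
    # The comparison yields a bool/byte tensor; cast it so that it can be
    # multiplied with another mask of the same dtype.
    return mask.to(dtype)


def combine_with_padding_mask(attention_mask, dtype=torch.float32):
    """Hypothetical helper: combine a (batch, seq) padding mask with the
    causal mask, casting both operands to the same dtype first."""
    batch_size, seq_length = attention_mask.shape
    causal = causal_mask(batch_size, seq_length, dtype=dtype, device=attention_mask.device)
    padding = attention_mask.to(dtype)
    # Broadcast to (batch, 1, seq, seq); both operands now share one dtype.
    return causal[:, None, :, :] * padding[:, None, None, :]


# Default mask (all ones, integer dtype), as when attention_mask=None is replaced by ones.
default_mask = torch.ones(2, 3, dtype=torch.long)
extended = combine_with_padding_mask(default_mask)
print(extended.shape, extended.dtype)
```

Casting explicitly, rather than relying on automatic type promotion, keeps the multiplication valid on PyTorch versions that do not promote mixed dtypes.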