Daniel Stancl
0c6c0afc0e
Add head_mask and decoder_head_mask to FSMT ( #9819 )
...
* Add {decoder_,}head_mask to fsmt_modeling.py
* Enable test_headmasking and some changes to docs
* Remove test_head_masking flag from fsmt test file
Remove test_head_masking flag from test_modeling_fsmt.py
since test_head_masking is set to be True by default (thus it is redundant to store).
* Merge master and remove test_head_masking = True
* Rebase necessary due to an update of jaxlib
* Remove test_head_masking=True in tests/test_modeling_fsmt.py
as it is redundant.
2021-02-01 09:30:21 +03:00
..
2021-01-27 11:28:11 +01:00
2021-01-27 03:20:09 -05:00
2021-02-01 01:31:29 +03:00
2021-01-12 18:19:38 -05:00
2021-01-28 06:11:52 -05:00
2021-01-08 07:40:59 -05:00
2021-01-08 07:40:59 -05:00
2021-01-08 07:40:59 -05:00
2021-02-01 01:31:29 +03:00
2021-02-01 01:31:29 +03:00
2021-01-27 21:25:11 +03:00
2021-01-08 07:40:59 -05:00
2021-01-27 10:45:42 +01:00
2021-01-27 11:28:11 +01:00
2021-01-20 10:18:50 -05:00
2021-01-19 09:40:15 -05:00
2021-01-27 11:28:11 +01:00
2021-01-27 10:45:42 +01:00
2021-01-28 06:11:52 -05:00
2021-01-08 07:40:59 -05:00
2021-01-27 10:45:42 +01:00
2021-02-01 09:30:21 +03:00
2021-01-27 11:28:11 +01:00
2021-01-27 10:45:42 +01:00
2021-01-08 07:40:59 -05:00
2021-01-19 17:11:22 -05:00
2021-01-27 10:45:42 +01:00
2021-01-27 11:28:11 +01:00
2021-01-27 11:28:11 +01:00
2021-02-01 01:31:29 +03:00
2021-02-01 01:31:29 +03:00
2021-01-08 07:40:59 -05:00
2021-01-27 11:28:11 +01:00
2021-01-27 11:28:11 +01:00
2021-01-27 04:09:56 -05:00
2021-01-27 11:28:11 +01:00
2021-02-01 01:31:29 +03:00
2021-01-08 07:40:59 -05:00
2021-01-21 11:13:38 +01:00
2021-01-26 20:32:46 +03:00
2021-01-19 09:40:15 -05:00
2021-01-08 07:40:59 -05:00
2021-01-28 06:11:52 -05:00
2021-01-08 07:40:59 -05:00
2021-01-27 10:45:42 +01:00
2021-01-19 09:40:15 -05:00
2021-01-27 10:45:42 +01:00
2021-01-27 11:28:11 +01:00
2020-12-11 16:59:54 +01:00
2021-01-08 07:40:59 -05:00
2021-01-27 10:45:42 +01:00
2021-01-27 03:20:09 -05:00