Files
HuggingFace_transformer/tests
Guido Novati ecd6efe7cb Fix megatron_gpt2 attention block's causal mask (#12007)
* Fix megatron_gpt2 attention block's causal mask.

* compatibility with checkpoints created with recent versions of Megatron-LM

* added integration test for the released Megatron-GPT2 model

* code style changes

* added option to megatron conversion script to read from config file

Co-authored-by: Guido Novati <gnovati@nvidia.com>
2021-06-14 04:57:55 -04:00
..
2021-06-09 11:51:13 -04:00
2021-04-23 09:17:37 -04:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-05-12 13:48:15 +05:30
2021-04-23 09:17:37 -04:00
2021-01-27 21:25:11 +03:00
2020-12-07 18:36:34 -05:00
2021-06-09 11:51:13 -04:00
2021-06-09 11:51:13 -04:00
2021-06-09 11:51:13 -04:00
2021-04-26 13:50:34 +02:00
2021-04-26 13:50:34 +02:00
2020-12-09 10:32:43 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-06-01 19:07:37 +01:00
2021-01-27 21:25:11 +03:00
2021-05-05 12:38:01 +02:00
2021-06-01 19:07:37 +01:00
2021-06-09 11:51:13 -04:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-05-12 13:48:15 +05:30
2021-06-01 19:07:37 +01:00
2021-05-12 13:48:15 +05:30
2021-04-26 13:50:34 +02:00
2021-05-03 09:07:29 -04:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-04-26 13:50:34 +02:00
2021-04-21 11:11:20 -04:00