Sanchit Gandhi
cd9274d010
[FlaxBert] Add ForCausalLM (#16995)
* [FlaxBert] Add ForCausalLM
* make style
* fix output attentions
* Add RobertaForCausalLM
* remove comment
* fix fx-to-pt model loading
* remove comment
* add modeling tests
* add enc-dec model tests
* add big_bird
* add electra
* make style
* make repo-consitency
* add to docs
* remove roberta test
* quality
* amend cookiecutter
* fix attention_mask bug in flax bert model tester
* tighten pt-fx thresholds to 1e-5
* add 'copied from' statements
* amend 'copied from' statements
* amend 'copied from' statements
* quality
2022-05-03 11:26:19 +02:00
..
2022-04-12 12:38:50 -04:00
2022-04-26 16:56:08 -04:00
2022-05-03 11:26:19 +02:00
2022-04-08 15:55:42 -05:00
2022-04-04 10:25:46 -04:00
2022-05-02 18:30:55 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-27 16:46:55 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-05-02 18:30:55 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-20 11:01:24 -07:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-30 10:40:46 +02:00
2022-04-08 15:55:42 -05:00
2022-04-25 16:26:59 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-26 16:51:15 +02:00
2022-04-11 14:59:35 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00