Files
HuggingFace_transformer/docs/source/model_doc
Vasudev Gupta d9c0d08f9a Flax Big Bird (#11967)
* add flax bert

* bert -> bigbird

* original_full ported

* add debugger

* init block sparse

* fix copies ; gelu_fast -> gelu_new

* block sparse port

* fix block sparse

* block sparse working

* all ckpts working

* fix-copies

* make quality

* init tests

* temporary fix for FlaxBigBirdForMultipleChoice

* skip test_attention_outputs

* fix

* gelu_fast -> gelu_new ; fix multiple choice model

* remove nsp

* fix sequence classifier

* fix

* make quality

* make fix-copies

* finish

* Delete debugger.ipynb

* Update src/transformers/models/big_bird/modeling_flax_big_bird.py

* make style

* finish

* bye bye jit flax tests

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-06-14 20:01:03 +01:00
..
2021-05-18 22:50:51 +01:00
2021-06-14 15:16:08 +05:30
2021-04-21 11:11:20 -04:00
2021-04-21 09:47:27 -04:00
2021-06-14 20:01:03 +01:00
2021-04-21 09:47:27 -04:00
2021-06-01 19:07:37 +01:00
2021-06-01 09:44:31 +05:30
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-06-09 11:51:13 -04:00
2021-01-27 21:25:11 +03:00
2021-04-21 11:11:20 -04:00
2021-04-21 09:47:27 -04:00
2021-05-04 20:56:09 +02:00
2020-12-07 18:36:34 -05:00
2021-04-21 09:47:27 -04:00
2021-05-18 22:50:51 +01:00
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-05-03 09:07:29 -04:00
2020-12-10 09:29:38 -05:00
2021-04-21 09:47:27 -04:00
2020-12-07 18:36:34 -05:00
2021-04-21 09:47:27 -04:00
2021-04-21 11:11:20 -04:00
2021-06-02 18:13:08 +05:30
2021-06-10 21:17:13 +05:30
2021-06-14 18:58:54 +01:00
2021-04-21 09:47:27 -04:00
2020-12-07 18:36:34 -05:00
2021-04-21 11:11:20 -04:00