Files
HuggingFace_transformer/docs/source
Vasudev Gupta d9c0d08f9a Flax Big Bird (#11967)
* add flax bert

* bert -> bigbird

* original_full ported

* add debugger

* init block sparse

* fix copies ; gelu_fast -> gelu_new

* block sparse port

* fix block sparse

* block sparse working

* all ckpts working

* fix-copies

* make quality

* init tests

* temporary fix for FlaxBigBirdForMultipleChoice

* skip test_attention_outputs

* fix

* gelu_fast -> gelu_new ; fix multiple choice model

* remove nsp

* fix sequence classifier

* fix

* make quality

* make fix-copies

* finish

* Delete debugger.ipynb

* Update src/transformers/models/big_bird/modeling_flax_big_bird.py

* make style

* finish

* bye bye jit flax tests

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-06-14 20:01:03 +01:00
..
2021-05-12 17:08:35 +02:00
2021-06-08 12:55:17 -07:00
2021-06-14 20:01:03 +01:00
2021-04-21 11:11:20 -04:00
2021-06-09 11:51:13 -04:00
2021-06-14 20:01:03 +01:00
2021-05-24 14:26:02 -04:00
2021-04-21 11:11:20 -04:00
2021-04-21 11:11:20 -04:00
2020-04-06 14:32:39 -04:00
2021-04-01 11:58:37 -06:00
2021-04-21 11:11:20 -04:00
2020-12-07 18:36:34 -05:00
2021-05-03 13:18:46 -04:00