Vasudev Gupta
d9c0d08f9a
Flax Big Bird (#11967)
* add flax bert
* bert -> bigbird
* original_full ported
* add debugger
* init block sparse
* fix copies ; gelu_fast -> gelu_new
* block sparse port
* fix block sparse
* block sparse working
* all ckpts working
* fix-copies
* make quality
* init tests
* temporary fix for FlaxBigBirdForMultipleChoice
* skip test_attention_outputs
* fix
* gelu_fast -> gelu_new ; fix multiple choice model
* remove nsp
* fix sequence classifier
* fix
* make quality
* make fix-copies
* finish
* Delete debugger.ipynb
* Update src/transformers/models/big_bird/modeling_flax_big_bird.py
* make style
* finish
* bye bye jit flax tests
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-06-14 20:01:03 +01:00