Vasudev Gupta
6dfd027279
BigBird (#10183)
* init bigbird
* model.__init__ working, conversion script ready, config updated
* add conversion script
* BigBirdEmbeddings working :)
* slightly update conversion script
* BigBirdAttention working :) ; some bug in layer.output.dense
* add debugger-notebook
* forward() working for BigBirdModel :) ; replaced gelu with gelu_fast
* tf code adapted to torch till rand_attn in bigbird_block_sparse_attention ; till now everything working :)
* BigBirdModel working in block-sparse attention mode :)
* add BigBirdForPreTraining
* small fix
* add tokenizer for BigBirdModel
* fix config & hence modeling
* fix base prefix
* init testing
* init tokenizer test
* pos_embed must be absolute, attn_type=original_full when add_cross_attn=True , nsp loss is optional in BigBirdForPreTraining, add assert statements
* remove position_embedding_type arg
* complete normal tests
* add comments to block sparse attention
* add attn_probs for sliding & global tokens
* create fn for block sparse attn mask creation
* add special tests
* restore pos embed arg
* minor fix
* attn probs update
* make big bird fully gpu friendly
* fix tests
* remove pruning
* correct tokenzier & minor fixes
* update conversion script , remove norm_type
* tokenizer-inference test add
* remove extra comments
* add docs
* save intermediate
* finish trivia_qa conversion
* small update to forward
* correct qa and layer
* better error message
* BigBird QA ready
* fix rebased
* add triva-qa debugger notebook
* qa setup
* fixed till embeddings
* some issue in q/k/v_layer
* fix bug in conversion-script
* fixed till self-attn
* qa fixed except layer norm
* add qa end2end test
* fix gradient ckpting ; other qa test
* speed-up big bird a bit
* hub_id=google
* clean up
* make quality
* speed up einsum with bmm
* finish perf improvements for big bird
* remove wav2vec2 tok
* fix tokenizer
* include docs
* correct docs
* add helper to auto pad block size
* make style
* remove fast tokenizer for now
* fix some
* add pad test
* finish
* fix some bugs
* fix another bug
* fix buffer tokens
* fix comment and merge from master
* add comments
* make style
* commit some suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix typos
* fix some more suggestions
* add another patch
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fix copies
* another path
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* update
* update nit suggestions
* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-03-30 08:51:34 +03:00
..
2021-02-13 08:24:53 -05:00
2020-01-06 15:11:12 +01:00
2021-03-16 08:05:37 +03:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-11-16 21:43:42 -05:00
2020-10-22 15:39:01 +02:00
2020-12-07 18:36:34 -05:00
2020-11-02 08:47:37 -05:00
2021-03-09 12:16:59 +03:00
2021-03-10 21:42:04 +05:30
2021-03-11 12:56:12 -05:00
2021-03-16 15:37:52 -04:00
2021-02-26 12:35:36 +03:00
2021-01-06 17:11:42 +01:00
2021-03-23 01:00:05 +03:00
2021-03-12 10:11:50 +01:00
2021-03-23 01:00:05 +03:00
2021-02-12 21:46:17 +01:00
2021-03-15 09:27:55 -04:00
2021-03-26 11:23:56 -04:00
2020-12-07 18:36:34 -05:00
2020-01-06 15:11:12 +01:00
2020-09-11 12:01:33 -04:00
2021-03-11 13:43:53 -05:00
2021-01-12 10:03:50 -05:00
2021-02-08 13:25:09 +03:00
2021-02-08 08:22:19 -05:00
2021-03-11 13:43:53 -05:00
2021-03-30 08:51:34 +03:00
2021-02-04 11:56:12 +03:00
2021-02-04 11:41:34 -05:00
2021-01-27 21:25:11 +03:00
2020-12-07 18:36:34 -05:00
2021-03-18 12:51:42 -04:00
2021-03-11 13:43:53 -05:00
2020-12-01 09:49:27 +01:00
2021-03-11 13:43:53 -05:00
2021-03-11 13:43:53 -05:00
2021-03-11 13:43:53 -05:00
2021-02-08 04:43:25 -05:00
2021-03-11 13:43:53 -05:00
2021-02-04 11:56:12 +03:00
2021-02-08 04:36:50 -05:00
2020-12-16 13:03:32 +01:00
2021-03-18 09:44:17 +03:00
2020-12-16 13:03:32 +01:00
2021-03-05 12:10:19 -05:00
2020-11-16 11:43:00 -05:00
2021-03-15 09:10:44 -04:00
2021-02-25 10:06:42 -05:00
2021-01-13 09:55:48 -05:00
2021-02-02 11:06:52 -08:00
2021-02-02 11:06:52 -08:00
2021-01-28 10:09:13 -05:00
2021-03-09 17:35:07 +03:00
2021-03-11 12:58:15 -05:00
2021-03-11 13:43:53 -05:00
2020-12-08 12:04:34 -05:00
2020-12-09 10:32:43 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-02-04 11:56:12 +03:00
2021-03-17 16:15:14 +03:00
2021-03-15 10:07:12 +03:00
2021-01-06 17:11:42 +01:00
2020-12-23 23:01:32 +05:30
2021-03-10 21:42:04 +05:30
2021-03-11 13:43:53 -05:00
2021-03-05 12:10:19 -05:00
2021-01-11 08:02:41 -05:00
2021-02-15 17:18:33 +01:00
2020-12-08 09:14:09 -05:00
2021-02-17 17:48:56 +01:00
2021-02-15 07:55:10 -05:00
2021-02-17 17:48:56 +01:00
2021-03-12 14:16:40 +03:00
2021-01-27 21:25:11 +03:00
2020-12-07 18:36:34 -05:00
2021-03-15 17:28:01 -04:00
2021-02-24 14:55:34 +03:00
2021-02-17 18:54:15 +01:00
2021-02-15 07:55:10 -05:00
2021-02-15 07:55:10 -05:00
2021-02-15 07:55:10 -05:00
2021-02-17 18:02:48 +01:00
2021-02-18 06:29:43 -05:00
2021-02-18 09:36:01 +01:00
2021-03-25 12:32:38 -04:00
2021-02-22 15:41:56 +01:00
2021-02-22 15:41:56 +01:00
2021-02-19 06:54:14 -05:00
2021-03-12 14:16:40 +03:00
2021-02-22 12:36:16 -05:00
2021-02-19 06:55:25 -05:00
2021-02-19 06:56:41 -05:00
2021-03-12 09:09:46 +03:00
2021-02-19 09:33:25 -05:00
2021-03-11 13:35:50 -05:00
2020-12-08 09:14:09 -05:00
2021-03-15 10:07:12 +03:00
2021-02-15 07:55:10 -05:00
2021-02-19 06:57:16 -05:00
2021-02-19 06:58:07 -05:00
2020-12-07 18:36:34 -05:00
2021-02-17 18:02:48 +01:00
2021-02-15 07:55:10 -05:00
2021-01-06 17:11:42 +01:00
2021-03-17 18:10:17 +03:00
2020-10-30 10:25:48 -04:00
2020-12-07 18:36:34 -05:00
2021-01-06 17:11:42 +01:00
2021-01-06 17:11:42 +01:00
2021-03-08 08:52:20 -08:00
2021-03-11 13:38:29 -05:00
2021-01-05 03:58:26 -05:00
2020-12-07 18:36:34 -05:00
2021-02-25 17:42:46 +03:00
2021-03-05 09:24:14 -05:00
2020-12-07 18:36:34 -05:00
2021-01-26 10:06:28 +01:00
2021-02-15 06:22:45 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-03-05 12:10:19 -05:00
2021-01-27 04:10:53 -05:00
2020-12-07 18:36:34 -05:00
2021-01-29 10:27:32 +01:00
2020-12-07 18:36:34 -05:00
2021-03-15 16:02:46 -06:00
2021-03-10 21:42:04 +05:30
2021-02-25 17:42:46 +03:00
2020-12-07 18:36:34 -05:00
2021-03-10 21:42:04 +05:30
2020-10-20 04:39:13 -04:00
2021-02-02 04:39:33 -05:00
2021-03-18 22:13:45 -04:00
2021-02-22 12:36:16 -05:00
2021-03-08 21:44:07 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-11-16 21:43:42 -05:00
2021-03-30 08:51:34 +03:00
2021-01-05 22:00:05 +01:00
2021-02-13 08:24:53 -05:00
2021-03-16 08:05:37 +03:00
2021-01-12 18:19:38 -05:00
2021-02-19 18:34:44 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-10-18 20:51:24 +02:00
2020-12-07 18:36:34 -05:00
2020-11-16 21:43:42 -05:00
2021-02-02 10:35:27 +03:00
2020-11-16 21:43:42 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-03-06 22:14:16 +05:30
2021-03-16 08:05:37 +03:00
2021-02-25 11:42:25 -05:00
2021-02-25 11:42:25 -05:00
2020-12-21 15:41:34 +01:00
2021-02-02 10:35:27 +03:00
2021-02-22 12:36:16 -05:00
2020-11-16 21:43:42 -05:00
2021-02-22 12:36:16 -05:00
2020-12-07 18:36:34 -05:00
2021-02-02 10:35:27 +03:00
2020-12-07 18:36:34 -05:00
2021-01-05 22:00:05 +01:00
2021-03-10 21:42:04 +05:30
2020-10-18 20:51:24 +02:00
2021-03-16 08:05:37 +03:00
2021-01-13 09:55:48 -05:00
2021-01-12 18:19:38 -05:00
2020-12-07 18:36:34 -05:00
2021-03-11 12:56:12 -05:00
2020-11-16 21:43:42 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-03-09 16:25:32 -05:00
2021-03-23 19:03:06 -04:00
2021-02-08 16:04:28 +03:00
2020-12-07 18:36:34 -05:00
2021-03-17 19:18:11 -04:00
2021-03-17 11:16:37 -07:00
2021-03-17 18:11:20 -04:00
2020-12-07 18:36:34 -05:00