Suraj Patil
860264379f
GPT Neo (#10848)
* let's begin
* boom boom
* fix out proj in attn
* fix attention
* fix local attention
* add tokenizer
* fix imports
* autotokenizer
* fix checkpoint name
* cleanup
* more clean-up
* more cleanup
* output attentions
* fix attn mask creation
* fix imports
* config doc
* add tests
* add slow tests
* quality
* add conversion script
* copyright
* typo
* another bites the dust
* fix attention tests
* doc
* add embed init in convert function
* fix copies
* remove tokenizer
* enable caching
* address review comments
* improve config and create attn layer list internally
* more consistent naming
* init hf config from mesh-tf config json file
* remove neo tokenizer from doc
* handle attention_mask in local attn layer
* attn_layers => attention_layers
* add tokenizer_class in config
* fix docstring
* raise if len of attention_layers is not same as num_layers
* remove tokenizer_class from config
* more consistent naming
* fix doc
* fix checkpoint names
* fp16 compat
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-03-30 09:42:30 -04:00