Suraj Patil
d25e25ee2b
Add XGLM models (#14876)
* add xglm
* update vocab size
* fix model name
* style and tokenizer
* typo
* no mask token
* fix pos embed compute
* fix args
* fix tokenizer
* fix positions
* fix tokenization
* style and dic fixes
* fix imports
* add fast tokenizer
* update names
* add pt tests
* fix tokenizer
* fix typo
* fix tokenizer import
* fix fast tokenizer
* fix tokenizer
* fix converter
* add tokenizer test
* update checkpoint names
* fix tokenizer tests
* fix slow tests
* add copied from comments
* rst -> mdx
* flax model
* update flax tests
* quality
* style
* doc
* update index and readme
* fix copies
* fix doc
* update toctree
* fix indent
* minor fixes
* fix config doc
* don't save embed_pos weights
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* address Sylvain's comments, a few doc fixes
* fix check_repo
* align order of arguments
* fix copies
* fix labels
* remove unnecessary mapping
* fix saving tokenizer
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-01-28 18:55:23 +01:00