Jason Phang
0041be5b3d
LLaMA Implementation (#21955)
* LLaMA
* sharding and docs
* tweak
* black
* inits
* ruff
* LLAMA_PRETRAINED_CONFIG_ARCHIVE_MAP
* init
* no checkpoint
* docs
* ruff
* type_vocab_size
* tokenizer fixes
* tokenizer fixes
* Update tokenization_llama.py
* Update tokenization_llama.py
* Update configuration_llama.py
* Update modeling_llama.py
* tokenizer add_bos by default
* licenses
* remove decoder
* norms and mlp
* rope overhaul
* tweaks
* black
* mention OPT implementation
* off-by-one naming
* typo
* fix
* tokenization fix and slicing bug
* padding config
* cleanup
* black
* update tests
* undo typo
* fix vocab caching logic
* ruff
* docbuilder
* attn fix from BlackSamorez
* initial feedback
* typo
* docs
* llama case
* llama case
* load checkpoint docs
* comment about tokenizer
* tokenizer defaults
* clear past_key_values if use_cache=False
* last tweaks
* last tweaks
* last tweaks
* last tweaks
---------
Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
2023-03-16 09:00:53 -04:00
..
2023-02-28 19:40:57 +01:00
2023-03-03 14:47:09 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-09 09:23:48 -05:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-06 18:10:56 -05:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-06 18:10:56 -05:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-02 18:20:34 +01:00
2023-03-13 15:03:21 +01:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00
2023-03-15 20:54:38 +01:00
2023-02-06 18:10:56 -05:00
2023-02-10 10:58:29 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-03 18:42:18 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-07 14:23:36 +01:00
2023-03-01 11:11:04 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 12:08:14 +03:00
2022-05-03 14:42:02 +02:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-07 15:19:39 -05:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-20 16:37:11 +03:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-10 22:08:21 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2022-12-12 13:12:13 -05:00
2023-03-14 10:03:02 +01:00
2023-02-22 11:09:04 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-07 21:36:38 +01:00
2023-03-07 04:20:14 +01:00
2023-03-14 10:03:02 +01:00
2023-03-14 10:03:02 +01:00
2023-03-14 10:03:02 +01:00
2023-02-22 09:14:54 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-03-16 09:00:53 -04:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2022-07-29 08:09:09 -04:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00
2023-03-16 11:48:52 +00:00
2022-05-12 16:25:55 -04:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-28 19:40:57 +01:00
2023-03-07 22:31:14 +01:00
2023-03-14 10:03:02 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-09 14:46:15 +00:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-07 22:31:14 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-07 14:23:36 +01:00
2023-03-07 14:23:36 +01:00
2023-03-07 14:23:36 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-02-06 18:10:56 -05:00
2023-03-02 20:43:35 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-14 10:03:02 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 18:41:34 +00:00
2023-03-01 18:00:48 +00:00
2023-02-28 19:40:57 +01:00
2023-03-15 18:37:23 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-01 10:49:21 +01:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00
2023-02-22 09:14:54 +01:00
2023-02-28 19:40:57 +01:00
2023-03-13 19:46:01 +01:00
2023-02-28 19:40:57 +01:00
2023-03-06 18:07:31 +01:00
2023-03-14 10:03:02 +01:00
2023-03-06 09:15:44 +01:00
2023-02-09 21:49:54 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-03-07 07:34:04 -05:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00