Jason Phang
0041be5b3d
LLaMA Implementation (#21955)
* LLaMA
* sharding and docs
* tweak
* black
* inits
* ruff
* LLAMA_PRETRAINED_CONFIG_ARCHIVE_MAP
* init
* no checkpoint
* docs
* ruff
* type_vocab_size
* tokenizer fixes
* tokenizer fixes
* Update tokenization_llama.py
* Update tokenization_llama.py
* Update configuration_llama.py
* Update modeling_llama.py
* tokenizer add_bos by default
* licenses
* remove decoder
* norms and mlp
* rope overhaul
* tweaks
* black
* mention OPT implementation
* off-by-one naming
* typo
* fix
* tokenization fix and slicing bug
* padding config
* cleanup
* black
* update tests
* undo typo
* fix vocab caching logic
* ruff
* docbuilder
* attn fix from BlackSamorez
* initial feedback
* typo
* docs
* llama case
* llama case
* load checkpoint docs
* comment about tokenizer
* tokenizer defaults
* clear past_key_values if use_cache=False
* last tweaks
* last tweaks
* last tweaks
* last tweaks
---------
Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
2023-03-16 09:00:53 -04:00
..
2023-03-07 21:36:38 +01:00
2023-03-13 12:52:42 -07:00
2023-03-16 09:00:53 -04:00
2023-03-16 09:00:53 -04:00
2022-04-04 10:25:46 -04:00
2023-03-16 09:00:53 -04:00
2022-08-08 10:53:52 -05:00
2023-02-09 15:46:26 -05:00
2022-12-12 09:23:23 -05:00
2022-11-07 09:19:04 -05:00
2023-02-15 10:35:14 -08:00
2022-11-30 14:50:55 +00:00
2022-04-04 10:25:46 -04:00
2023-03-08 07:54:30 -05:00
2022-11-07 09:19:04 -05:00
2023-02-27 08:36:36 +01:00
2022-04-04 10:25:46 -04:00
2022-09-30 20:30:38 +01:00
2023-01-25 11:33:39 -05:00
2022-11-07 13:57:39 -05:00
2022-12-12 10:38:56 -05:00
2022-04-04 10:25:46 -04:00
2023-01-17 12:23:22 -05:00
2023-03-13 19:09:37 -04:00
2022-10-11 22:54:19 -04:00
2023-03-16 09:00:53 -04:00
2023-03-09 07:44:58 -05:00
2022-04-04 10:25:46 -04:00
2023-02-06 09:26:22 -05:00
2023-02-15 10:35:14 -08:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-05-16 13:36:41 +02:00
2022-12-01 16:52:13 +01:00
2022-12-01 16:52:13 +01:00
2023-02-07 15:09:27 +01:00
2022-06-23 14:51:54 +02:00
2022-11-28 08:54:35 -05:00
2023-02-06 09:20:43 -05:00
2022-11-30 10:41:18 -05:00
2022-12-08 11:18:52 -05:00
2022-11-30 10:41:18 -05:00
2023-02-07 15:19:19 +00:00
2022-11-30 10:41:18 -05:00
2022-11-30 10:41:18 -05:00
2022-09-06 15:20:12 -04:00
2022-11-30 14:50:55 +00:00
2023-03-14 09:13:16 +00:00
2023-02-06 18:10:56 -05:00
2023-02-07 12:27:31 -05:00
2023-01-19 08:43:36 -05:00
2023-02-23 09:41:28 +01:00
2022-08-06 09:38:55 +02:00
2022-04-04 10:25:46 -04:00
2023-02-20 16:37:11 +03:00
2023-02-13 10:09:00 -05:00
2023-02-02 11:41:27 -08:00
2023-02-27 12:33:34 +01:00
2023-01-18 17:53:05 +00:00
2022-11-15 01:18:13 +01:00
2022-10-03 13:18:29 -07:00
2023-01-20 07:38:15 -08:00
2023-03-01 08:26:25 -05:00