Files
HuggingFace_transformer/docs/source/en/model_doc
Jason Phang 0041be5b3d LLaMA Implementation (#21955)
* LLaMA

* sharding and docs

* tweak

* black

* inits

* ruff

* LLAMA_PRETRAINED_CONFIG_ARCHIVE_MAP

* init

* no checkpoint

* docs

* ruff

* type_vocab_size

* tokenizer fixes

* tokenizer fixes

* Update tokenization_llama.py

* Update tokenization_llama.py

* Update configuration_llama.py

* Update modeling_llama.py

* tokenizer add_bos by default

* licenses

* remove decoder

* norms and mlp

* rope overhaul

* tweaks

* black

* mention OPT implementation

* off-by-one naming

* typo

* fix

* tokenization fix and slicing bug

* padding config

* cleanup

* black

* update tests

* undo typo

* fix vocab caching logic

* ruff

* docbuilder

* attn fix from BlackSamorez

* initial feedback

* typo

* docs

* llama case

* llama case

* load checkpoint docs

* comment about tokenizer

* tokenizer defaults

* clear past_key_values if use_cache=False

* last tweaks

* last tweaks

* last tweaks

* last tweaks

---------

Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
2023-03-16 09:00:53 -04:00
..
2023-03-16 13:41:48 +03:00
2023-01-04 09:18:57 +01:00
2023-02-27 08:36:36 +01:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2023-02-28 15:42:55 +01:00
2023-01-18 11:24:37 +01:00
2022-04-04 10:25:46 -04:00
2023-01-17 17:18:56 +01:00
2022-11-09 18:31:22 +01:00
2023-03-14 12:08:14 +03:00
2022-04-04 10:25:46 -04:00
2023-01-17 17:18:56 +01:00
2022-11-29 10:38:01 +00:00
2023-02-15 10:35:14 -08:00
2023-02-20 16:37:11 +03:00
2022-11-08 19:54:41 +00:00
2022-04-04 10:25:46 -04:00
2023-01-17 17:18:56 +01:00
2023-01-17 17:18:56 +01:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2023-03-16 09:00:53 -04:00
2023-01-16 20:37:07 +03:00
2023-03-13 10:11:31 +00:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2023-02-15 10:35:14 -08:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2023-02-03 12:43:46 -05:00
2022-12-16 16:24:01 +01:00
2023-03-03 17:57:24 +01:00
2022-04-08 10:57:51 +02:00
2023-02-15 18:10:30 +00:00
2022-06-21 10:24:50 +02:00
2022-11-08 19:54:41 +00:00
2023-01-17 17:18:56 +01:00
2023-01-17 17:18:56 +01:00
2023-02-07 16:43:19 -05:00
2022-04-04 10:25:46 -04:00