Mayank Mishra
c35d2ccf5a
Granite language models (#31502)
* first commit
* drop tokenizer
* drop tokenizer
* drop tokenizer
* drop convert
* granite
* drop tokenization test
* mup
* fix
* reformat
* reformat
* reformat
* fix docs
* stop checking for checkpoint
* update support
* attention multiplier
* update model
* tiny drop
* saibo drop
* skip test
* fix test
* fix test
* drop
* drop useless imports
* update docs
* drop flash function
* copied from
* drop pretraining tp
* drop pretraining tp
* drop pretraining tp
* drop unused import
* drop code path
* change name
* softmax scale
* head dim
* drop legacy cache
* rename params
* cleanup
* fix copies
* comments
* add back legacy cache
* multipliers
* multipliers
* multipliers
* text fix
* fix copies
* merge
* multipliers
* attention multiplier
* drop unused imports
* fix
* fix
* fix
* move rope?
* Update src/transformers/models/granite/configuration_granite.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* Update src/transformers/models/granite/modeling_granite.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* fix
* fix
* fix
* fix-copies
* torch rmsnorm
* add authors
* change model path
* fix
* test
* drop static cache test
* uupdate readme
* drop non-causal
* readme
* drop useless imports
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-08-27 21:27:21 +02:00
..
2022-11-08 19:54:41 +00:00
2021-02-15 07:55:10 -05:00
2024-05-22 06:40:15 +02:00
2023-05-18 14:14:43 -04:00
2024-08-26 15:16:44 +02:00
2024-08-27 21:27:21 +02:00
2024-06-25 13:55:11 +02:00
2024-05-22 06:40:15 +02:00
2024-08-19 09:28:13 +01:00
2024-05-22 06:40:15 +02:00
2024-05-22 06:40:15 +02:00
2024-05-15 14:28:20 +02:00
2023-03-13 19:11:19 +01:00
2024-08-27 18:50:27 +01:00
2023-06-06 18:17:41 +02:00
2024-05-22 06:40:15 +02:00
2024-08-08 15:47:24 +02:00
2021-02-15 07:55:10 -05:00
2024-08-27 11:58:27 +01:00
2024-05-22 06:40:15 +02:00
2024-06-03 09:42:53 +01:00
2024-07-08 11:52:47 +01:00
2024-05-22 06:40:15 +02:00
2024-04-15 15:08:09 +02:00
2024-01-31 15:58:17 +01:00
2023-02-28 17:12:44 +01:00
2023-02-03 12:57:02 -05:00
2024-05-07 16:58:50 +02:00
2024-08-27 11:58:27 +01:00
2024-04-12 10:01:28 +02:00
2024-05-22 06:40:15 +02:00
2024-06-07 17:50:18 +01:00
2024-04-15 13:20:36 +02:00
2024-06-20 18:57:24 +02:00
2024-07-29 09:42:54 +02:00
2023-03-30 21:06:35 +02:00
2024-05-31 08:49:33 +02:00
2024-05-03 18:19:30 +02:00
2022-06-02 10:24:16 +02:00
2024-05-22 06:40:15 +02:00
2024-06-10 15:16:58 +02:00
2024-05-09 22:57:52 +02:00
2024-05-22 06:40:15 +02:00
2024-04-15 13:20:36 +02:00
2024-04-24 22:32:42 +02:00
2024-04-16 21:25:06 +02:00
2024-05-22 06:40:15 +02:00
2024-07-22 14:14:47 +01:00