Raushan Turganbay
bc161d5d06
Delete deprecated stuff ( #38838 )
...
* delete deprecated stuff
* fix copies
* remove unused tests
* fix modernbert and fuyu
* Update src/transformers/cache_utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com >
* bye bye `seen_tokens`
* address comments
* update typings
* ecnoder decoder models follow same pattern as whisper
* fix copies
* why is it set to False?
* fix switch transformers
* fix encoder decoder models shared weight
* fix copies and RAG
* remove `next_cache`
* fix gptj/git
* fix copies
* fix copies
* style...
* another forgotten docsrting
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com >
2025-07-10 05:18:44 +00:00
Xavier Dupré
f16fbfb89a
Make _compute_dynamic_ntk_parameters exportable ( #39171 )
...
* Make _compute_dynamic_ntk_parameters exportable
* add unit test
2025-07-07 14:48:31 +02:00
cyyever
1e6b546ea6
Use Python 3.9 syntax in tests ( #37343 )
...
Signed-off-by: cyy <cyyever@outlook.com >
2025-04-08 14:12:08 +02:00
Afanti
7f5077e536
fix typos in the tests directory ( #36717 )
2025-03-17 17:45:57 +00:00
Cyril Vallez
965a2fb320
More model refactoring! ( #35359 )
...
* cohere
* style
* phi3
* style
* small fix
* small fix
* phi3 longrope
* oups
* Update rope (only for phi3 still)
* Update test_modeling_rope_utils.py
* Update modeling_phi3.py
* fix
* fix copies
* style
* Fix copied from bad renaming
2025-01-09 11:09:09 +01:00
Raushan Turganbay
061c2c4c38
Ignore keys on validate_rope ( #33753 )
...
* ignore keys on check rope
* add tests
* fix tests, so maybe better leave at logger lvl
2024-10-04 12:39:37 +02:00
Alex Sherstinsky
122ded0a11
Bugfix/alexsherstinsky/fix none check for attention factor in rope scaling 2024 08 28 0 ( #33188 )
...
* Fixing a bug in the way "attention_factor" is validated in ROPE utilities.
* Fixing a bug in the way "attention_factor" is validated in ROPE utilities.
* Fixing a bug in the way "attention_factor" is validated in ROPE utilities.
2024-09-04 17:01:12 +02:00
Joao Gante
083e13b7c4
RoPE: Add numerical tests ✨ ( #32380 )
...
tests! :D
2024-08-02 09:39:45 +01:00
Joao Gante
2e113422b3
Llama: RoPE refactor ( #32135 )
...
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com >
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
2024-07-23 10:42:55 +01:00