Add SmolLM3 (#38755)

* init smollm3

* integration tests

* config quirks

* docs stub

* rests round 2

* tests round 3

* tests round 4

* bring SWA back

* config checker pls

* final checkpoint

* style and copies

* Update src/transformers/models/smollm3/modular_smollm3.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/smollm3/modular_smollm3.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
This commit is contained in:
Anton Lozhkov
2025-06-25 17:12:15 +02:00
committed by GitHub
parent 3233e9b7c3
commit dad0e87c79
12 changed files with 1879 additions and 0 deletions

View File

@@ -272,6 +272,7 @@ SPECIAL_CASES_TO_ALLOW = {
"attention_chunk_size",
],
"Llama4VisionConfig": ["multi_modal_projector_bias", "norm_eps"],
"SmolLM3Config": ["no_rope_layer_interval"],
}