🔴 [VLM] modeling updates (#38317)
* updates * fixup * fix tests * fix test * fix * let it be here for now, till monday * two more fixes * persimmon * fixup * fix * fixup * make sure fuyu runs now that LM has new attn API * fixup + tests * qwen vl uses new mask interface as well * qwen image features format * update * remove image_sizes * address comments * i am dumb...
This commit is contained in:
committed by
GitHub
parent
a6f7acb603
commit
ad9dd3d17b
@@ -46,7 +46,10 @@ SPECIAL_CASES_TO_ALLOW = {
|
||||
],
|
||||
"Qwen2Config": ["use_sliding_window", "max_window_layers"],
|
||||
"Qwen2MoeConfig": ["use_sliding_window"],
|
||||
"Qwen2VLConfig": ["use_sliding_window"],
|
||||
"Qwen2VLTextConfig": ["use_sliding_window", "max_window_layers"],
|
||||
"Qwen2_5_VLTextConfig": ["use_sliding_window", "max_window_layers"],
|
||||
"Qwen2_5OmniTextConfig": ["use_sliding_window", "max_window_layers"],
|
||||
"Qwen2_5OmniTalkerConfig": ["use_sliding_window", "max_window_layers"],
|
||||
"Qwen3Config": ["max_window_layers", "use_sliding_window"], # now use `layer_types` instead
|
||||
"Qwen3MoeConfig": ["max_window_layers", "use_sliding_window"],
|
||||
# `cache_implementation` should be in the default generation config, but we don't yet support per-model
|
||||
|
||||
Reference in New Issue
Block a user