Add new model (#32615)
* v1 - working version
* fix
* fix
* fix
* fix
* rename to correct name
* fix title
* fixup
* rename files
* fix
* add copied from on tests
* rename to `FalconMamba` everywhere and fix bugs
* fix quantization + accelerate
* fix copies
* add `torch.compile` support
* fix tests
* fix tests and add slow tests
* copies on config
* merge the latest changes
* fix tests
* add few lines about instruct
* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix
* fix tests

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
@@ -50,6 +50,8 @@ SPECIAL_CASES_TO_ALLOW = {
     "RecurrentGemmaConfig": ["block_types"],
     # used as in the config to define `intermediate_size`
     "MambaConfig": ["expand"],
+    # used as in the config to define `intermediate_size`
+    "FalconMambaConfig": ["expand"],
     # used as `self.bert_model = BertModel(config, ...)`
     "DPRConfig": True,
     "FuyuConfig": True,
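The hunk above adds `expand` to the allowlist because, per its comment, that attribute is only used inside the config to derive `intermediate_size`, so a check for config attributes that modeling code never reads would otherwise flag it. The following is a minimal sketch of that idea, not the repository's actual checker: `ToyMambaConfig`, `MODELING_SOURCE`, `ALLOWED_UNUSED`, and `unused_attributes` are all hypothetical names, and the formula `int(expand * hidden_size)` is an assumption for illustration.

```python
import inspect

# Hypothetical allowlist, shaped like SPECIAL_CASES_TO_ALLOW in the diff.
ALLOWED_UNUSED = {"ToyMambaConfig": ["expand"]}


class ToyMambaConfig:
    """Illustrative stand-in, not the real MambaConfig or FalconMambaConfig."""

    def __init__(self, hidden_size=768, expand=2):
        self.hidden_size = hidden_size
        self.expand = expand
        # `expand` is consumed here (assumed formula), so modeling code
        # only ever needs to read `intermediate_size`.
        self.intermediate_size = int(expand * hidden_size)


# Hypothetical modeling source: reads `intermediate_size`, never `expand`.
MODELING_SOURCE = """
def build_mixer(config):
    return (config.hidden_size, config.intermediate_size)
"""


def unused_attributes(config_cls, modeling_source, allowed):
    """List __init__ parameters never referenced as `config.<name>` in the
    modeling source, skipping any that are allowlisted for this class."""
    params = [
        name
        for name in inspect.signature(config_cls.__init__).parameters
        if name != "self"
    ]
    allow = allowed.get(config_cls.__name__, [])
    return [
        name
        for name in params
        if f"config.{name}" not in modeling_source and name not in allow
    ]
```

Without the allowlist entry, `unused_attributes(ToyMambaConfig, MODELING_SOURCE, {})` reports `expand`. With `ALLOWED_UNUSED` the result is empty, which is the effect the two added lines are meant to have for `FalconMambaConfig`.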