Cyril Vallez
|
4ded9a4113
|
🚨🚨 Fix and simplify attention implementation dispatch and subconfigs handling (#39423)
* first try
* Update modeling_utils.py
* Update modeling_utils.py
* big refactor
* Update modeling_utils.py
* style
* docstrings and simplify inner workings of configs
* remove all trace of _internal
* Update modeling_utils.py
* fix logic error
* Update modeling_utils.py
* recursive on config
* Update configuration_utils.py
* fix
* Update configuration_dpt.py
* Update configuration_utils.py
* Update configuration_utils.py
* Update modeling_idefics.py
* Update modeling_utils.py
* fix for old models
* more old models fixup
* Update modeling_utils.py
* Update configuration_utils.py
* Remove outdated test
* remove the deepcopy!! 🥵🥵
* Update test_modeling_gpt_bigcode.py
* fix qwen dispatch
* restrict to only models supporting it
* style
* switch name
* Update modeling_utils.py
* Update modeling_utils.py
* add tests!
* fix
* rypo
* remove bad copies
* fix
* Update modeling_utils.py
* additional check
* Update modeling_utils.py
* Update modeling_utils.py
* Update modeling_utils.py
* Update modeling_utils.py
* Update modeling_utils.py
* fix
* skip
|
2025-07-18 13:41:54 +02:00 |
|
Sylvain Gugger
|
d1fcc90abf
|
Fix from_pretrained with default base_model_prefix (#15814)
|
2022-02-24 11:43:51 +01:00 |
|
Sylvain Gugger
|
1f60bc46f3
|
Make sure custom configs work with Transformers (#15569)
* Make sure custom configs work with Transformers
* Apply code review suggestions
|
2022-02-09 10:04:44 -05:00 |
|
Sylvain Gugger
|
44b21f117b
|
Save code of registered custom models (#15379)
* Allow dynamic modules to use relative imports
* Work for configs
* Fix last merge conflict
* Save code of registered custom objects
* Map strings to strings
* Fix test
* Add tokenizer
* Rework tests
* Tests
* Ignore fixtures py files for tests
* Tokenizer test + fix collection
* With full path
* Rework integration
* Fix typo
* Remove changes in conftest
* Test for tokenizers
* Add documentation
* Update docs/source/custom_models.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Add file structure and file content
* Add more doc
* Style
* Update docs/source/custom_models.mdx
Co-authored-by: Suraj Patil <surajp815@gmail.com>
* Address review comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
|
2022-02-02 10:44:37 -05:00 |
|