Yaswanth Gali
fbdaa7b099
Add Aimv2 model (#36625)
* Model skelton
* changes
* temp push
* changes
* Added support for aimv2-native
* More changes
* More changes
* Stupid mistake correction
* Added config and refactor
* Added vison model
* update
* Refactor for lit variant
* Added Text Model
* Minor fixes
* nits
* update
* Preliminary tests
* More fixes
* Updated tests 🤗
* Refactor
* Updated testcase
* Updated config
* make fixup
* more fixes
* Bug fix and updates
* deadcode
* Fixes
* nit
* up
* Happy CI ✅
* Reduce LOC
* nit
* nit
* make style
* return_dict refactor
* bug fix
* fix
* doc update
* nit
* make fixup
* Minor update
* _init_weigths modifcation
* update tests
* Minor fixes post review
* Update w.r.t GradientCheckpointingLayer
* docs update
* update
* nit
* Use more Modular 😉
* Change name from AIMv2 to Aimv2
* Nit
* make style
* Add model doc pointer
* make style
* Update model doc section
* updates
* Modify attn mask and interface
* update test
* Final change
* Utilize flash and flex attn
* keep attn mask
* camelcase model name in test file
* Fix docstring
* Fix config warning finally and create_causal_mask
* disable torchscript
* remove unused arg
* remove from tests
* balance model size for tests
* fix device
* tests
* tests
* flaky test
* fix import
---------
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
2025-07-08 11:53:21 +02:00
..
2025-06-17 19:37:18 +01:00
2025-06-25 17:29:10 +00:00
2025-07-08 11:53:21 +02:00
2025-06-17 19:37:18 +01:00
2025-06-13 11:07:09 +00:00
2024-11-04 09:40:30 -08:00
2025-07-08 10:20:52 +02:00
2025-06-25 17:29:10 +00:00
2025-07-07 09:12:55 -07:00
2025-06-13 11:07:09 +00:00
2025-06-17 19:37:18 +01:00
2024-12-17 09:32:00 -08:00
2023-11-08 08:35:20 -05:00
2025-06-17 19:37:18 +01:00
2024-04-08 14:21:16 +01:00