Yaswanth Gali
fbdaa7b099
Add Aimv2 model (#36625)
* Model skelton
* changes
* temp push
* changes
* Added support for aimv2-native
* More changes
* More changes
* Stupid mistake correction
* Added config and refactor
* Added vison model
* update
* Refactor for lit variant
* Added Text Model
* Minor fixes
* nits
* update
* Preliminary tests
* More fixes
* Updated tests 🤗
* Refactor
* Updated testcase
* Updated config
* make fixup
* more fixes
* Bug fix and updates
* deadcode
* Fixes
* nit
* up
* Happy CI ✅
* Reduce LOC
* nit
* nit
* make style
* return_dict refactor
* bug fix
* fix
* doc update
* nit
* make fixup
* Minor update
* _init_weigths modifcation
* update tests
* Minor fixes post review
* Update w.r.t GradientCheckpointingLayer
* docs update
* update
* nit
* Use more Modular 😉
* Change name from AIMv2 to Aimv2
* Nit
* make style
* Add model doc pointer
* make style
* Update model doc section
* updates
* Modify attn mask and interface
* update test
* Final change
* Utilize flash and flex attn
* keep attn mask
* camelcase model name in test file
* Fix docstring
* Fix config warning finally and create_causal_mask
* disable torchscript
* remove unused arg
* remove from tests
* balance model size for tests
* fix device
* tests
* tests
* flaky test
* fix import
---------
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
2025-07-08 11:53:21 +02:00
..
2025-04-08 14:12:08 +02:00
2025-07-02 22:06:47 +02:00
2025-06-25 14:31:20 +00:00
2025-04-28 14:20:45 +01:00
2025-06-26 16:25:00 +01:00
2025-06-23 10:56:51 +02:00
2025-07-08 08:59:37 +00:00
2025-07-08 11:53:21 +02:00
2025-06-25 17:29:10 +00:00
2025-06-27 18:33:11 +02:00
2025-06-27 19:25:32 +01:00
2025-07-08 10:18:26 +02:00
2025-07-07 14:52:57 +02:00
2025-06-11 17:28:06 +01:00
2025-06-30 11:49:03 +02:00
2025-06-26 16:25:00 +01:00
2025-07-03 11:17:27 +02:00
2025-07-08 08:59:37 +00:00
2025-06-13 16:22:12 +01:00
2025-04-08 14:12:08 +02:00
2025-04-09 11:48:49 +02:00
2025-04-08 14:12:08 +02:00
2025-06-23 14:17:25 +00:00
2025-06-26 16:25:00 +01:00
2025-07-07 11:42:33 +02:00
2025-06-17 19:37:18 +01:00
2025-07-02 19:57:15 +08:00
2025-06-25 17:29:10 +00:00
2025-06-26 16:25:00 +01:00
2025-03-17 16:09:46 +01:00
2025-07-08 08:22:04 +02:00