Yaswanth Gali
fbdaa7b099
Add Aimv2 model (#36625)
* Model skelton
* changes
* temp push
* changes
* Added support for aimv2-native
* More changes
* More changes
* Stupid mistake correction
* Added config and refactor
* Added vison model
* update
* Refactor for lit variant
* Added Text Model
* Minor fixes
* nits
* update
* Preliminary tests
* More fixes
* Updated tests 🤗
* Refactor
* Updated testcase
* Updated config
* make fixup
* more fixes
* Bug fix and updates
* deadcode
* Fixes
* nit
* up
* Happy CI ✅
* Reduce LOC
* nit
* nit
* make style
* return_dict refactor
* bug fix
* fix
* doc update
* nit
* make fixup
* Minor update
* _init_weigths modifcation
* update tests
* Minor fixes post review
* Update w.r.t GradientCheckpointingLayer
* docs update
* update
* nit
* Use more Modular 😉
* Change name from AIMv2 to Aimv2
* Nit
* make style
* Add model doc pointer
* make style
* Update model doc section
* updates
* Modify attn mask and interface
* update test
* Final change
* Utilize flash and flex attn
* keep attn mask
* camelcase model name in test file
* Fix docstring
* Fix config warning finally and create_causal_mask
* disable torchscript
* remove unused arg
* remove from tests
* balance model size for tests
* fix device
* tests
* tests
* flaky test
* fix import
---------
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
2025-07-08 11:53:21 +02:00
..
2025-05-12 11:55:51 +02:00
2021-02-15 07:55:10 -05:00
2024-05-22 06:40:15 +02:00
2025-05-24 19:15:02 +02:00
2025-03-21 13:08:47 +01:00
2025-07-01 09:08:21 +00:00
2025-05-06 06:47:43 +02:00
2025-06-18 14:38:08 +01:00
2025-06-17 19:37:18 +01:00
2025-07-05 11:34:28 +02:00
2024-05-22 06:40:15 +02:00
2025-06-17 19:37:18 +01:00
2025-06-17 19:37:18 +01:00
2023-03-13 19:11:19 +01:00
2025-06-19 15:22:59 +01:00
2025-06-13 13:44:07 +01:00
2025-07-08 11:53:21 +02:00
2025-06-13 17:37:46 +02:00
2021-02-15 07:55:10 -05:00
2025-03-06 13:12:30 +00:00
2024-08-27 11:58:27 +01:00
2025-06-17 19:37:18 +01:00
2025-06-26 11:56:31 +00:00
2025-03-25 16:00:11 +01:00
2025-05-09 11:45:03 +02:00
2024-04-15 15:08:09 +02:00
2025-04-02 14:39:57 +02:00
2024-01-31 15:58:17 +01:00
2025-03-25 16:00:11 +01:00
2023-02-03 12:57:02 -05:00
2025-07-01 20:19:06 +02:00
2025-06-04 11:38:25 +02:00
2025-06-20 16:10:35 +00:00
2024-08-27 11:58:27 +01:00
2024-04-12 10:01:28 +02:00
2024-05-22 06:40:15 +02:00
2025-07-08 10:41:44 +02:00
2025-07-03 17:04:16 +01:00
2025-06-17 19:37:18 +01:00
2025-06-17 19:37:18 +01:00
2025-03-25 16:00:11 +01:00
2025-07-07 15:11:48 +02:00
2025-06-17 19:37:18 +01:00
2025-06-23 10:56:51 +02:00
2025-05-24 19:15:02 +02:00
2025-03-25 16:00:11 +01:00
2024-09-03 16:53:21 +02:00
2025-03-11 13:47:38 +00:00
2024-06-10 15:16:58 +02:00
2025-06-13 12:02:27 -07:00
2024-05-22 06:40:15 +02:00
2025-03-13 15:12:44 +00:00
2025-07-03 11:17:27 +02:00
2025-06-17 19:37:18 +01:00
2025-07-08 10:20:52 +02:00
2024-07-22 14:14:47 +01:00