Yaswanth Gali
fbdaa7b099
Add Aimv2 model (#36625)
* Model skeleton
* changes
* temp push
* changes
* Added support for aimv2-native
* More changes
* More changes
* Stupid mistake correction
* Added config and refactor
* Added vision model
* update
* Refactor for lit variant
* Added Text Model
* Minor fixes
* nits
* update
* Preliminary tests
* More fixes
* Updated tests 🤗
* Refactor
* Updated testcase
* Updated config
* make fixup
* more fixes
* Bug fix and updates
* deadcode
* Fixes
* nit
* up
* Happy CI ✅
* Reduce LOC
* nit
* nit
* make style
* return_dict refactor
* bug fix
* fix
* doc update
* nit
* make fixup
* Minor update
* _init_weights modification
* update tests
* Minor fixes post review
* Update w.r.t GradientCheckpointingLayer
* docs update
* update
* nit
* Use more Modular 😉
* Change name from AIMv2 to Aimv2
* Nit
* make style
* Add model doc pointer
* make style
* Update model doc section
* updates
* Modify attn mask and interface
* update test
* Final change
* Utilize flash and flex attn
* keep attn mask
* camelcase model name in test file
* Fix docstring
* Fix config warning finally and create_causal_mask
* disable torchscript
* remove unused arg
* remove from tests
* balance model size for tests
* fix device
* tests
* tests
* flaky test
* fix import
---------
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
2025-07-08 11:53:21 +02:00