Files
HuggingFace_transformer/tests/models
Anton Vlasjuk 5a2aedca1e [Mamba2] Fix caching, slow path, and multi-gpu (#35154)
* fixup mamba2 - caching and several other small fixes

* fixup cached forward

* correct fix this time

* fixup cache - we do not need to extend the attn mask it's handled by generate (gives total ids + mask at each step)

* remove unnecessary (un)squeeze

* fixup cache position

* simplify a few things

* [run-slow] mamba2

* multi gpu attempt two

* [run-slow] mamba2

* [run-slow] mamba2

* [run-slow] mamba2

* [run-slow] mamba2

* add newer slow path fix

* [run-slow] mamba2
2024-12-20 09:27:47 +01:00
..
2024-12-06 12:17:34 +01:00
2024-12-18 20:18:17 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-12-17 14:44:47 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-12-13 09:35:50 +01:00
2024-12-13 09:35:50 +01:00
2024-12-17 11:26:43 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-12-17 14:44:47 +01:00
2024-06-26 21:59:08 +01:00
2024-09-21 01:58:00 +01:00
2024-12-18 16:53:39 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-12-18 16:53:39 +01:00
2024-06-26 21:59:08 +01:00
2024-05-22 06:40:15 +02:00
2024-12-18 16:53:39 +01:00
2024-12-18 16:53:39 +01:00
2024-12-19 17:05:25 +01:00
2024-12-19 17:05:25 +01:00
2024-06-26 21:59:08 +01:00
2024-12-18 16:53:39 +01:00
2024-06-26 21:59:08 +01:00
2024-12-19 17:05:25 +01:00
2024-10-22 08:23:53 +02:00
2024-06-26 21:59:08 +01:00
2024-12-18 16:53:39 +01:00
2024-12-18 16:53:39 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-10-07 10:56:24 +02:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-12-18 16:53:39 +01:00
2024-12-18 16:53:39 +01:00
2024-10-04 22:45:37 +02:00
2024-06-26 21:59:08 +01:00
2024-12-18 16:53:39 +01:00
2024-12-18 16:53:39 +01:00
2024-12-18 16:53:39 +01:00
2024-12-18 16:53:39 +01:00
2024-06-26 21:59:08 +01:00
2024-12-11 12:40:30 +00:00
2024-11-29 11:58:11 +00:00
2024-05-22 06:40:15 +02:00
2024-06-26 21:59:08 +01:00
2024-10-15 11:27:54 +02:00
2024-12-18 14:13:21 +01:00
2024-06-26 21:59:08 +01:00
2022-05-03 14:42:02 +02:00