HuggingFace_transformer/tests
Anton Vlasjuk 5a2aedca1e [Mamba2] Fix caching, slow path, and multi-gpu (#35154)
* fixup mamba2 - caching and several other small fixes

* fixup cached forward

* correct fix this time

* fixup cache - we do not need to extend the attn mask; it's handled by generate (which gives total ids + mask at each step)

* remove unnecessary (un)squeeze

* fixup cache position

* simplify a few things

* [run-slow] mamba2

* multi gpu attempt two

* [run-slow] mamba2

* [run-slow] mamba2

* [run-slow] mamba2

* [run-slow] mamba2

* add newer slow path fix

* [run-slow] mamba2
2024-12-20 09:27:47 +01:00