HuggingFace_transformer/tests/models/gemma2
Raushan Turganbay 7f552e28e0 Gemma2 and flash-attention (#32188)
* enable flash-attn & static cache

* this works, not the prev

* fix for sliding window layers

* not needed anymore
2024-07-31 10:33:38 +05:00