HuggingFace_transformer/tests
Zhen e686fed635 [Feature] Support using FlashAttention2 on Ascend NPU (#36696)
* [Feature] Support using flash-attention on Ascend NPU

* Fix qwen3 and qwen3_moe modular conversion mismatch
2025-03-31 16:12:58 +02:00