Files
HuggingFace_transformer/tests/utils
Zhen e686fed635 [Feature] Support using FlashAttention2 on Ascend NPU (#36696)
* [Feature] Support using flash-attention on Ascend NPU

* Fix qwen3 and qwen3_moe moduler conversion mismatch
2025-03-31 16:12:58 +02:00
..
2025-03-24 14:08:29 +00:00
2025-03-13 17:26:09 +00:00