Files
HuggingFace_transformer/docs/source/en
Younes Belkada ae9a344cce [Mistral] Add Flash Attention-2 support for mistral (#26464)
* add FA-2 support for mistral

* fixup

* add sliding windows

* fixing few nits

* v1 slicing cache - logits do not match

* add comment

* fix bugs

* more mem efficient

* add warning once

* add warning once

* oops

* fixup

* more comments

* copy

* add safety checker

* fixup

* Update src/transformers/models/mistral/modeling_mistral.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* copied from

* up

* raise when padding side is right

* fixup

* add doc + few minor changes

* fixup

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-10-03 13:44:46 +02:00
..
2022-04-04 10:25:46 -04:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:16:34 +01:00
2022-04-04 10:25:46 -04:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2023-09-04 11:15:12 +01:00
2022-04-04 10:25:46 -04:00
2023-08-04 14:56:29 +02:00
2023-07-25 22:10:06 +02:00
2023-08-23 08:34:30 +02:00