Files
HuggingFace_transformer/tests/models
Younes Belkada ae9a344cce [Mistral] Add Flash Attention-2 support for mistral (#26464)
* add FA-2 support for mistral

* fixup

* add sliding windows

* fixing few nits

* v1 slicing cache - logits do not match

* add comment

* fix bugs

* more mem efficient

* add warning once

* add warning once

* oops

* fixup

* more comments

* copy

* add safety checker

* fixup

* Update src/transformers/models/mistral/modeling_mistral.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* copied from

* up

* raise when padding side is right

* fixup

* add doc + few minor changes

* fixup

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-10-03 13:44:46 +02:00
..
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-19 11:15:38 +02:00
2023-06-06 14:31:14 -04:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-09-18 20:28:36 +02:00
2023-09-18 20:28:36 +02:00
2023-09-18 20:28:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-10-02 18:29:27 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-05 10:12:25 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-09-05 11:19:56 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-14 17:08:47 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-10-02 09:13:19 +02:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-10-02 13:55:46 +02:00
2023-09-21 14:46:05 +02:00
2023-06-16 15:40:49 +01:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-10-02 13:55:46 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-06-16 15:40:49 +01:00
2023-09-18 20:28:36 +02:00
2023-09-28 15:08:35 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-19 11:15:38 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-05-24 13:52:52 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-04-04 14:53:06 +02:00
2023-09-26 07:06:04 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-06-16 15:40:49 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-05 10:12:25 +02:00
2023-09-05 10:12:25 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-05 10:12:25 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:02:57 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-19 10:56:10 -03:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-09-18 20:28:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2022-05-03 14:42:02 +02:00