Files
HuggingFace_transformer/docs/source
Susnato Dhar 1ac2463dfe [FA2] Add flash attention for for DistilBert (#26489)
* flash attention added for DistilBert

* fixes

* removed padding_masks

* Update modeling_distilbert.py

* Update test_modeling_distilbert.py

* style fix
2023-11-03 16:07:54 +00:00
..
2023-10-02 09:56:40 -07:00
2023-09-04 11:15:12 +01:00
2023-08-17 12:08:11 +02:00
2023-11-02 10:42:29 -07:00