Files
HuggingFace_transformer/docs/source/en/model_doc
Susnato Dhar 1ac2463dfe [FA2] Add flash attention for for DistilBert (#26489)
* flash attention added for DistilBert

* fixes

* removed padding_masks

* Update modeling_distilbert.py

* Update test_modeling_distilbert.py

* style fix
2023-11-03 16:07:54 +00:00
..
2023-09-22 19:53:55 +03:00
2023-06-20 18:07:47 -04:00
2023-10-19 15:36:41 +02:00
2023-10-30 21:42:19 +01:00
2023-07-24 15:34:19 +01:00
2023-07-13 11:46:54 -04:00