Files
HuggingFace_transformer/docs/source/en/model_doc
Susnato Dhar b5db8ca66f Add flash attention for gpt_bigcode (#26479)
* added flash attention of gpt_bigcode

* changed docs

* Update src/transformers/models/gpt_bigcode/modeling_gpt_bigcode.py

* add FA-2 docs

* oops

* Update docs/source/en/perf_infer_gpu_one.md Last Nit

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix

* oops

* remove padding_mask

* change getattr->hasattr logic

* changed .md file

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-10-31 11:21:02 +00:00
..
2023-09-22 19:53:55 +03:00
2023-06-20 18:07:47 -04:00
2023-08-17 12:08:11 +02:00
2023-07-27 18:24:56 +01:00
2023-07-13 11:46:54 -04:00
2023-09-14 18:02:37 +01:00
2023-10-13 11:12:59 -07:00
2023-09-01 20:40:40 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-07-18 15:34:06 +01:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-10-19 15:36:41 +02:00
2023-06-20 18:07:47 -04:00
2023-06-26 11:23:57 +02:00
2023-10-30 21:42:19 +01:00
2023-06-20 18:07:47 -04:00
2023-10-23 14:19:59 +02:00
2023-09-05 10:50:08 -07:00
2023-07-13 11:46:54 -04:00
2023-10-17 14:06:37 -07:00
2023-09-04 11:53:41 +01:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-09-26 07:06:04 +02:00
2023-06-20 18:07:47 -04:00
2023-07-24 15:34:19 +01:00
2023-06-20 18:07:47 -04:00
2023-07-13 11:46:54 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-07-13 11:46:54 -04:00
2023-06-20 18:07:47 -04:00
2023-07-13 11:46:54 -04:00
2023-06-20 18:07:47 -04:00
2023-08-29 10:03:52 +01:00
2023-09-26 07:06:38 +02:00
2023-07-11 14:04:04 +01:00
2023-06-20 18:07:47 -04:00