Files
HuggingFace_transformer/docs/source/model_doc
Suraj Patil ca33278fdb FlaxGPT2 (#11556)
* flax gpt2

* combine masks

* handle shared embeds

* add causal LM sample

* style

* add tests

* style

* fix imports, docs, quality

* don't use cache

* add cache

* add cache 1st version

* make use cache work

* start adding test for generation

* finish generation loop compilation

* rewrite test

* finish

* update

* update

* apply sylvains suggestions

* update

* refactor

* fix typo

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-05-18 22:50:51 +01:00
..
2021-05-18 22:50:51 +01:00
2021-04-21 11:11:20 -04:00
2021-04-21 11:11:20 -04:00
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-05-12 13:48:15 +05:30
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-01-27 21:25:11 +03:00
2021-04-21 11:11:20 -04:00
2021-04-21 09:47:27 -04:00
2021-05-04 20:56:09 +02:00
2020-12-07 18:36:34 -05:00
2021-04-21 09:47:27 -04:00
2021-05-18 22:50:51 +01:00
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-05-03 09:07:29 -04:00
2020-12-10 09:29:38 -05:00
2021-04-21 09:47:27 -04:00
2021-04-21 11:11:20 -04:00
2020-12-07 18:36:34 -05:00
2021-04-21 09:47:27 -04:00
2021-04-21 11:11:20 -04:00
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2021-04-21 09:47:27 -04:00
2020-12-07 18:36:34 -05:00
2021-04-21 11:11:20 -04:00