Files
HuggingFace_transformer/docs/source
Suraj Patil ca33278fdb FlaxGPT2 (#11556)
* flax gpt2

* combine masks

* handle shared embeds

* add causal LM sample

* style

* add tests

* style

* fix imports, docs, quality

* don't use cache

* add cache

* add cache 1st version

* make use cache work

* start adding test for generation

* finish generation loop compilation

* rewrite test

* finish

* update

* update

* apply sylvains suggestions

* update

* refactor

* fix typo

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-05-18 22:50:51 +01:00
..
2021-05-12 17:08:35 +02:00
2021-05-13 10:34:14 -04:00
2021-05-18 22:50:51 +01:00
2021-04-21 11:11:20 -04:00
2021-05-12 11:46:02 -04:00
2021-05-18 22:50:51 +01:00
2021-04-28 11:16:41 -04:00
2021-04-23 09:17:37 -04:00
2021-04-21 11:11:20 -04:00
2021-04-21 11:11:20 -04:00
2020-04-06 14:32:39 -04:00
2021-04-01 11:58:37 -06:00
2021-04-21 11:11:20 -04:00
2020-12-07 18:36:34 -05:00
2021-05-03 13:18:46 -04:00