Files
HuggingFace_transformer/docs/source
Stas Bekman 78f5fe1416 [Deepspeed] adapt multiple models, add zero_to_fp32 tests (#12477)
* zero_to_fp32 tests

* args change

* remove unnecessary work

* use transformers.trainer_utils.get_last_checkpoint

* document the new features

* cleanup

* wip

* fix fsmt

* add bert

* cleanup

* add xlm-roberta

* electra works

* cleanup

* sync

* split off the model zoo tests

* cleanup

* cleanup

* cleanup

* cleanup

* reformat

* cleanup

* casing

* deepspeed>=0.4.3

* adjust distilbert

* Update docs/source/main_classes/deepspeed.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-07-13 12:07:32 -07:00
..
2021-06-30 14:39:52 +02:00
2021-07-08 07:20:46 -04:00
2021-06-09 11:51:13 -04:00
2021-06-17 17:57:42 +02:00
2021-04-21 11:11:20 -04:00
2020-04-06 14:32:39 -04:00
2021-07-09 18:48:28 -07:00
2021-06-22 15:34:19 -07:00
2021-04-01 11:58:37 -06:00
2021-07-12 18:02:51 +02:00
2021-07-12 12:03:13 -04:00