non-native optimizers are mostly ok with zero-offload (#12690)

This commit is contained in:
Stas Bekman
2021-07-13 20:18:51 -07:00
committed by GitHub
parent 4cdb7ee51d
commit 5dd0c956a8
3 changed files with 12 additions and 11 deletions

View File

@@ -1061,7 +1061,8 @@ optimizers, with the exception of using the combination of HuggingFace scheduler
| DS Optimizer | No | Yes |
+--------------+--------------+--------------+
If ``offload_optimizer`` is enabled you must use both DeepSpeed scheduler and DeepSpeed optimizer.
It is possible to use a non-DeepSpeed optimizer when ``offload_optimizer`` is enabled, as long as it has both CPU and
GPU implementation (except LAMB).