[docs] Trainer docs (#28145)
* fsdp, debugging, gpu selection
* fix hfoption
* fix
@@ -256,6 +256,12 @@ NEFTune is disabled after training to restore the original embedding layer to av
The [`Trainer`] class is powered by [Accelerate](https://hf.co/docs/accelerate), a library for easily training PyTorch models in distributed environments with support for integrations such as [FullyShardedDataParallel (FSDP)](https://pytorch.org/blog/introducing-pytorch-fully-sharded-data-parallel-api/) and [DeepSpeed](https://www.deepspeed.ai/).
<Tip>
Learn more about FSDP sharding strategies, CPU offloading, and more with the [`Trainer`] in the [Fully Sharded Data Parallel](fsdp) guide.
</Tip>
To use Accelerate with [`Trainer`], run the [`accelerate config`](https://huggingface.co/docs/accelerate/package_reference/cli#accelerate-config) command to set up training for your environment. This command creates a `config_file.yaml` that'll be used when you launch your training script. Some example configurations you can set up are:
<hfoptions id="config">