New features for CodeParrot training script (#16851)
* add tflops logging and fix grad accumulation * add accelerate tracking and checkpointing * scale loss of last batch correctly * fix typo * compress loss computation Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * add resume from checkpoint argument * add load_state accelerate from checkpoint, register lr scheduler and add tflops function * reformat code * reformat code * add condition on path for resume checkpoint * combine if conditions Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * add source for tflops formula Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
This commit is contained in:
@@ -82,7 +82,7 @@ Now that the dataset, tokenizer, and model are ready we can start training the m
|
||||
First you need to configure `accelerate` and login to Weights & Biases:
|
||||
|
||||
```bash
|
||||
acclerate config
|
||||
accelerate config
|
||||
wandb login
|
||||
```
|
||||
|
||||
|
||||
Reference in New Issue
Block a user