Document adam betas TrainingArguments (#8688)
This commit is contained in:
@@ -89,6 +89,10 @@ class TrainingArguments:
|
||||
The initial learning rate for Adam.
|
||||
weight_decay (:obj:`float`, `optional`, defaults to 0):
|
||||
The weight decay to apply (if not zero).
|
||||
adam_beta1 (:obj:`float`, `optional`, defaults to 0.9):
|
||||
The beta1 for the Adam optimizer.
|
||||
adam_beta2 (:obj:`float`, `optional`, defaults to 0.999):
|
||||
The beta2 for the Adam optimizer.
|
||||
adam_epsilon (:obj:`float`, `optional`, defaults to 1e-8):
|
||||
Epsilon for the Adam optimizer.
|
||||
max_grad_norm (:obj:`float`, `optional`, defaults to 1.0):
|
||||
|
||||
Reference in New Issue
Block a user