Document adam betas TrainingArguments (#8688)
@@ -89,6 +89,10 @@ class TrainingArguments:
     The initial learning rate for Adam.
 weight_decay (:obj:`float`, `optional`, defaults to 0):
     The weight decay to apply (if not zero).
+adam_beta1 (:obj:`float`, `optional`, defaults to 0.9):
+    The beta1 for the Adam optimizer.
+adam_beta2 (:obj:`float`, `optional`, defaults to 0.999):
+    The beta2 for the Adam optimizer.
 adam_epsilon (:obj:`float`, `optional`, defaults to 1e-8):
     Epsilon for the Adam optimizer.
 max_grad_norm (:obj:`float`, `optional`, defaults to 1.0):
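
For readers unfamiliar with what the two newly documented arguments control, the following is a minimal sketch of a textbook Adam update for a single scalar parameter. It is not the code used by the library: `adam_step` is a hypothetical helper, the learning rate of 5e-5 is an illustrative value, and weight decay is left out. The defaults for `beta1`, `beta2`, and `eps` are the ones documented in the diff above. `beta1` sets the decay rate of the running mean of gradients, `beta2` sets the decay rate of the running mean of squared gradients, and `eps` guards against division by zero.

```python
import math


def adam_step(param, grad, m, v, step, lr=5e-5, beta1=0.9, beta2=0.999, eps=1e-8):
    """One textbook Adam update for a scalar parameter (hypothetical helper).

    m and v are the running first and second moment estimates;
    step is the 1-based update count used for bias correction.
    """
    # beta1 controls how quickly the gradient average forgets old gradients.
    m = beta1 * m + (1.0 - beta1) * grad
    # beta2 controls how quickly the squared-gradient average forgets old values.
    v = beta2 * v + (1.0 - beta2) * grad * grad
    # Bias correction compensates for m and v starting at zero.
    m_hat = m / (1.0 - beta1 ** step)
    v_hat = v / (1.0 - beta2 ** step)
    # eps keeps the denominator away from zero.
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


# Usage: a few steps on f(x) = x**2, whose gradient is 2 * x.
param, m, v = 1.0, 0.0, 0.0
for step in range(1, 4):
    grad = 2.0 * param
    param, m, v = adam_step(param, grad, m, v, step)
```

Values of `beta1` and `beta2` closer to 1 average over a longer gradient history, which smooths the updates but makes them slower to react to changes in the gradient.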