[Trainer] memory tracker metrics (#10225)
* memory tracker metrics
* go back to eval for somewhat consistency
* handle no-gpu case
* deal with stackable eval calls
* restore callback order
* style
* simplify the API
* add test
* docs
* consistently use eval_ prefix
* improve docs
* Update src/transformers/trainer_utils.py
* rename method
* style

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
@@ -88,8 +88,8 @@ class TestDeepSpeed(TestCasePlus):
             extra_args_str="--do_eval",
             remove_args_str="--do_train",
         )
-        val_metrics = load_json(os.path.join(output_dir, "val_results.json"))
-        assert "val_bleu" in val_metrics
+        val_metrics = load_json(os.path.join(output_dir, "eval_results.json"))
+        assert "eval_bleu" in val_metrics
 
     # XXX: need to do better validation beyond just that the run was successful
     def run_quick(self, distributed=True, extra_args_str=None, remove_args_str=None):
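The hunk above switches the test from the `val_` prefix to the `eval_` prefix, both in the results filename and in the metric key. A minimal, self-contained sketch of that check follows. It assumes a `load_json` helper that simply wraps `json.load` (the real helper in the test suite is not shown in this diff), and `has_metric` is a hypothetical name introduced only for illustration. The results file is written by hand with a placeholder value so the snippet runs without a training job.

```python
import json
import os
import tempfile


def load_json(path):
    """Assumed stand-in for the test suite's helper: parse a JSON file."""
    with open(path) as f:
        return json.load(f)


def has_metric(output_dir, key, filename="eval_results.json"):
    """Hypothetical helper: True if `key` is present in the results file."""
    return key in load_json(os.path.join(output_dir, filename))


# Simulate the file an evaluation run would leave behind.
# The metric value is a placeholder, not a real result.
with tempfile.TemporaryDirectory() as output_dir:
    with open(os.path.join(output_dir, "eval_results.json"), "w") as f:
        json.dump({"eval_bleu": 0.0}, f)

    assert has_metric(output_dir, "eval_bleu")     # new, eval_-prefixed key
    assert not has_metric(output_dir, "val_bleu")  # old key is no longer written
```

Checking for the absence of the old `val_bleu` key is an addition of this sketch; the test in the diff only asserts that `eval_bleu` is present.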