CI: update to ROCm 6.0.2 and test MI300 (#30266)
* update to ROCm 6.0.2 and test MI300 * add callers for mi300 * update dockerfile * fix trainer tests * remove apex * style * Update tests/trainer/test_trainer_seq2seq.py * Update tests/trainer/test_trainer_seq2seq.py * Update tests/trainer/test_trainer_seq2seq.py * Update tests/trainer/test_trainer_seq2seq.py * update to torch 2.3 * add workflow dispatch target * we may need branches: mi300-ci after all * nit * fix docker build * nit * add check runner * remove docker-gpu * fix issues * fix --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
This commit is contained in:
@@ -301,6 +301,7 @@ class TestTrainerExt(TestCasePlus):
|
||||
--label_smoothing_factor 0.1
|
||||
--target_lang ro_RO
|
||||
--source_lang en_XX
|
||||
--report_to none
|
||||
""".split()
|
||||
|
||||
args_eval = f"""
|
||||
|
||||
Reference in New Issue
Block a user