Fixing issue "Training beyond specified 't_total' steps with schedule 'warmup_linear'" reported in #556
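
The issue title points at what a linearly decaying schedule does once training runs past `t_total`. A minimal sketch of that failure mode and one possible guard, assuming the schedule receives training progress `x = step / t_total` and a `warmup` fraction. Both function names below are hypothetical and are not taken from this PR's diff:

```python
def warmup_linear_unclamped(x, warmup=0.002):
    """Hypothetical schedule without a guard: LR multiplier for progress x."""
    if x < warmup:
        # Linear ramp from 0 up to 1 during the warmup phase.
        return x / warmup
    # Linear decay; becomes negative once x exceeds 1 (step > t_total).
    return 1.0 - x


def warmup_linear_clamped(x, warmup=0.002):
    """Hypothetical guarded variant: the multiplier never drops below zero."""
    if x < warmup:
        return x / warmup
    # Decays from 1 at x == warmup to 0 at x == 1, then stays at 0.
    return max((x - 1.0) / (warmup - 1.0), 0.0)


if __name__ == "__main__":
    t_total = 1000
    for step in (1, 500, 1000, 1500):
        x = step / t_total
        print(step, warmup_linear_unclamped(x), warmup_linear_clamped(x))
```

Under these assumptions, any step beyond `t_total` gives the unclamped variant a negative multiplier, which would flip the sign of the update, while the clamped variant holds the learning rate at zero.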