HPU support (#36424)

* test

* fix

* fix

* skip some and run some first

* test fsdp

* fix

* patches for generate

* test distributed

* copy

* don't test distributed loss for hpu

* require fp16 and run first

* changes from marc's PR fixing zero3

* better alternative

* return True when fp16 support on gaudi without creating bridge

* fix

* fix tested dtype in deepspeed inference test

* test

* fix

* test

* fix

* skip

* require fp16

* run first fsdp

* Apply suggestions from code review

* address comments

* address comments and refactor test

* reduce precison

* avoid doing gaudi1 specific stuff in the genreation loop

* document test_gradient_accumulation_loss_alignment_with_model_loss test a bit more
This commit is contained in:
Ilyas Moutawwakil
2025-03-12 09:08:12 +01:00
committed by GitHub
parent 50d3530aa0
commit 89f6956015
19 changed files with 337 additions and 139 deletions

View File

@@ -152,6 +152,7 @@ _deps = [
"pytest-asyncio",
"pytest-timeout",
"pytest-xdist",
"pytest-order",
"python>=3.9.0",
"ray[tune]>=2.7.0",
"regex!=2019.12.17",
@@ -324,6 +325,7 @@ extras["testing"] = (
"pytest-asyncio",
"pytest-rich",
"pytest-xdist",
"pytest-order",
"timeout-decorator",
"parameterized",
"psutil",