Fix more inefficient PT operations (#37060)
* Fix inefficient operations
* Remove cpu() call
* Reorder detach()
* Reorder detach()
* tolist without detach
* item without detach
* Update src/transformers/models/rag/modeling_rag.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update tests/models/encodec/test_modeling_encodec.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Use detach().cpu().numpy
* Revert some numpy operations
* More fixes

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
@@ -4231,7 +4231,7 @@ class ModelTesterMixin:
 loss = model(**inputs).loss
 loss.backward()
 
-params = {name: param.grad.clone().detach().cpu() for name, param in model.named_parameters()}
+params = {name: param.grad.detach().to(device="cpu", copy=True) for name, param in model.named_parameters()}
 model.zero_grad()
 del loss
 
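The hunk above replaces `clone().detach().cpu()` with `detach().to(device="cpu", copy=True)`. The following is a minimal sketch of why the two should be interchangeable for snapshotting gradients. It is not taken from the test suite: the helper names `snapshot_grads_old` and `snapshot_grads_new` and the toy `torch.nn.Linear` model are hypothetical, and the example runs on CPU only. On an accelerator, the old form first clones the gradient on the source device and then transfers it, whereas the new form asks for a single copy made during the transfer.

```python
import torch


def snapshot_grads_old(model):
    # Pre-change pattern: clone on the source device, detach, then move to CPU.
    return {name: p.grad.clone().detach().cpu() for name, p in model.named_parameters()}


def snapshot_grads_new(model):
    # Post-change pattern: detach first, then request one explicit copy on CPU.
    # copy=True guarantees a new tensor even when the gradient already lives on CPU.
    return {name: p.grad.detach().to(device="cpu", copy=True) for name, p in model.named_parameters()}


torch.manual_seed(0)
model = torch.nn.Linear(4, 2)  # hypothetical stand-in for the model under test
loss = model(torch.randn(3, 4)).sum()
loss.backward()

old = snapshot_grads_old(model)
new = snapshot_grads_new(model)

# Check, before the gradients are cleared, that the snapshots own their memory.
any_shared = any(
    new[name].data_ptr() == p.grad.data_ptr() for name, p in model.named_parameters()
)

model.zero_grad()  # the snapshots must survive this call
del loss
```

Because both helpers return independent CPU tensors, the dictionaries stay valid after `model.zero_grad()`, which is the property the test in the hunk relies on.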