Set TF32 flag for PyTorch cuDNN backend (#25075)

2023-07-25 20:04:48 +08:00
parent 5dba88b2d2
commit 6bc61aa7af
3 changed files with 6 additions and 0 deletions
--- a/docs/source/en/perf_train_gpu_one.md
+++ b/docs/source/en/perf_train_gpu_one.md
@@ -203,6 +203,7 @@ improvement. All you need to do is to add the following to your code:
 ```
 import torch
 torch.backends.cuda.matmul.allow_tf32 = True
+torch.backends.cudnn.allow_tf32 = True
 ```

 CUDA will automatically switch to using tf32 instead of fp32 where possible, assuming that the used GPU is from the Ampere series.