From 180c6de6a69645225ae0017f0b5030d0c411dacf Mon Sep 17 00:00:00 2001 From: Jake Tae Date: Tue, 31 Aug 2021 06:49:05 -0400 Subject: [PATCH] docs: fix minor typo (#13289) `at` should be `a1` --- docs/source/parallelism.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/source/parallelism.md b/docs/source/parallelism.md index 28b0822e2a..0d54a97bbf 100644 --- a/docs/source/parallelism.md +++ b/docs/source/parallelism.md @@ -58,7 +58,7 @@ a0 | b0 | c0 a1 | b1 | c1 a2 | b2 | c2 ``` -Layer La has weights a0, at and a2. +Layer La has weights a0, a1 and a2. If we have 3 GPUs, the Sharded DDP (= Zero-DP) splits the model onto 3 GPUs like so: