[docs] Improve visualization for vertical parallelism (#28583)
The documentation says "We refer to this Model parallelism as “Vertical” because of how models are typically visualized.", but then draws the model horizontally. This change draws the diagram vertically so that it matches the text.
@@ -285,10 +285,19 @@ following diagram shows an 8-layer model split vertically into two slices, placi
 GPU0 and 4-7 to GPU1:
 
 ```
-===================  ===================
-|  0 | 1 | 2 | 3  |  |  4 | 5 | 6 | 7  |
-===================  ===================
-        GPU0                 GPU1
+================
+| Layer |      |
+|   0   |      |
+|   1   | GPU0 |
+|   2   |      |
+|   3   |      |
+================
+| Layer |      |
+|   4   |      |
+|   5   | GPU1 |
+|   6   |      |
+|   7   |      |
+================
 ```
 
 In this example, when data moves from layer 0 to 3, it's no different from regular forward pass. However, passing data
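
The placement shown in the diagram (layers 0-3 on GPU0, layers 4-7 on GPU1) can be sketched in plain Python. This is only an illustration of the layer-to-device mapping. The helper names `assign_layers` and `device_of` are hypothetical and do not come from the documentation or from any library, and devices are represented by integer indices rather than real GPUs.

```python
from __future__ import annotations


def assign_layers(num_layers: int, num_devices: int) -> dict[int, list[int]]:
    """Split layer indices into contiguous slices, one slice per device.

    Hypothetical helper: device 0 stands in for GPU0, device 1 for GPU1.
    """
    per_device, remainder = divmod(num_layers, num_devices)
    placement: dict[int, list[int]] = {}
    start = 0
    for device in range(num_devices):
        # Spread any leftover layers over the first `remainder` devices.
        size = per_device + (1 if device < remainder else 0)
        placement[device] = list(range(start, start + size))
        start += size
    return placement


def device_of(layer: int, placement: dict[int, list[int]]) -> int:
    """Return the device index that holds the given layer."""
    for device, layers in placement.items():
        if layer in layers:
            return device
    raise ValueError(f"layer {layer} is not placed on any device")


# 8 layers over 2 devices, as in the diagram.
placement = assign_layers(8, 2)

# Consecutive layer pairs whose activations must move between devices.
crossings = [
    (layer, layer + 1)
    for layer in range(7)
    if device_of(layer, placement) != device_of(layer + 1, placement)
]
```

With this placement, the only pair of consecutive layers that sits on different devices is layer 3 and layer 4, so that is the one point in the forward pass where data has to be copied from one device to the other.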