chore: Fix multiple typos (#28574)

hugo-syn
2024-01-18 14:35:09 +01:00
committed by GitHub
parent 8189977885
commit 5d8eb93eee
5 changed files with 5 additions and 5 deletions

@@ -27,7 +27,7 @@ To adapt the script for other models, we need to also change the `ParitionSpec`
TODO: Add more explantion.
-Before training, let's prepare our model first. To be able to shard the model, the sharded dimention needs to be a multiple of devices it'll be sharded on. But GPTNeo's vocab size is 50257, so we need to resize the embeddings accordingly.
+Before training, let's prepare our model first. To be able to shard the model, the sharded dimension needs to be a multiple of devices it'll be sharded on. But GPTNeo's vocab size is 50257, so we need to resize the embeddings accordingly.
```python
from transformers import FlaxGPTNeoForCausalLM, GPTNeoConfig