Fix typos in contrastive-image-text example README (#21665)
This commit is contained in:
@@ -50,9 +50,9 @@ COCO_DIR = os.path.join(os.getcwd(), "data")
|
||||
ds = datasets.load_dataset("ydshieh/coco_dataset_script", "2017", data_dir=COCO_DIR)
|
||||
```
|
||||
|
||||
### Create a model from a vision encoder model and a text decoder model
|
||||
### Create a model from a vision encoder model and a text encoder model
|
||||
Next, we create a [VisionTextDualEncoderModel](https://huggingface.co/docs/transformers/model_doc/vision-text-dual-encoder#visiontextdualencoder).
|
||||
The `VisionTextDualEncoderModel` class let's you load any vision and text encoder model to create a dual encoder.
|
||||
The `VisionTextDualEncoderModel` class lets you load any vision and text encoder model to create a dual encoder.
|
||||
Here is an example of how to load the model using pre-trained vision and text models.
|
||||
|
||||
```python3
|
||||
|
||||
Reference in New Issue
Block a user