From 751f17aa4891de2092620db1d4e2ba823ea4c2a8 Mon Sep 17 00:00:00 2001 From: regisss <15324346+regisss@users.noreply.github.com> Date: Thu, 16 Feb 2023 15:10:25 +0100 Subject: [PATCH] Fix typos in contrastive-image-text example README (#21665) --- examples/pytorch/contrastive-image-text/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/examples/pytorch/contrastive-image-text/README.md b/examples/pytorch/contrastive-image-text/README.md index ee3d9da73f..f22f2c82dc 100644 --- a/examples/pytorch/contrastive-image-text/README.md +++ b/examples/pytorch/contrastive-image-text/README.md @@ -50,9 +50,9 @@ COCO_DIR = os.path.join(os.getcwd(), "data") ds = datasets.load_dataset("ydshieh/coco_dataset_script", "2017", data_dir=COCO_DIR) ``` -### Create a model from a vision encoder model and a text decoder model +### Create a model from a vision encoder model and a text encoder model Next, we create a [VisionTextDualEncoderModel](https://huggingface.co/docs/transformers/model_doc/vision-text-dual-encoder#visiontextdualencoder). -The `VisionTextDualEncoderModel` class let's you load any vision and text encoder model to create a dual encoder. +The `VisionTextDualEncoderModel` class lets you load any vision and text encoder model to create a dual encoder. Here is an example of how to load the model using pre-trained vision and text models. ```python3