[Docs] Model_doc structure/clarity improvements (#26876)

* first batch of structure improvements for model_docs
* second batch of structure improvements for model_docs
* more structure improvements for model_docs
* more structure improvements for model_docs
* structure improvements for cv model_docs
* more structural refactoring
* addressed feedback about image processors
2023-11-03 10:57:03 -04:00
parent ad8ff96224
commit 5964f820db
223 changed files with 1796 additions and 1116 deletions
--- a/docs/source/en/model_doc/blip.md
+++ b/docs/source/en/model_doc/blip.md
@@ -20,7 +20,7 @@ rendered properly in your Markdown viewer.

 The BLIP model was proposed in [BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation](https://arxiv.org/abs/2201.12086) by Junnan Li, Dongxu Li, Caiming Xiong, Steven Hoi.

-BLIP is a model that is able to perform various multi-modal tasks including
+BLIP is a model that is able to perform various multi-modal tasks including:
 - Visual Question Answering 
 - Image-Text retrieval (Image-text matching)
 - Image Captioning
@@ -39,7 +39,6 @@ The original code can be found [here](https://github.com/salesforce/BLIP).

 - [Jupyter notebook](https://github.com/huggingface/notebooks/blob/main/examples/image_captioning_blip.ipynb) on how to fine-tune BLIP for image captioning on a custom dataset

-
 ## BlipConfig

 [[autodoc]] BlipConfig
@@ -57,12 +56,14 @@ The original code can be found [here](https://github.com/salesforce/BLIP).

 [[autodoc]] BlipProcessor

-
 ## BlipImageProcessor

 [[autodoc]] BlipImageProcessor
    - preprocess

+<frameworkcontent>
+<pt>
+
 ## BlipModel

 [[autodoc]] BlipModel
@@ -75,30 +76,29 @@ The original code can be found [here](https://github.com/salesforce/BLIP).
 [[autodoc]] BlipTextModel
    - forward

-
 ## BlipVisionModel

 [[autodoc]] BlipVisionModel
    - forward

-
 ## BlipForConditionalGeneration

 [[autodoc]] BlipForConditionalGeneration
    - forward

-
 ## BlipForImageTextRetrieval

 [[autodoc]] BlipForImageTextRetrieval
    - forward

-
 ## BlipForQuestionAnswering

 [[autodoc]] BlipForQuestionAnswering
    - forward

+</pt>
+<tf>
+
 ## TFBlipModel

 [[autodoc]] TFBlipModel
@@ -111,26 +111,24 @@ The original code can be found [here](https://github.com/salesforce/BLIP).
 [[autodoc]] TFBlipTextModel
    - call

-
 ## TFBlipVisionModel

 [[autodoc]] TFBlipVisionModel
    - call

-
 ## TFBlipForConditionalGeneration

 [[autodoc]] TFBlipForConditionalGeneration
    - call

-
 ## TFBlipForImageTextRetrieval

 [[autodoc]] TFBlipForImageTextRetrieval
    - call

-
 ## TFBlipForQuestionAnswering

 [[autodoc]] TFBlipForQuestionAnswering
-    - call
+    - call
+</tf>
+</frameworkcontent>
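
The hunks above wrap the PyTorch and TensorFlow class sections in `<frameworkcontent>`, `<pt>`, and `<tf>` tags, and the opening and closing tags land in different hunks, so a mismatch is easy to miss in review. The following is a minimal sketch of a balance check for those tags. The function name `check_framework_tags` and the tag list are assumptions made for illustration; this is not part of the commit or of any Hugging Face tooling.

```python
import re

# Tags seen in this diff; extend the tuple if other framework tags are in use.
FRAMEWORK_TAGS = ("frameworkcontent", "pt", "tf")

# Matches a line that contains only an opening or closing framework tag.
_TAG_RE = re.compile(r"^\s*<(/?)(%s)>\s*$" % "|".join(FRAMEWORK_TAGS))


def check_framework_tags(markdown_text):
    """Return a list of (line_number, message) problems; empty if balanced."""
    problems = []
    stack = []  # (tag, line_number) for each tag that is currently open
    for lineno, line in enumerate(markdown_text.splitlines(), start=1):
        match = _TAG_RE.match(line)
        if not match:
            continue
        is_closing = match.group(1) == "/"
        tag = match.group(2)
        if not is_closing:
            stack.append((tag, lineno))
        elif stack and stack[-1][0] == tag:
            stack.pop()
        else:
            problems.append((lineno, f"unexpected </{tag}>"))
    # Anything still on the stack was opened but never closed.
    for tag, lineno in stack:
        problems.append((lineno, f"<{tag}> is never closed"))
    return problems
```

Calling `check_framework_tags(open("blip.md").read())` on the post-commit file should return an empty list if the tags are nested as the diff suggests. The check is line-based and ignores tags that share a line with other text.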