[CLIP] allow loading projection layer in vision and text model (#18962)

* allow loading projection in text and vision model
* begin tests
* finish test for CLIPTextModelTest
* style
* add slow tests
* add new classes for projection heads
* remove with_projection
* add in init
* add in doc
* fix tests
* fix some more tests
* fix copies
* fix docs
* remove leftover from fix-copies
* add the head models in IGNORE_NON_AUTO_CONFIGURED
* fix docstr
* fix tests
* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* add docstr for models

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-11-15 17:50:07 +01:00
parent 9643ecf8ca
commit 7f74433814
10 changed files with 347 additions and 7 deletions
--- a/utils/check_repo.py
+++ b/utils/check_repo.py
@@ -177,7 +177,9 @@ IGNORE_NON_AUTO_CONFIGURED = PRIVATE_MODELS.copy() + [
    "PLBartDecoderWrapper",
    "BeitForMaskedImageModeling",
    "CLIPTextModel",
+    "CLIPTextModelWithProjection",
    "CLIPVisionModel",
+    "CLIPVisionModelWithProjection",
    "GroupViTTextModel",
    "GroupViTVisionModel",
    "TFCLIPTextModel",
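
Note on the hunk above: IGNORE_NON_AUTO_CONFIGURED lists model classes that the repo check accepts even though they are not registered in any auto-model mapping. The sketch below is a simplified illustration of why the two new class names have to be added. It is not the actual code in utils/check_repo.py: the helper `find_unregistered` and the sample lists `all_models` and `auto_mapped` are hypothetical and exist only for this example.

```python
# Simplified, hypothetical stand-in for the check in utils/check_repo.py.
# Only the four CLIP entries shown in the diff are reproduced here.
IGNORE_NON_AUTO_CONFIGURED = [
    "CLIPTextModel",
    "CLIPTextModelWithProjection",
    "CLIPVisionModel",
    "CLIPVisionModelWithProjection",
]


def find_unregistered(model_names, auto_mapped_names, ignore_list):
    """Return class names that are neither auto-mapped nor explicitly ignored."""
    registered = set(auto_mapped_names)
    ignored = set(ignore_list)
    return [
        name
        for name in model_names
        if name not in registered and name not in ignored
    ]


# Illustrative inputs (assumed for this sketch, not read from the library).
all_models = [
    "CLIPModel",
    "CLIPTextModel",
    "CLIPTextModelWithProjection",
    "CLIPVisionModel",
    "CLIPVisionModelWithProjection",
]
auto_mapped = ["CLIPModel"]

# With the entries added by this commit, nothing is reported.
failures = find_unregistered(all_models, auto_mapped, IGNORE_NON_AUTO_CONFIGURED)

# With the pre-commit ignore list, the two new head classes would be flagged.
failures_without_fix = find_unregistered(
    all_models, auto_mapped, ["CLIPTextModel", "CLIPVisionModel"]
)
```

Under these assumptions, `failures` is empty, while `failures_without_fix` contains the two `...WithProjection` classes, which matches the commit message entry "add the head models in IGNORE_NON_AUTO_CONFIGURED".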