blip support for training (#21021)

* `blip` support for training

* remove labels creation

* remove unneeded `decoder_input_ids` creation

* final changes

- add colab link to documentation
- reduction = mean for loss

* fix nits

* update link

* clearer error message
This commit is contained in:
Younes Belkada
2023-01-18 11:24:37 +01:00
committed by GitHub
parent c8849583ad
commit 023f51fe16
4 changed files with 364 additions and 15 deletions

View File

@@ -31,6 +31,10 @@ However, most existing pre-trained models only excel in either understanding-bas
This model was contributed by [ybelkada](https://huggingface.co/ybelkada).
The original code can be found [here](https://github.com/salesforce/BLIP).
## Resources
- [Jupyter notebook](https://github.com/huggingface/notebooks/blob/main/examples/image_captioning_blip.ipynb) on how to fine-tune BLIP for image captioning on a custom dataset
## BlipConfig