NielsRogge
9c6f7485a6
Add GIT (GenerativeImage2Text) (#20295)
* First draft
* Make model instantiation work
* Fix copied from statement
* More fixes
* Add correct output head
* Improve configuration
* Add conversion script
* Improve conversion script
* Remove token_type_ids
* Fix conversion of projection layers
* Convert all weights
* Use cats image
* Make logits match
* Generate caption on cats image
* Add GITProcessor
* Update conversion script
* Add support for more checkpoints
* Fix conversion script
* Add initial tests
* Remove cross-attention
* More improvements
* Remove is_decoder
* Improve model tests
* Improve tests
* Improve model outputs
* Fix model outputs equivalence
* Fix more tests
* Remove unused code
* Use generate to generate text, no use of cache for now
* Use generate more appropriately
* Fix config tests
* Fix style
* Add support for use_cache
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Fix style
* Fix GIT vision encoder
* Update README
* Fix integration test
* Set bos and eos token ids
* Improve docs
* Improve code
* Add support for provided attention_mask
* Add copied from statement
* Fix gradient checkpointing test
* Set model_input_names
* Investigate model_input_names
* Remove script
* Fix model inputs
* Fix docstring
* Rename GIT to Git
* Support more models
* Add support for textvqa model
* Add video support
* Extend conversion script for video
* Add support for large variant
* Add support for more models
* Fix config archive map
* Update integration test
* Fix README
* Fix CLIP mean and std
* Update processor
* Fix use_cache for video, thanks @gante
* Remove print statements
* Remove assertion
* Add processor tests
* Fix model_input_names
* Use Auto API for processor
* Fix processor tests
* Fix integration test
* Fix pipeline test
* Make tests faster
* Update conversion script
* Update conversion script
* Convert more checkpoints
* Update conversion script
* Fix typo
* Update docstrings
* Improve code snippets
* Fix doc tests
* Add more code examplesé
* Fix doc tests
* Add integration tests
* Fix unused variable
* revert
* Add GIT to Japanese README
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-03 14:17:18 +01:00
..
2023-01-03 06:29:02 -05:00
2022-12-15 18:27:20 +00:00
2023-01-03 14:17:18 +01:00
2022-12-12 09:23:23 -05:00
2022-04-04 10:25:46 -04:00
2023-01-03 14:17:18 +01:00
2022-08-08 10:53:52 -05:00
2022-12-27 02:49:15 -05:00
2022-12-12 09:23:23 -05:00
2022-11-07 09:19:04 -05:00
2022-11-30 14:50:55 +00:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-11-07 09:19:04 -05:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-09-30 20:30:38 +01:00
2022-11-30 14:50:55 +00:00
2022-11-07 13:57:39 -05:00
2022-12-12 10:38:56 -05:00
2022-04-04 10:25:46 -04:00
2022-11-30 14:50:55 +00:00
2022-10-11 22:54:19 -04:00
2023-01-03 14:17:18 +01:00
2022-08-05 13:14:00 -04:00
2022-04-04 10:25:46 -04:00
2022-08-06 09:38:55 +02:00
2022-12-20 10:26:52 -08:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-05-16 13:36:41 +02:00
2022-12-01 16:52:13 +01:00
2022-12-01 16:52:13 +01:00
2022-12-01 16:52:13 +01:00
2022-06-23 14:51:54 +02:00
2022-11-28 08:54:35 -05:00
2022-11-28 08:54:35 -05:00
2022-11-30 10:41:18 -05:00
2022-12-08 11:18:52 -05:00
2022-11-30 10:41:18 -05:00
2022-11-30 10:41:18 -05:00
2022-11-30 10:41:18 -05:00
2022-09-06 15:20:12 -04:00
2022-11-30 14:50:55 +00:00
2022-12-13 19:08:30 +01:00
2022-12-06 10:47:31 +01:00
2022-08-09 18:22:55 -05:00
2022-12-05 11:49:43 -08:00
2022-12-26 02:06:23 -05:00
2022-08-06 09:38:55 +02:00
2022-04-04 10:25:46 -04:00
2022-12-23 01:30:57 -05:00
2022-11-30 14:50:55 +00:00
2022-11-15 09:40:06 -05:00
2022-11-15 01:18:13 +01:00
2022-10-03 13:18:29 -07:00
2022-12-12 16:14:04 +00:00
2022-11-30 14:50:55 +00:00