Raushan Turganbay
a29eabd0eb
Expand inputs in processors for VLMs (#30962)
* let it be
* draft
* should not have changed
* add warnings
* fix & add tests
* fix tests
* ipnuts embeds cannot be passed with pixels
* more updates
* paligemma ready!
* minor typos
* update blip-2
* fix tests & raise error
* docstring
* add blip2 test
* tmp
* add image seq length to config
* update docstring
* delete
* fix tests
* fix blip
* fix paligemma
* out-of-place scatter
* add llava-next-video
* Update src/transformers/models/blip_2/modeling_blip_2.py
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
* remove tmp
* codestyle
* nits
* more nits
* remove overriding in tests
* comprehension when merging video
* fix-copies
* revert changes for embeds test
* fix tests after making comprehension
* Update src/transformers/models/blip_2/processing_blip_2.py
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
* Update src/transformers/models/blip_2/processing_blip_2.py
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
* more updates
* fix tests
---------
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
2024-08-13 10:14:39 +05:00
..
2024-08-07 11:42:52 +02:00
2022-02-23 15:46:28 -05:00
2023-10-09 11:04:57 +02:00
2024-08-01 15:18:43 -04:00
2024-06-26 21:59:08 +01:00
2024-03-19 14:43:02 +00:00
2024-06-26 14:50:08 +01:00
2024-08-07 10:02:16 +05:00
2024-08-13 10:14:39 +05:00
2024-07-11 12:11:50 +01:00
2024-02-29 03:56:16 +01:00
2024-07-29 21:24:42 +08:00
2024-07-24 17:59:59 +02:00
2023-12-07 10:00:08 +01:00
2024-07-17 10:56:44 +01:00
2024-08-05 09:22:48 +02:00
2024-07-23 15:56:41 +02:00
2024-08-12 20:20:17 +01:00
2023-12-20 18:33:17 +00:00
2024-07-26 10:33:02 +02:00
2023-06-15 07:30:24 -04:00
2024-08-06 11:33:05 +01:00
2024-05-21 13:56:52 +01:00
2024-08-12 13:40:07 +05:00
2024-05-16 10:56:11 +01:00
2024-05-13 15:59:46 +01:00
2024-07-22 17:46:17 +01:00
2024-06-13 16:27:16 +02:00
2023-09-05 10:12:25 +02:00
2024-08-01 14:32:13 +02:00