Pablo Montalvo
a5bb528471
Fix signatures for processing kwargs (#35105)
* add conversion script
* remove pg2 refs
* fixup style
* small update
* get correct scaling
* add back missing bos
* fix missing config keys
* might revert this pos_embeddings
* fixup 9b config
* fix 9b
* fixup 9b conversion for good + add back num_hidden_layers
* add correct query scaling for 2b, 9b, 27b
* fixup 27b conversion
* Additional variant: 27b-896
* Use CPU for conversion to reduce GPU RAM requirements
* fix causal mask generation + formatting
* fix in-training causal mask generation edge case
* trigger CI
* update config
* update config
* update config
* update config
* update config
* update config
* update config
* update config
* update config
* move conversion file to main model dir
* handle multi-images + bos token
* address comments for input ids
* revert ci fixes
* [run-slow] paligemma
* fix
* [run-slow] paligemma
* skip end 2 end
* [run-slow] paligemma
---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-12-05 18:15:48 +01:00
..
2024-12-03 13:14:52 +01:00
2022-02-23 15:46:28 -05:00
2023-10-09 11:04:57 +02:00
2024-10-02 14:08:46 +01:00
2024-09-19 19:28:04 +01:00
2024-03-19 14:43:02 +00:00
2024-11-15 22:28:06 +01:00
2024-12-05 17:07:33 +01:00
2024-12-05 18:15:48 +01:00
2024-07-11 12:11:50 +01:00
2024-11-28 13:56:25 +01:00
2024-11-25 11:36:44 +01:00
2024-11-26 11:09:30 +01:00
2024-08-30 18:17:25 +02:00
2024-10-02 14:08:46 +01:00
2024-11-04 16:37:51 +01:00
2024-11-18 19:51:49 +01:00
2024-12-05 17:02:27 +01:00
2024-12-05 17:02:27 +01:00
2023-12-20 18:33:17 +00:00
2024-11-05 11:34:01 +01:00
2023-06-15 07:30:24 -04:00
2024-10-21 09:05:05 -04:00
2024-05-21 13:56:52 +01:00
2024-12-02 16:21:04 +01:00
2024-05-16 10:56:11 +01:00
2024-10-05 16:20:50 +02:00
2024-10-31 15:48:11 -04:00
2024-11-26 14:18:04 +00:00
2023-09-05 10:12:25 +02:00
2024-11-26 14:18:04 +00:00