Andy Vu
3b3ebcec40
Updated model card for OLMo2 ( #38394 )
...
* Updated OLMo2 model card
* added command line
* Add suggestions
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Added suggestions
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Indented code block as per suggestions
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-27 16:24:36 -07:00
Yoni Gozlan
f5307272f5
Falcon-H1 - Fix auto_docstring and add can_return_tuple decorator ( #38260 )
...
Fix auto_docstring and add can_return_tuple
2025-05-27 16:18:05 -04:00
Tanuj Rai
a092f6babf
Update granite.md ( #37791 )
...
* Update granite.md
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* update granite.md
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/granite.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* minor fixes
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-27 12:55:15 -07:00
RogerSinghChugh
be7aa3210b
New bart model card ( #37858 )
...
* Modified BART documentation wrt issue #36979.
* Modified BART documentation wrt issue #36979.
* fixed a typo.
* Update docs/source/en/model_doc/bart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bart.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* blank commit.
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-27 11:51:41 -07:00
RogerSinghChugh
587c1b0ed1
Updated BERTweet model card. ( #37981 )
...
* Updated BERTweet model card.
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* updated toctree (EN).
* Updated BERTweet model card.
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* updated toctree (EN).
* Updated BERTweet model card.
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/bertweet.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* updated toctree (EN).
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-27 11:51:22 -07:00
RogerSinghChugh
b73faef52f
Updated BigBird Model card as per #36979 . ( #37959 )
...
* Updated BigBird Model card as per #36979.
* Update docs/source/en/model_doc/big_bird.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/big_bird.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/big_bird.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/big_bird.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-27 11:24:28 -07:00
Madhav Kumar
538e847c06
Updated Zoedepth model card ( #37898 )
...
* Edited zoedepth model card according to specifications.
* Edited Zoedepth model file
* made suggested changes.
2025-05-27 10:06:53 -07:00
Parag Ekbote
4f7b0ff8d1
Update Model Card for Mamba-2 ( #37951 )
...
* update model page.
* update model page.
* Update docs/source/en/model_doc/mamba2.md
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com >
* update the model page.
* update.
* Apply suggestions from code review
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com >
* Apply the suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* add a quantization example and update the toctree.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* remove the additional comma
---------
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-27 10:06:39 -07:00
Cory Cornelius
9c50576860
[mllama] Allow pixel_values with inputs_embeds ( #38334 )
...
* Allow pixel_values and inputs_embeds at the same time
* remove unnecessary overwritten tests
2025-05-27 16:33:56 +00:00
Joao Gante
0f5a8243c4
[tests] remove overload for deleted test (test_offloaded_cache_implementation) ( #37896 )
...
* remove overload for deleted tests
* make fixup
2025-05-27 16:45:15 +01:00
Joao Gante
f85fd90407
[cleanup] delete deprecated kwargs in qwen2_audio 🧹 ( #38404 )
...
delete deprecated
2025-05-27 16:08:53 +01:00
eustlb
b9f8f863d9
[CSM] update model id ( #38211 )
...
* update model id
* codec_model eval
* add processor img
* use ungated repo for processor tests
2025-05-27 17:03:55 +02:00
ivarflakstad
07dd6b2495
Add report_repo_id to mi300 workflow ( #38401 )
2025-05-27 16:35:07 +02:00
eustlb
3142bd8592
[CSM] infer codec model with no_grad + audio eos label ( #38215 )
...
* infer codec model with no_grad
* codec_model eval
* training labels: add audio eos token
2025-05-27 14:10:17 +00:00
Ye Liu
10ae443ec0
Fix Qwen2.5-VL Video Processor ( #38366 )
...
* Update processing_qwen2_5_vl.py
* Update processing_qwen2_5_vl.py
* Update modular_qwen2_5_vl.py
* Fix CI
* Update modular_qwen2_5_vl.py
* Update processing_qwen2_5_vl.py
* Update video_processing_utils.py
2025-05-27 13:46:37 +02:00
Joao Gante
80902ae9b1
[chat] use the checkpoint's generation_config.json as base parameterization ( #38330 )
...
* use model gen config
* unwanted diff
2025-05-27 10:35:33 +00:00
hoshi-hiyouga
008e0d87c5
Fix convert to original state dict for VLMs ( #38385 )
...
* fix convert to original state dict
* fix
* lint
* Update modeling_utils.py
2025-05-27 10:27:59 +00:00
Joao Gante
c769483188
[chat] improvements for thinking models and reduce default verbosity ( #38322 )
...
misc improvements
2025-05-27 10:20:58 +00:00
Marc Sun
55f2333366
guard size mismatch check to only quantized models ( #38397 )
...
fix
2025-05-27 11:45:03 +02:00
Raushan Turganbay
1a5be2f5c0
[aya vision] fix processor for vLLM ( #38371 )
...
accidentally merged two PRs in one (;-_-)
2025-05-27 09:43:53 +00:00
Raushan Turganbay
19fdb75cf0
[video utils] group and reorder by number of frames ( #38374 )
...
fix
2025-05-27 11:32:33 +02:00
Raushan Turganbay
b0735dc0c1
[paligemma] fix processor with suffix ( #38365 )
...
fix pg processor
2025-05-27 11:31:56 +02:00
Raushan Turganbay
9e1017b479
[transformers x vLLM] standardize processors ( #37915 )
...
* standardize
* fix tests
* batch update some processors, not final yet
* oke, now I tested that everything indeed runs. Still needs prettification
* emu3
* fixup
* gemma3 but it doesn't generate anything
* fuyu
* update
* why?
* Update src/transformers/models/aya_vision/processing_aya_vision.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
* address comments
* bc
* why do we need to guard import this every time?
* i hate guarded imports
* i am blind
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
2025-05-27 11:30:30 +02:00
Cyril Vallez
b5ececb900
Fix image token mask in Gemma3 ( #38295 )
...
fix mask
2025-05-27 11:15:52 +02:00
Jitesh Gupta
c4e71e8fff
Add AMD MI300 CI caller leveraging self-hosted runner scale set workflow in hf-workflows ( #38132 )
2025-05-26 23:13:02 +02:00
Matt
706b00928f
Stop autoconverting custom code checkpoints ( #37751 )
...
* Stop autoconverting custom code checkpoints
* make fixup
* Better auto class detection
* Match the kwarg ordering
2025-05-26 19:15:28 +01:00
Yih-Dar
07848a8405
update gemma tests ( #38384 )
...
* update
* update
* update
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-26 19:54:04 +02:00
Joao Gante
cd0f3ce73b
[cli] cli usable without torch ( #38386 )
...
cli without torch
2025-05-26 16:54:18 +00:00
Matt
ba6d72226d
🚨 🚨 Fix custom code saving ( #37716 )
...
* Firstly: Better detection of when we're a custom class
* Trigger tests
* Let's break everything
* make fixup
* fix mistaken line doubling
* Let's try to get rid of it from config classes at least
* Let's try to get rid of it from config classes at least
* Fixup image processor
* no more circular import
* Let's go back to setting `_auto_class` again
* Let's go back to setting `_auto_class` again
* stash commit
* Revert the irrelevant changes until we figure out AutoConfig
* Change tests since we're breaking expectations
* make fixup
* do the same for all custom classes
* Cleanup for feature extractor tests
* Cleanup tokenization tests too
* typo
* Fix tokenizer tests
* make fixup
* fix image processor test
* make fixup
* Remove warning from register_for_auto_class
* Stop adding model info to auto map entirely
* Remove todo
* Remove the other todo
* Let's start slapping _auto_class on models why not
* Let's start slapping _auto_class on models why not
* Make sure the tests know what's up
* Make sure the tests know what's up
* Completely remove add_model_info_to_*
* Start adding _auto_class to models
* Start adding _auto_class to models
* Add a flaky decorator
* Add a flaky decorator and import
* stash commit
* More message cleanup
* make fixup
* fix indent
* Fix trust_remote_code prompts
* make fixup
* correct indentation
* Reincorporate changes into dynamic_module_utils
* Update call to trust_remote_code
* make fixup
* Fix video processors too
* Fix video processors too
* Remove is_flaky additions
* make fixup
2025-05-26 17:37:30 +01:00
Matt
701caef704
Stop TF weight rename ReDoS ( #38325 )
...
* let's try a non-regex solution
* make fixup
* Slight adjustment
* Let's just use the original code with a check
* slight tweak to conditional
* slight tweak to conditional
2025-05-26 16:58:51 +01:00
Judd
0a4e8e2855
fix typo: tokenizer -> tokenize ( #38357 )
2025-05-26 15:29:16 +00:00
Ragnar
63964b7c67
fix typos ( #38336 )
...
* Update video_processor.md
* Update deepseek_v3.md
2025-05-26 14:42:37 +00:00
Cyril Vallez
8b03c8eaf2
Better check in initialize_weights ( #38382 )
...
* Update modeling_utils.py
* CIs
* CIs
2025-05-26 16:20:23 +02:00
Yih-Dar
eb74cf977b
Use one utils/notification_service.py ( #38379 )
...
* step 1
* step 2
* step 3
* step 4
* step 5
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-26 16:15:29 +02:00
Arthur
98328fd9a1
for now disable compile ( #38383 )
2025-05-26 15:57:11 +02:00
Manuel de Prada Corral
78079abeff
Improved cache docs ( #38060 )
...
* improved cache docs
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-26 13:53:41 +00:00
Dhia Eddine Rhaiem
7a9b071bfd
[Falcon H1] Fix slow path forward pass ( #38320 )
...
* Create push-important-models.yml
* feat: add falcon-h1
* fixup
* address comment
* fix
* fix copies
* fix copies
* fix
* fix
* fix
* fix
* fix copies
* fix
* fix copies
* fix test import to at least trigger the CIs
* yups
* update
* fix make fix copies
* fix inits?
* fix style
* skip annoying test
* add integration test for Falcon H1
* fix copies
* fix
* fix typo
* make style
* fix slow path generations
* clean debug traces
* debug
* remove debug traces final confirmation
* clean debug traces final
* fix format and lineup
* make style
* debug
* Update src/transformers/models/falcon_h1/modular_falcon_h1.py
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com >
* address comments
* fix fix-copies
* fix integration test
* Merge pull request #7 from ydshieh/fix-slow-path
update
* another update (#8)
* update
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com >
Co-authored-by: Younes Belkada <younesbelkada@gmail.com >
Co-authored-by: younesbelkada <younes.belkada@tii.ae >
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com >
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com >
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com >
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-26 15:30:35 +02:00
Cyril Vallez
b5b76b5561
Protect get_default_device for torch<2.3 ( #38376 )
...
* Update modeling_utils.py
* CIs
2025-05-26 15:00:09 +02:00
Isotr0py
bff32678cc
Fix incorrect batching audio index calculation for Phi-4-Multimodal ( #38103 )
...
* fix
Signed-off-by: Isotr0py <2037008807@qq.com >
* add tests
Signed-off-by: Isotr0py <2037008807@qq.com >
* code format
Signed-off-by: Isotr0py <2037008807@qq.com >
* Update src/transformers/models/phi4_multimodal/feature_extraction_phi4_multimodal.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
---------
Signed-off-by: Isotr0py <2037008807@qq.com >
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
2025-05-26 12:41:31 +00:00
Cyril Vallez
9f0402bc4d
Fix all import errors based on older torch versions ( #38370 )
...
* Update masking_utils.py
* fix
* fix
* fix
* Update masking_utils.py
* Update executorch.py
* fix
2025-05-26 12:11:54 +02:00
Anton Vlasjuk
d03a3ca692
[OPT] Fix attention scaling ( #38290 )
...
* fix opt attention scaling
* add comment to why we do this
2025-05-26 11:02:16 +02:00
Yao Matrix
a5a0c7b888
switch to device agnostic device calling for test cases ( #38247 )
...
* use device agnostic APIs in test cases
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix style
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* add one more
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* xpu now supports integer device id, aligning to CUDA behaviors
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* update to use device_properties
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix style
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* update comment
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix comments
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix style
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
---------
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-26 10:18:53 +02:00
Raushan Turganbay
cba279f46c
[VLMs] add helpers for get/set embedding ( #38144 )
...
* add helpers in VLMs
* fix tied weight key test
2025-05-26 09:50:32 +02:00
Yih-Dar
6e3063422c
Uninstall kernels for AMD docker images ( #38354 )
...
Uninstall kernels for AMD docker images
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-25 19:42:25 +02:00
Yih-Dar
4a03044ddb
Hot fix for AMD CI workflow ( #38349 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-25 11:15:31 +02:00
Yih-Dar
d0c9c66d1c
new failure CI reports for all jobs ( #38298 )
...
* new failures
* report_repo_id
* report_repo_id
* report_repo_id
* More fixes
* More fixes
* More fixes
* ruff
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-24 19:15:02 +02:00
Kseniya Parkhamchuk
31f8a0fe8a
[docs]: update roformer.md model card ( #37946 )
...
* Update roformer model card
* fix example purpose description
* fix model description according to the comments
* revert changes for autodoc
* remove unneeded tags
* fix review issues
* fix hfoption
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-23 16:27:56 -07:00
Bryan C.
36f97ae15b
docs(swinv2): Update SwinV2 model card to new standard format ( #37942 )
...
* docs(swinv2): Update SwinV2 model card to new standard format
* docs(swinv2): Apply review suggestions
Incorporates feedback from @stevhliu to:
- Enhance the introductory paragraph with more details about scaling and SimMIM.
- Generalize the tip from "image classification tasks" to "vision tasks".
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-23 13:04:13 -07:00
Aguedo
33d23c39ed
Update BioGPT model card ( #38214 )
...
* Update BioGPT model card
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/model_doc/biogpt.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* correction for CPU fallback
* added quantization code and method
* fixed transformers-cli call
---------
Co-authored-by: Aguedo <aguedo@fakeemail.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-05-23 13:03:47 -07:00
Cheery
dffb118013
Remove duplicate docstring: resample ( #38305 )
...
Duplicate of the line above.
2025-05-23 13:02:58 -07:00