HuggingFace_transformer

Author	SHA1	Message	Date
Andy Vu	3b3ebcec40	Updated model card for OLMo2 (#38394 ) * Updated OLMo2 model card * added command line * Add suggestions Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Added suggestions Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Indented code block as per suggestions --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 16:24:36 -07:00
Yoni Gozlan	f5307272f5	Falcon-H1 - Fix auto_docstring and add can_return_tuple decorator (#38260 ) Fix auto_docstring and add can_return_tuple	2025-05-27 16:18:05 -04:00
Tanuj Rai	a092f6babf	Update granite.md (#37791 ) * Update granite.md * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update granite.md * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * minor fixes --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 12:55:15 -07:00
RogerSinghChugh	be7aa3210b	New bart model card (#37858 ) * Modified BART documentation wrt to issue #36979. * Modified BART documentation wrt to issue #36979. * fixed a typo. * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * blank commit. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 11:51:41 -07:00
RogerSinghChugh	587c1b0ed1	Updated BERTweet model card. (#37981 ) * Updated BERTweet model card. * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * updated toctree (EN). * Updated BERTweet model card. * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * updated toctree (EN). * Updated BERTweet model card. * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * updated toctree (EN). --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 11:51:22 -07:00
RogerSinghChugh	b73faef52f	Updated BigBird Model card as per #36979 . (#37959 ) * Updated BigBird Model card as per #36979. * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 11:24:28 -07:00
Madhav Kumar	538e847c06	Updated Zoedepth model card (#37898 ) * Edited zoedepth model card according to specifications. * Edited Zoedepth model file * made suggested changes.	2025-05-27 10:06:53 -07:00
Parag Ekbote	4f7b0ff8d1	Update Model Card for Mamba-2 (#37951 ) * update model page. * update model page. * Update docs/source/en/model_doc/mamba2.md Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> * update the model page. * update. * Apply suggestions from code review Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> * Apply the suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * add an quantization example and update the toctree. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * remove the additional comma --------- Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 10:06:39 -07:00
Cory Cornelius	9c50576860	[mllama] Allow `pixel_values` with `inputs_embeds` (#38334 ) * Allow pixel_values and inputs_embeds at the same time * remove unnecessary overwritten tests	2025-05-27 16:33:56 +00:00
Joao Gante	0f5a8243c4	[tests] remove overload for deleted test (`test_offloaded_cache_implementation`) (#37896 ) * remove overload for deleted tests * make fixup	2025-05-27 16:45:15 +01:00
Joao Gante	f85fd90407	[cleanup] delete deprecated kwargs in qwen2_audio 🧹 (#38404 ) delete deprecated	2025-05-27 16:08:53 +01:00
eustlb	b9f8f863d9	[CSM] update model id (#38211 ) * update model id * codec_model eval * add processor img * use ungated repo for processor tests	2025-05-27 17:03:55 +02:00
ivarflakstad	07dd6b2495	Add report_repo_id to mi300 workflow (#38401 )	2025-05-27 16:35:07 +02:00
eustlb	3142bd8592	[CSM] infer codec model with no_grad + audio eos label (#38215 ) * infer codec model with no_grad * codec_model eval * training labels: add audio eos token	2025-05-27 14:10:17 +00:00
Ye Liu	10ae443ec0	Fix Qwen2.5-VL Video Processor (#38366 ) * Update processing_qwen2_5_vl.py * Update processing_qwen2_5_vl.py * Update modular_qwen2_5_vl.py * Fix CI * Update modular_qwen2_5_vl.py * Update processing_qwen2_5_vl.py * Update video_processing_utils.py	2025-05-27 13:46:37 +02:00
Joao Gante	80902ae9b1	[chat] use the checkpoint's `generation_config.json` as base parameterization (#38330 ) * use model gen config * unwanted diff	2025-05-27 10:35:33 +00:00
hoshi-hiyouga	008e0d87c5	Fix convert to original state dict for VLMs (#38385 ) * fix convert to original state dict * fix * lint * Update modeling_utils.py	2025-05-27 10:27:59 +00:00
Joao Gante	c769483188	[chat] improvements for thinking models and reduce default verbosity (#38322 ) misc improvements	2025-05-27 10:20:58 +00:00
Marc Sun	55f2333366	guard size mismatch check to only quantized models (#38397 ) fix	2025-05-27 11:45:03 +02:00
Raushan Turganbay	1a5be2f5c0	[aya vision] fix processor for vLLM (#38371 ) accidentally merged two PRs in one (；－＿－)	2025-05-27 09:43:53 +00:00
Raushan Turganbay	19fdb75cf0	[video utils] group and reorder by number of frames (#38374 ) fix	2025-05-27 11:32:33 +02:00
Raushan Turganbay	b0735dc0c1	[paligemma] fix processor with suffix (#38365 ) fix pg processor	2025-05-27 11:31:56 +02:00
Raushan Turganbay	9e1017b479	[transformers x vLLM] standardize processors (#37915 ) * standardize * fix tests * batch update some processors, not final yet * oke, now I tested that everything indeed runs. Still needs prettification * emu3 * fixup * gemma3 but it doesn't generate anything * fuyu * update * why? * Update src/transformers/models/aya_vision/processing_aya_vision.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * address comments * bc * why do we need to guard import this every time? * i hate guarded imports * i am blind --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-27 11:30:30 +02:00
Cyril Vallez	b5ececb900	Fix image token mask in Gemma3 (#38295 ) fix mask	2025-05-27 11:15:52 +02:00
Jitesh Gupta	c4e71e8fff	Add AMD MI300 CI caller leveraging self-hosted runner scale set workflow in hf-workflows (#38132 )	2025-05-26 23:13:02 +02:00
Matt	706b00928f	Stop autoconverting custom code checkpoints (#37751 ) * Stop autoconverting custom code checkpoints * make fixup * Better auto class detection * Match the kwarg ordering	2025-05-26 19:15:28 +01:00
Yih-Dar	07848a8405	update gemma tests (#38384 ) * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 19:54:04 +02:00
Joao Gante	cd0f3ce73b	[cli] cli usable without torch (#38386 ) cli without torch	2025-05-26 16:54:18 +00:00
Matt	ba6d72226d	🚨 🚨 Fix custom code saving (#37716 ) * Firstly: Better detection of when we're a custom class * Trigger tests * Let's break everything * make fixup * fix mistaken line doubling * Let's try to get rid of it from config classes at least * Let's try to get rid of it from config classes at least * Fixup image processor * no more circular import * Let's go back to setting `_auto_class` again * Let's go back to setting `_auto_class` again * stash commit * Revert the irrelevant changes until we figure out AutoConfig * Change tests since we're breaking expectations * make fixup * do the same for all custom classes * Cleanup for feature extractor tests * Cleanup tokenization tests too * typo * Fix tokenizer tests * make fixup * fix image processor test * make fixup * Remove warning from register_for_auto_class * Stop adding model info to auto map entirely * Remove todo * Remove the other todo * Let's start slapping _auto_class on models why not * Let's start slapping _auto_class on models why not * Make sure the tests know what's up * Make sure the tests know what's up * Completely remove add_model_info_to_* * Start adding _auto_class to models * Start adding _auto_class to models * Add a flaky decorator * Add a flaky decorator and import * stash commit * More message cleanup * make fixup * fix indent * Fix trust_remote_code prompts * make fixup * correct indentation * Reincorporate changes into dynamic_module_utils * Update call to trust_remote_code * make fixup * Fix video processors too * Fix video processors too * Remove is_flaky additions * make fixup	2025-05-26 17:37:30 +01:00
Matt	701caef704	Stop TF weight rename reDOS (#38325 ) * let's try a non-regex solution * make fixup * Slight adjustment * Let's just use the original code with a check * slight tweak to conditional * slight tweak to conditional	2025-05-26 16:58:51 +01:00
Judd	0a4e8e2855	fix typo: `tokenizer` -> `tokenize` (#38357 )	2025-05-26 15:29:16 +00:00
Ragnar	63964b7c67	fix typos (#38336 ) * Update video_processor.md * Update deepseek_v3.md	2025-05-26 14:42:37 +00:00
Cyril Vallez	8b03c8eaf2	Better check in `initialize_weights` (#38382 ) * Update modeling_utils.py * CIs * CIs	2025-05-26 16:20:23 +02:00
Yih-Dar	eb74cf977b	Use one `utils/notification_service.py` (#38379 ) * step 1 * step 2 * step 3 * step 4 * step 5 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 16:15:29 +02:00
Arthur	98328fd9a1	for now disable compile (#38383 )	2025-05-26 15:57:11 +02:00
Manuel de Prada Corral	78079abeff	Improved cache docs (#38060 ) * improved cache docs Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-26 13:53:41 +00:00
Dhia Eddine Rhaiem	7a9b071bfd	[Falcon H1] Fix slow path forward pass (#38320 ) * Create push-important-models.yml * feat: add falcon-h1 * fixup * address comment * fix * fix copies * fix copies * fix * fix * fix * fix * fix copies * fix * fix copies * fix test import to at least trigget the cis * yups * update * fix make fix copies * fix inits? * fix style * skip annoying test * add integration test for Falcon H1 * fix copies * fix * fix typo * make style * fix slow path generations * clean debug traces * debug * remove debug traces final confirmation * clean debug traces final * fix format and lineup * make style * debug * Update src/transformers/models/falcon_h1/modular_falcon_h1.py Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> * adress comments * fix fix-copies * fix integration test * Merge pull request #7 from ydshieh/fix-slow-path update * another update (#8) * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Younes Belkada <younesbelkada@gmail.com> Co-authored-by: younesbelkada <younes.belkada@tii.ae> Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 15:30:35 +02:00
Cyril Vallez	b5b76b5561	Protect `get_default_device` for torch<2.3 (#38376 ) * Update modeling_utils.py * CIs	2025-05-26 15:00:09 +02:00
Isotr0py	bff32678cc	Fix incorrect batching audio index calculation for Phi-4-Multimodal (#38103 ) * fix Signed-off-by: Isotr0py <2037008807@qq.com> * add tests Signed-off-by: Isotr0py <2037008807@qq.com> * code format Signed-off-by: Isotr0py <2037008807@qq.com> * Update src/transformers/models/phi4_multimodal/feature_extraction_phi4_multimodal.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Signed-off-by: Isotr0py <2037008807@qq.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-26 12:41:31 +00:00
Cyril Vallez	9f0402bc4d	Fix all import errors based on older torch versions (#38370 ) * Update masking_utils.py * fix * fix * fix * Update masking_utils.py * Update executorch.py * fix	2025-05-26 12:11:54 +02:00
Anton Vlasjuk	d03a3ca692	[`OPT`] Fix attention scaling (#38290 ) * fix opt attention scaling * add comment to why we do this	2025-05-26 11:02:16 +02:00
Yao Matrix	a5a0c7b888	switch to device agnostic device calling for test cases (#38247 ) * use device agnostic APIs in test cases Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix style Signed-off-by: Matrix Yao <matrix.yao@intel.com> * add one more Signed-off-by: YAO Matrix <matrix.yao@intel.com> * xpu now supports integer device id, aligning to CUDA behaviors Signed-off-by: Matrix Yao <matrix.yao@intel.com> * update to use device_properties Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix style Signed-off-by: Matrix Yao <matrix.yao@intel.com> * update comment Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix comments Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix style Signed-off-by: Matrix Yao <matrix.yao@intel.com> --------- Signed-off-by: Matrix Yao <matrix.yao@intel.com> Signed-off-by: YAO Matrix <matrix.yao@intel.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 10:18:53 +02:00
Raushan Turganbay	cba279f46c	[VLMs] add helpers for get/set embedding (#38144 ) * add helpers in VLMs * fix tied weight key test	2025-05-26 09:50:32 +02:00
Yih-Dar	6e3063422c	Uninstall `kernels` for AMD docker images (#38354 ) Uninstall kernels for AMD docker images Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-25 19:42:25 +02:00
Yih-Dar	4a03044ddb	Hot fix for AMD CI workflow (#38349 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-25 11:15:31 +02:00
Yih-Dar	d0c9c66d1c	new failure CI reports for all jobs (#38298 ) * new failures * report_repo_id * report_repo_id * report_repo_id * More fixes * More fixes * More fixes * ruff --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-24 19:15:02 +02:00
Kseniya Parkhamchuk	31f8a0fe8a	[docs]: update roformer.md model card (#37946 ) * Update roformer model card * fix example purpose description * fix model description according to the comments * revert changes for autodoc * remove unneeded tags * fix review issues * fix hfoption --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-23 16:27:56 -07:00
Bryan C.	36f97ae15b	docs(swinv2): Update SwinV2 model card to new standard format (#37942 ) * docs(swinv2): Update SwinV2 model card to new standard format * docs(swinv2): Apply review suggestions Incorporates feedback from @stevhliu to: - Enhance the introductory paragraph with more details about scaling and SimMIM. - Generalize the tip from "image classification tasks" to "vision tasks". Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-23 13:04:13 -07:00
Aguedo	33d23c39ed	Update BioGPT model card (#38214 ) * Update BioGPT model card * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * correction for CPU fallback * added quantization code and method * fixed transformers-cli call --------- Co-authored-by: Aguedo <aguedo@fakeemail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-23 13:03:47 -07:00
Cheery	dffb118013	Remove duplicate docstring: resample (#38305 ) Duplicate of the line above.	2025-05-23 13:02:58 -07:00

1 2 3 4 5 ...

19115 Commits