HuggingFace_transformer

Author	SHA1	Message	Date
Arthur Zucker	052e652d6d	v4.46.3 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details Secret Leaks / trufflehog (push) Has been cancelled Details v4.46.3	2024-11-18 21:10:52 +01:00
Wing Lian	e01a61aeab	FSDP grad accum fix (#34645 ) * add gradient accumulation steps tests for fsdp * invert no_sync context to fix training for fsdp	2024-11-18 20:07:21 +01:00
Joao Gante	ccbd57a8b6	MPS: `isin_mps_friendly` can support 0D tensors (#34538 ) Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details Secret Leaks / trufflehog (push) Has been cancelled Details * apply fix * tested * make fixup v4.46.2	2024-11-05 19:03:55 +01:00
Arthur Zucker	e66224b544	v4.46.2	2024-11-05 19:02:59 +01:00
Zach Mueller	8c62a92b3c	Update trainer for easier handling of accumulate, compile fixes, and proper reporting (#34511 ) * Update trainer for easier handling of accumulate + proper reporting * test * Fixup tests * Full fix * Fix style * rm comment * Fix tests * Minimize test + remove py 311 check * Unused import * Forward contrib credits from discussions * Fix reported metrics * Refactor, good as it's going to get * rm pad tok id check * object detection and audio are being annoying * Fin * Fin x2 --------- Co-authored-by: Gyanateet Dutta <Ryukijano@users.noreply.github.com>	2024-11-05 19:02:12 +01:00
kang sheng	5b36cdabf5	enable average tokens across devices (#34373 ) * enable average tokens across devices * reduce earlier in case model needs it * simplify if statement * reformat code to make ruff happy * add doc for argument: average_tokens_across_devices * cannot find world size when pytorch is unavailable * format code --------- Co-authored-by: Zach Mueller <muellerzr@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-11-05 19:01:48 +01:00
Pablo Montalvo	f784d95c0f	fix pixtral processor (#34486 ) * fix pixtral processor * test out full length batches + remove undue ValueError * fix up processing * fix tests * fix * last fixup * style * [run-slow] pixtral * [run-slow] pixtral * fix config key * skip torchscript tests * [run-slow] pixtral * add missing key * [run-slow] pixtral * fix docs * [run-slow] pixtral * fix wrong url for integration test * [run-slow] pixtral * pixtralVisionModel does not have a lm head * [run-slow] pixtral	2024-11-05 17:58:38 +01:00
Raushan Turganbay	7da0eefc27	VLMs: fix number of image tokens (#34332 ) * fix * fix tests * add tests * style * style * fix qwen after rebase * fix video llava	2024-11-05 17:58:38 +01:00
Arthur Zucker	bc598c00db	v4.41.1 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details Secret Leaks / trufflehog (push) Has been cancelled Details v4.46.1	2024-10-29 15:22:01 +01:00
Marc Sun	94ed13c1de	Fix regression loading dtype (#34409 ) * fix regression * add test for torchao * expected output * better fix	2024-10-29 15:21:39 +01:00
Raushan Turganbay	72c716de92	LLaVA: latency issues (#34460 ) * fix llavas * code style * green ci	2024-10-29 15:21:05 +01:00
Ilyas Moutawwakil	97bb9299c4	Fix pix2struct (#34374 ) * fix * fix and test use_cache test * style * remove atol	2024-10-29 15:21:05 +01:00
Ilyas Moutawwakil	565f0e97c2	Fix onnx non-expotable inplace aten op (#34376 ) * fix onnx non-expotable inplace op * mistral, qwen2, qwen2_vl, starcoder2 * fixup copies	2024-10-29 15:21:05 +01:00
Michael Benayoun	dcfe3c7e61	Fix `torch.fx` issue related to the new `loss_kwargs` keyword argument (#34380 ) * Fix FX * Unskip tests	2024-10-29 15:21:05 +01:00
Arthur Zucker	c2820c9491	fix list 3.8 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details Secret Leaks / trufflehog (push) Has been cancelled Details v4.46.0	2024-10-24 10:03:56 +02:00
Arthur Zucker	b298161146	v4.46.0	2024-10-24 09:35:05 +02:00
Joao Gante	b0f0c61899	Add SynthID (watermerking by Google DeepMind) (#34350 ) * Add SynthIDTextWatermarkLogitsProcessor * esolving comments. * Resolving comments. * esolving commits, * Improving SynthIDWatermark tests. * switch to PT version * detector as pretrained model + style * update training + style * rebase * Update logits_process.py * Improving SynthIDWatermark tests. * Shift detector training to wikitext negatives and stabilize with lower learning rate. * Clean up. * in for 7B * cleanup * upport python 3.8. * README and final cleanup. * HF Hub upload and initiaze. * Update requirements for synthid_text. * Adding SynthIDTextWatermarkDetector. * Detector testing. * Documentation changes. * Copyrights fix. * Fix detector api. * ironing out errors * ironing out errors * training checks * make fixup and make fix-copies * docstrings and add to docs * copyright * BC * test docstrings * move import * protect type hints * top level imports * watermarking example * direct imports * tpr fpr meaning * process_kwargs * SynthIDTextWatermarkingConfig docstring * assert -> exception * example updates * no immutable dict (cant be serialized) * pack fn * einsum equivalent * import order * fix test on gpu * add detector example --------- Co-authored-by: Sumedh Ghaisas <sumedhg@google.com> Co-authored-by: Marc Sun <marc@huggingface.co> Co-authored-by: sumedhghaisas2 <138781311+sumedhghaisas2@users.noreply.github.com> Co-authored-by: raushan <raushan@huggingface.co>	2024-10-23 21:18:52 +01:00
Arthur	e50bf61dec	Fix red CI: benchmark script (#34351 ) * dont'trigger always * fux * oups * update * ?? * ? * aie	2024-10-23 18:33:52 +02:00
Yih-Dar	c42b3223db	skip `test_pipeline_depth_estimation` temporarily (#34316 ) skip Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-10-23 17:27:51 +02:00
Zach Mueller	d9f733625c	Enable Gradient Accumulation fix across all models + trainer fully in forward() (#34283 ) * Enable grad accum fix across all models + trainer fully in forward() * handle peft case * Account for DDP: need to run scale tests * Use accelerator state * Quality * Guard * Experiment w/ only fairseq fix * Fairseq only * Revert multiply_grads fix * Mult by grad accum to fully bring back solution * Style * Good to go now * Skip fx tests for now * Bookmark * Working now	2024-10-23 11:24:57 -04:00
Aymeric Roucher	1fb575fcf0	Support boolean tool args (#34208 ) Support boolean tool arguments	2024-10-23 16:48:21 +02:00
Filippos Ventirozos	343c8cb86f	Added Deberta model type support (#34308 ) * Added Deberta model type for 'add_prefix_space' functionality * housekeeping --------- Co-authored-by: Filippos Ventirozos <filippos.ventirozos@autotrader.co.uk>	2024-10-23 11:15:36 +02:00
Steven Liu	5ba85de7a4	[docs] Fix Korean toctree (#34324 ) fix	2024-10-23 10:52:51 +02:00
Vijay	049682a5a6	Example doc for token classification of Llama and Dependent/Copied Models (#34139 ) * Added Example Doc for token classification on all tokenClassificationModels copied from llama * Refactor code to add code sample docstrings for Gemma and Gemma2 models (including modular Gemma) * Refactor code to update model checkpoint names for Qwen2 models	2024-10-22 10:26:16 -07:00
wony617	644d5287b2	🌐 [i18n-KO] Translated `model_doc/bartpho.md` to Korean (#33981 ) * docs: ko: model_doc/bartpho.md * feat: nmt draft * Update docs/source/ko/model_doc/bartpho.md * Update docs/source/ko/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-22 09:46:52 -07:00
Ahnjj_DEV	b03dc0a87e	🌐 [i18n-KO] Translated `bert japanese.md` to Korean (#33890 ) * docs: ko: bert-japanese.md * Update _toctree.yml * fix: manual edits * Update docs/source/ko/_toctree.yml Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com> * Update docs/source/ko/_toctree.yml Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com> --------- Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-22 09:46:31 -07:00
Ahnjj_DEV	4b14aa1bcd	🌐 [i18n-KO] Translated `executorch.md` to Korean (#33888 ) * docs: ko: executorch.md * Update _toctree.yml * fix: manual edits * Update docs/source/ko/main_classes/executorch.md Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com> * Update docs/source/ko/_toctree.yml Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com> * Update docs/source/ko/_toctree.yml * Update docs/source/ko/_toctree.yml * Update docs/source/ko/_toctree.yml --------- Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com> Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-22 09:46:20 -07:00
Fanli Lin	688eeac81e	[docs] fix typo (#34235 ) fix typo	2024-10-22 09:46:07 -07:00
Mansu Kim	a65a6ce7fe	fix error in _get_eval_sampler when group_by_length enabled (#34237 ) * remove self in _get_eval_sampler * remove self in front of _get_eval_sampler	2024-10-22 18:02:42 +02:00
Yoni Gozlan	e7c3fa7f57	Fix continue_final_message for image-text-to-text chat templates (#34236 ) * fix continue_final_message for vlms * Add one test for vlms continue_final_message chat template	2024-10-22 11:57:44 -04:00
Chinedum Echeta	96f67c068b	Feature: Add `MLFLOW_MAX_LOG_PARAMS` to `MLflowCallback` (#34279 )	2024-10-22 16:34:17 +02:00
Michael Kamerath	eef6b0ba42	Add option for running ffmpeg_microphone_live as a background process (#32838 ) * Add option for running ffmpeg_microphone_live as a background process * Code quality checks for audio_utils * Code clean up for audio_utils * Fixing logic in ffmpeg_microphone calls in audio_utils * Allowing any arbitrary arguments to be passed to ffmpeg_microphone_live * Formatting * Fixing last problems with adding ffmpeg_additional_args * Fixing default arguments and formatting issues * Fixing comments for ffmpeg_additional_args * Adding two shorts tests for ffmpeg_microphone_live * Fixing test bug	2024-10-22 15:56:41 +02:00
Guang Yang	c14ccbcd64	Olmo is ExecuTorch Compatible (#34181 ) Co-authored-by: Guang Yang <guangyang@fb.com>	2024-10-22 15:53:01 +02:00
Guang Yang	7a08a772cc	Qwen2.5 is ExecuTorch Compatible (#34102 ) Qwen2 is ExecuTorch Compatible Co-authored-by: Guang Yang <guangyang@fb.com>	2024-10-22 15:52:23 +02:00
Alexandros Benetatos	c31a6ff474	Add post_process_depth_estimation to image processors and support ZoeDepth's inference intricacies (#32550 ) * add colorize_depth and matplotlib availability check * add post_process_depth_estimation for zoedepth + tests * add post_process_depth_estimation for DPT + tests * add post_process_depth_estimation in DepthEstimationPipeline & special case for zoedepth * run `make fixup` * fix import related error on tests * fix more import related errors on test * forgot some `torch` calls in declerations * remove `torch` call in zoedepth tests that caused error * updated docs for depth estimation * small fix for `colorize` input/output types * remove `colorize_depth`, fix various names, remove matplotlib dependency * fix formatting * run fixup * different images for test * update examples in `forward` functions * fixed broken links * fix output types for docs * possible format fix inside `<Tip>` * Readability related updates Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * Readability related update * cleanup after merge * refactor `post_process_depth_estimation` to return dict; simplify ZoeDepth's `post_process_depth_estimation` * rewrite dict merging to support python 3.8 --------- Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2024-10-22 15:50:54 +02:00
pbelcak	104599d7a8	Fix: tensor of examples of the same length triggers invalid stacking (#34166 ) * Fix issue where tensor of examples of the same length triggers invalid stacking * Update data_collator.py	2024-10-22 15:49:21 +02:00
Cyril Vallez	51e395d13e	Fix FA2 attention for models supporting sliding window (#34093 ) Fix FA2	2024-10-22 15:37:21 +02:00
HALLOUARD	eb6a734995	[RT-DETR] Fix onnx inference bug for Optype (Where) (#33877 ) * feat: [RT-DETR] Add onnx runtime config and fix onnx inference bug Optype (Where) * fix lint * use dtype istead of torch.float32 * add doc * remove onnx config * use dtype info * use tensor to fix lint	2024-10-22 15:14:07 +02:00
Marc Sun	84b17e03f1	Update PR templates (#34065 ) update PR template	2024-10-22 15:11:54 +02:00
Matt	681fc43713	Sync video classification pipeline with huggingface_hub spec (#34288 ) * Sync video classification pipeline * Add disclaimer	2024-10-22 13:33:49 +01:00
regisss	93352e81f5	Fix Korean doc _toctree.yml (#34293 ) Fix korean doc _toctree.yml	2024-10-22 11:05:56 +02:00
Steven Liu	b644178ed4	[docs] Fix GenerationConfig params (#34299 ) fix generationconfigs	2024-10-22 11:03:25 +02:00
Raushan Turganbay	73d65e637b	T5 compile compatibilty (#34089 ) * this worked in normal generation, needs more tests * fix almost all tests in t5 * nit * longt5, umt5, mt5 * style * udop, pix2struct * more models * fix some tests * fix onnx tests * tracing tests fixed * compile enabled and tested for t5 models * fix small bug in slow tests * [run-slow] t5 * uncomment * style * update with new generation refactoring * nit * fix copies * this is the fix, had to change t5 to fix copies * update * [run-slow] t5 * [run-slow] t5 * update * add test for encoder only T5 * clean up after rebase * fix pop2piano * add comment * style * fix copies after rebase * fix copies missed this one	2024-10-22 08:23:53 +02:00
Raushan Turganbay	5077bc034f	VLM: add more modularity (#34175 ) * update * fix tests + fix copies * fix tests once more	2024-10-22 07:56:35 +02:00
Raushan Turganbay	21d5025826	Attn implementation for composite models (#32238 ) * first try * codestyle * idefics2 is happy * [run-slow] llava, llava_next, video_llava, vipllava, llava_next_video, idefics, idefics2, kosmos2, fuyu, blip, blip_2, instructblip, instructblipvideo, paligemma * fix-copies * [run-slow] llava, llava_next, video_llava, vipllava, llava_next_video, idefics, idefics2, kosmos2, fuyu, blip, blip_2, instructblip, instructblipvideo * blip-2 needs to init vision from config * when was this removed O_o * minor fix * tests * this way? * tests * model-agnostic code * codestyle * add tests for idefics * modify general test for VLMs * no generation test for vlm yet! * no generation test here also * wanr in VIT-SDPA if output attn * add more tests * user can pass dict as attn impl * repo consistency * update * muicgen * no prints * forgot speech enc-dec and clip * how many composite models we have? * musicgen meelody is same as mudicgen * +siglip * fix tests + add some more * remove idefics custom overriden code * make idefics2 automappable * nits * skip tests * doctests * Update src/transformers/models/idefics2/configuration_idefics2.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/models/clip/test_modeling_clip.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/models/idefics2/test_modeling_idefics2.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/models/idefics2/test_modeling_idefics2.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/configuration_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * major update, no need for automap * clean up * add FA2 test * more tests * style * skip tests * why did these started failing now? * no attributes for FA2 needed * one tiny test * address comment about FA2 false warning * style * add new models and resolve conflicts * fix copies * let it be this way for now, come back tomorrow to review * some more fixes * update * more updates * update * fix copies * style and tests * another big update * fix tests * fix tests * update * another update * fix tests * fix copies * fix tests --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-10-22 06:54:44 +02:00
Andrés Marafioti	32590b5ecb	Fix method name which changes in tutorial (#34252 ) The method `model_download_tool` was called `model_download_counter` earlier in the tutorial, this raises an error when following the code.	2024-10-21 14:21:52 -03:00
Matt	f701b98e4a	Add a doc section on writing generation prompts (#34248 ) Add a section on writing generation prompts	2024-10-21 14:35:57 +01:00
Yoni Gozlan	a4122813d1	Add DetrImageProcessorFast (#34063 ) * add fully functionning image_processing_detr_fast * Create tensors on the correct device * fix copies * fix doc * add tests equivalence cpu gpu * fix doc en * add relative imports and copied from * Fix copies and nit	2024-10-21 09:05:05 -04:00
Yoni Gozlan	24bdc94da5	Change Paligemma import logging to work with modular (#34211 ) * change import logging * fix CI	2024-10-21 08:55:27 -04:00
Raushan Turganbay	ca541bd4f4	Generation tests: don't rely on main input name (#34228 ) * don't rely on main input name * update	2024-10-21 10:00:14 +02:00

1 2 3 4 5 ...

17245 Commits