Arthur
9c641dc161
v4.54.1
Secret Leaks / trufflehog (push) Has been cancelled
4.54.1
2025-07-29 17:32:03 +02:00
Manuel de Prada Corral
b04aae7398
Fix Cache.max_cache_len max value for Hybrid models ( #39737 )
...
* fix gemma
* fix min
* fix quant init issue
* fix gemma 3n
* skip quant cache test
* fix modular
* new test for Gemma
* include cyril change
---------
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com >
2025-07-29 17:29:33 +02:00
Raushan Turganbay
0297e595c1
[modenbert] fix regression ( #39750 )
...
* fix regression
* add FA2 test
2025-07-29 17:29:33 +02:00
Cyril Vallez
b8e1b282a9
Fix version issue in modeling_utils.py ( #39759 )
...
fix version issue
2025-07-29 17:29:33 +02:00
Raushan Turganbay
166c1b47db
Fix GPT2 with cross attention ( #39754 )
...
* fix
* use new mask API
* style
* fix copies and attention tests
* fix head pruning tests
2025-07-29 17:29:33 +02:00
Manuel de Prada Corral
ab2a6091d9
Fix mamba regression ( #39728 )
...
* fix mamba regression
* fix compile test
2025-07-29 17:29:33 +02:00
Matej Sirovatka
3a6d13c887
Fix: add back base model plan ( #39733 )
...
* Fix: add back base model plan
* Fix: typo
* fixup
* remove unused import
---------
Co-authored-by: Arthur <arthur.zucker@gmail.com >
2025-07-29 17:29:33 +02:00
Arthur
a033ae4876
fix cache inheritance ( #39748 )
...
* fix cache inheritance
* styule
2025-07-29 17:29:33 +02:00
Raushan Turganbay
457b478e4a
Fix cache-related tests ( #39676 )
...
* fix
* fix kyutai at last
* fix unrelated tests and copies
* update musicgen as well
* revert tensor
* fix old test failures
* why it wasn't added?
2025-07-29 17:29:33 +02:00
Cyril Vallez
862cb55017
Fix Layer device placement in Caches ( #39732 )
...
* fix device placement
* style
* typo in comment
2025-07-29 17:29:33 +02:00
Matej Sirovatka
67fd36fcc8
PATCH: add back n-dim device-mesh + fix tp trainer saving ( #39693 )
...
* Feat: something
* Feat: initial changes
* tmp changes to unblock
* Refactor
* remove todo
* Feat: docstring
* Fix: saving of distributed model in trainer
* Fix: distributed saving with trainer
* Feat: add pure tp saving
* Only require tp dim if ndim > 1
* Fix: default to None
* Fix: better comments/errors
* Fix: properly check tp_size attribute
* Fix: properly check for None in tp_size
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com >
2025-07-29 17:29:33 +02:00
Wing Lian
709c6fd008
fix missing model._tp_size from ep refactor ( #39688 )
...
* fix missing model._tp_size from ep refactor
* restore setting device_mesh too
2025-07-29 17:29:32 +02:00
Arthur
3fd456b200
v4.54-release
Release - Conda / build_and_package (push) Has been cancelled
Secret Leaks / trufflehog (push) Has been cancelled
v4.54-release
v4.54.0
2025-07-25 20:44:40 +02:00
Garrett Goon
97f8c71f52
Add padding-free to Granite hybrid moe models ( #39677 )
...
* start fixing kwarg handling
* fmt
* updates padding free tests
* docs
* add missing kwargs modeling_granitemoe.py
* run modular util
* rm unrelated changes from modular util
2025-07-25 20:10:50 +02:00
Cyril Vallez
d6e9f71a6e
Fix tied weight test ( #39680 )
...
Update test_modeling_common.py
2025-07-25 20:09:33 +02:00
bigmoyan
5da6ad2731
fix break for ckpt without _tp_plan ( #39658 )
...
* fix break for ckpt without _tp_plan
* Update src/transformers/modeling_utils.py
* Update src/transformers/modeling_utils.py
---------
Co-authored-by: wangzhengtao <wangzhengtao@msh.team >
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
2025-07-25 20:03:48 +02:00
lgai-exaone
c06d4cd6ce
Add EXAONE 4.0 model ( #39129 )
...
* Add EXAONE 4.0 model
* Refactor EXAONE 4.0 modeling code
* Fix cache slicing on SWA + FA2
* Fix cache slicing on FA2 + HybridCache
* Update EXAONE 4.0 modeling code for main branch
* Update o_proj for asymmetric projection
* Address PR feedback
* Add EXAONE 4.0 docs
* Update EXAONE 4.0 modeling code for main branch
* update
* fix updates
* updates
* fix
* fix
* fix
---------
Co-authored-by: Arthur <arthur.zucker@gmail.com >
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
2025-07-25 19:58:28 +02:00
Park Woorak
3e4d584a5b
Support typing.Literal as type of tool parameters or return value ( #39633 )
...
* support `typing.Literal` as type of tool parameters
* validate the `args` of `typing.Literal` roughly
* add test to get json schema for `typing.Literal` type hint
* fix: add `"type"` attribute to the parsed result of `typing.Literal`
* test: add argument `booleanish` to test multi-type literal
* style: auto fixup
2025-07-25 17:51:28 +00:00
Arthur
300d42a43e
Add ep ( #39501 )
...
* EP + updates
Co-authored-by: Nouamane Tazi <NouamaneTazi@users.noreply.github.com >
Co-authored-by: drbh <drbh@users.noreply.github.com >
* remove unrelated change
* not working yet but let's see where it goes!
* update the api a bit
* udpate
* where I am at for now
* fix ep
* refactor the API
* yups
* fix
* fixup
* clean modeling
* just support llama4 for now!
* properly avoid
* fix
* nits
* Update src/transformers/models/llama4/modeling_llama4.py
* Update src/transformers/integrations/tensor_parallel.py
* style
* ,,,,
* update
---------
Co-authored-by: Nouamane Tazi <NouamaneTazi@users.noreply.github.com >
Co-authored-by: drbh <drbh@users.noreply.github.com >
2025-07-25 19:46:17 +02:00
Dario Salvati
abaa043d60
bad_words_ids no longer slow on mps ( #39556 )
...
* fix: bad_words_ids no longer slow on mps
* fix: SequenceBiasLogitsProcessor slow `_prepare_bias_variables` method
* fix: re-adding a deleted comment
* fix: bug in no_bad_words_logits
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-07-25 19:45:41 +02:00
Cyril Vallez
6630c5b714
Add xlstm model ( #39665 )
...
* Add xLSTM cleanly with optimizations.
* Fix style.
* Fix modeling test.
* Make xLSTM package optional.
* Fix: Update torch version check.
* Fix: Bad variable naming in test.
* Fix: Import structure cleaning with Ruff.
* Fix: Update docstrings.
* Fix: Mitigate unused config attr tests by explicit usage.
* Fix: Skip tests, if xlstm library is not installed.
* Feat: Enable longer context window for inference by chunking.
* Fix: Make training test pass by lowering target accuracy.
* Chore: Increase test verbosity for failing generation test.
* Update docs/source/en/model_doc/xlstm.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Fix: Make xlstm available even without CUDA.
* Chore: Remove unnecessary import.
* Fix: Remove BOS insertion.
* Chore: Improve xLSTMCache documentation.
* Integrate basic xLSTM fallback code.
* Chore: Remove unnecessary import.
* Chore: Remove duplicate LayerNorm.
* chore: update copyright, minor reformatting
* fix: refactor mLSTMStateType due to missing torch import
* fix: add missing import
* Chore: Replace einops.
* fix: apply ruff formatting
* fix: run `make fix-copies` to re-generate dummy_pt_objects.py
* fix: make type hints Python 3.9 compatible
* fix: remove obsolete import
* fix: remove obsolete method from docs
* chore: remove obsolete `force_bos_token_insert` from config
* Chore: Remove duplicated xLSTMCache class.
* Fix: Formatting of modeling_xlstm.py
* Chore: Remove xlstm package requirement from test. Re-add update_rnn_state.
* Fix: Update xLSTMCache docstring.
* Feat: Add proper initialization of xLSTM.
* Chore: Re-format files.
* Chore: Adapt format.
* Fix: xLSTMCache import restructuring.
* Fix: Add __all__ lists to modeling and configuration files.
* Chore: Reformat.
* Fix: Remove unnecessary update_rnn_state function.
* Fix: Undo test accuracy quickfix.
* Fix: Update copyright year, remvoe config copy.
* Chore: Flatten all internal configs to xLSTMConfig.
* Fix: Unused config variables check.
* Chore: Remove unnecessary imports.
* Fix: Unify xlstm cache argument from batch_size to max_batch_size.
* Chore: Remove bad default arg value for xLSTMCache.
* Chore: Rename core configuration arguments to HF default in xLSTM.
* Chore: Fix formatting.
* Fix: xLSTM Cache config access.
* Fix: Update xlstm tests for config update.
* Feat: Re-add embbeding_dim, num_blocks config options for compat with xLSTM-7B.
* Fix: Configuration xLSTM python3.9 syntax.
* Fix: Difference to main in test_utils.py assertion.
* Fix: Bad syntax in xlstm config for python3.9.
* Fix: xLSTMConfig docstring.
* Fix: xLSTMConfig docstring.
* Fix typing issues in xLSTM and BeiT, Paligemma.
* Fix: Exclude xLSTM from test cache utils.
* Chore: Fix style.
* Chore: Fix format.
* Chore: Remove unnecessary LayerNorm, NormLayer layer abstractions.
* Chore: Remove asserts and replace with ValueErrors.
* Chore: Update __init__.py structure of xLSTM.
* Chore: Clean xLSTM initialization of weights.
* Fix index names in modeling_xlstm.py
* Update xlstm model test typing annotations.
* Fix: Remove all asserts.
* Revert changes to the main __init__.py
* Fix: Move xLSTMCache to modeling_xlstm.py
* Fix: Remove xLSTMForCausalLM mapping from modeling_auto.py
* Remove xLSTMCache from dummy_pt_objects.py
* Fix: Remove extended torchdynamo compilation check integrating cuda graph captures.
* Revert test_cache_utils.py xLSTM change.
* Fix: Move xLSTM init functions before init call.
* Remove xLSTMCache from generation utils.
* Fix: Clean xLSTM init functionality for recursive calls.
* Fix: Move xLSTMCache before its first call.
* Fix formatting.
* Add partial docstring for xLSTMModel forward.
* Fix xLSTMCache docstring in xLSTMModel.
* Remove xLSTMCache from public documentation. Update auto_docstring.
* Remove all agressive shape comments
* style
* Fix names
* simplify
* remove output_hidden_states
* Update modeling_xlstm.py
* Update modeling_xlstm.py
* Update test_modeling_xlstm.py
* Update modeling_xlstm.py
* Update modeling_xlstm.py
* fix
* fix
* style
* style
---------
Co-authored-by: Korbinian Poeppel <korbinian.poeppel@nx-ai.com >
Co-authored-by: Korbinian Pöppel <37810656+kpoeppel@users.noreply.github.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: Sebastian Böck <sebastian.boeck@nx-ai.com >
Co-authored-by: Korbinian Poeppel <poeppel@ml.jku.at >
2025-07-25 19:39:17 +02:00
Yoni Gozlan
ed9a96bc6d
Use auto_docstring for perception_lm fast image processor ( #39679 )
2025-07-25 17:32:48 +00:00
Ryan Mullins
d913b39ef3
fix: HWIO to OIHW ( #39200 )
...
* fix: HWIO to OIHW
* Bug in attention type
* Conversion script docstring
* style
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
Co-authored-by: Arthur <arthur.zucker@gmail.com >
2025-07-25 19:23:15 +02:00
Yoni Gozlan
a26f0fabb8
Fix auto_docstring crashing when dependencies are missing ( #39564 )
...
* add try except to not crash auto_docstring when some dependency are missing
* safeguard None value in placeholder dict
2025-07-25 19:19:23 +02:00
Armaghan Shakir
69cff312f5
Add support for DeepseekAI's DeepseekVL ( #36248 )
...
* upload initial code
* update deepseek-vl adaptor
* update hierarchy of vision model classes
* udpate aligner model
* add text model
* Added Image Processor
* Added Image Processor
* Added Image Processor
* apply masks
* remove projection; add aligner
* remove interpolate_pos_encoding
* remove unused params in config
* cleaning
* Add the __init__ file
* added processing deepseek_vl class
* modified the deepseek-vl processor
* modified the deepseek-vl processor
* update __init__
* Update the image processor class name
* Added Deepseek to src/transformers/__init__.py file
* Added Deepseek to image_processing_auto.py
* update the __init__ file
* update deepseek_vl image processor
* Update Deepseek Processor
* upload fast image processor
* Revert "upload fast image processor"
This reverts commit 68c8fd50bafbb9770ac70c9de02448e2519219b4.
* update image processor
* flatten heirarchy
* remove DeepseekVLModel
* major update (complete modeling)
* auto modeling and other files
* formatting
* fix quality
* replace torchvision in modeling
* set default do_normalize to False
* add fast image processor template using tool
* update image processors
* add fast image processor to other files
* update liscense
* Added deepseek image testcases
* update image test
* update processor
* write CHAT_TEMPLATE
* update model for processor
* fix processor
* minor fixes and formatting
* fix image processing and tests
* fix interpolation in sam
* fix output_attentions in DeepseekVLModel
* upload test_modeling
* fix tests because of vocab size
* set use_high_res_vision=False in tests
* fix all modeling tests
* fix styling
* remove explicit background_color from image processors
* added test_processor
* added test_processor
* fix processor tests
* update docs
* update docs
* update docs
* update conversion script
* Fixed typos
* minor fixes from review
- remove model_id comments in examples
- remove from pre-trained auto mapping
- move to image-text-to-text from vision-to-seq in auto mapping
- add image_token_index to __init__ for config
- remove outdated temporary config in conversion script
- update example to use chat_template in docstring example
- update liscense 2021->2025
* fix type in config docstring
Co-authored-by: Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz >
* update get_image_features
* fix config
* improve DeepseekVLImageProcessor.preprocess
* return image_hidden_states
* use AutoTokenizer and AutoImageProcessor in Processor
* fix model outputs
* make num_image_tokens configurable
* fix docstring of processor
* move system prompt to chat template
* fix repo consistency
* fix return_dict
* replace SamVisionEncoder with SamVisionModel
* update to remove deepcopy
* 🛠️ Major Architectural Changes (Adds DeepseekVLHybrid)
* fix quality checks
* add missing hybrid in auto modeling
* run make style
* update sam_hq
* update high_res_size in test
* update docs following #36979
* update code with auto_docstring
* update conversion scripts
* fix style
* fix failing test because of tuple
* set weights_only=True in conversion script
* use safetensors.torch.load_file instead of torch.load in conversion script
* make output_dir optional in conversion script
* fix code snippets in docs (now the examples work fine)
* integration tests for DeepseekVL
* update expected texts
* make style
* integration tests for DeepseekVLHybrid
* fix class name
* update expected texts for hybrid
* run "make style"
* update since changes in main
* run make-style
* nits since changes in main
* undo changes in sam
* fix tests
* fix tests; update with main
* update with main: output_attention/output_hidden_states
* fix copied part in deepseek_vl
* run fix-copies
* fix output_hidden_states
* sam: fix _init_weigths
* use modular for DeepseekVL
* make image processor more modular
* modular: use JanusPreTrainedModel
* janus: provide kwargs in loss
* update processors in conversion script
* Revert "sam: fix _init_weigths"
This reverts commit db625d0c68956c0dad45edd7a469b6a074905c27.
* run fix-copies
---------
Co-authored-by: Shakib-IO <shakib.khan17@northsouth.edu >
Co-authored-by: Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz >
2025-07-25 19:18:50 +02:00
Cyril Vallez
a98bbc294c
Add missing flag for CacheLayer ( #39678 )
...
* fix
* Update cache_utils.py
2025-07-25 19:12:13 +02:00
Xibin Bayes Zhou
45c7bfb157
Add evolla rebase main ( #36232 )
...
* add evolla
* adding protein encoder part
* add initial processing test
* save processor
* add docstring
* add evolla processor
* add two test
* change vision to protein
* change resampler to sequence_compressor
* change vision to protein
* initial update for llama
* add initial update for llamaForCausalLM
* add `test_processor`, `test_saprot_output`, `test_protein_encoder_output`
* change evolla, but still working on it
* add test_single_forward
* pass test_attention_outputs
* pass test_hidden_states_output
* pass test_save_load and test_from_pretrained_no_checkpoint
* pass test_cpu_offload
* skip some tests
* update new progress
* skip test_model_is_small
* pass test_model_weights_reload_no_missing_tied_weights
* pass test_model_get_set_embeddings
* pass test_cpu_offload
* skip test_resize_embeddings
* add pipeline_model_mapping
* remote old setUp
* pass processor save_pretrained and load_pretrained
* remove pooling layer
* pass test_inputs_embeds_matches_input_ids
* pass test_model_is_small
* pass test_attention_outputs
* pass test_initialization
* pass test_model_get_set_embeddings
* pass test_single_forward
* skip test_disk_offload_bin and test_disk_offload_safetensors
* fix most tests
* pass test_protein_encoder_output
* remove useless code
* add EvollaForProteinText2Text
* pass test_saprot_output
* pass all EvollaModelTest test and remove processor test
* add processor test to its own file
* skip is_training since esm skipped it and the saprot code causes error when setting is_training True
* pass processor tests
* solve all except config
* pass most cases
* change init
* add doc to `configuration_evolla.py`
* remove image_processing test
* remove extra processor test
* remove extra modules
* remove extra modules
* change all configs into one config
* pass all evolla test
* pass `make fixup`
* update short summary
* update Evolla-10B-hf
* pass check_dummies.py and check_code_quality
* fix `tests/models/auto/test_tokenization_auto.py::AutoTokenizerTest::test_model_name_edge_cases_in_mappings`
* remove dummy codes
* change format
* fix llava issue
* update format
* update to solve llama3 access issue
* update to make forward right
* solve processor save load problem from instructblip solution
* remove unexpected file
* skip `test_generation_tester_mixin_inheritance`
* add `test_single_forward_correct` and `test_inference_natural_language_protein_reasoning`
* add `modular_evolla.py`
* solved issue #36362
* run `make fixup`
* update modular
* solve float32 training
* add fix
* solve `utils/check_docstrings.py`
* update
* update
* update
* remove other files and replace sequential and einsum
* add use case in document
* update the models
* update model
* change some wrong code
* Update src/transformers/models/evolla/modular_evolla.py
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com >
* Update src/transformers/models/evolla/modular_evolla.py
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com >
* Update src/transformers/models/evolla/modular_evolla.py
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com >
* Update src/transformers/models/evolla/modular_evolla.py
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com >
* fix issues mentioned in PR
* update style and rearrange the placement
* fix return_dict argument issue
* solve SaProtConfig issue
* Solve EvollaSaProtRotaryEmbedding issue
* solve attention_mask issue
* solve almosst all issues
* make style
* update config
* remove unrelated pickle file
* delete pickle files
* fix config
* simplify a lot
* remove past k-v from encoder
* continue work
* style
* skip it from init
* fix init
* fix init
* simplify more
* fill in docstrings
* change test for generation
* skip test
* fix style
---------
Co-authored-by: Chenchen Han <13980209828@163.com >
Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co >
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com >
2025-07-25 19:11:57 +02:00
Yih-Dar
2670da66ce
update expected outputs for whisper after #38778 ( #39304 )
...
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-07-25 16:48:10 +00:00
Yih-Dar
4b125e2993
fix kyutai tests ( #39416 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com >
2025-07-25 18:42:04 +02:00
Arthur
4f17bf0572
Fixes the BC ( #39636 )
...
* fix
* update
* Update src/transformers/utils/generic.py
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com >
* fixup
* fixes
* fix more models
* fix fix fix
* add embedding to more models
* update
* update
* fix
---------
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com >
2025-07-25 18:41:21 +02:00
Cyril Vallez
ddb0546d14
Delete bad rebasing functions ( #39672 )
...
* remove outdated stuff
* remove comment
* use register
* remove finally clause (to allow further check if fallback to sdpa)
* general exception
* add wrapper
* revert check
* typo
2025-07-25 18:28:09 +02:00
Anton Vlasjuk
a91653561e
[Ernie 4.5] Post merge adaptations ( #39664 )
...
* ernie 4.5 fixes
* Apply style fixes
* fix
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-07-25 17:36:18 +02:00
Joao Gante
5d0ba3e479
[CI] revert device in test_export_static_cache ( #39662 )
...
* revert device
* add todo
2025-07-25 15:36:12 +00:00
Pavel Iakubovskii
850bdeaa95
Fix ModernBERT Decoder model ( #39671 )
...
fix
2025-07-25 16:20:12 +01:00
Yoni Gozlan
17f02102c5
🚨 [Fast Image Processor] Force Fast Image Processor for Qwen2_VL/2_5_VL + Refactor ( #39591 )
...
* init
* Force qwen2VL image proc to fast
* refactor qwen2 vl fast
* fix copies
* Update after PR review and update tests to use return_tensors="pt"
* fix processor tests
* add BC for min pixels/max pixels
2025-07-25 11:11:28 -04:00
Lysandre Debut
f90de364c2
Rename huggingface_cli to hf ( #39630 )
...
* Rename huggingface_cli to hf
* hfh
2025-07-25 14:10:04 +02:00
revanth
3b3f9c0c46
fix(voxtral): correct typo in apply_transcription_request ( #39572 )
...
* fix(voxtral): correct typo in apply_transcription_request
* temporary wrapper: apply_transcrition_request
* Update processing_voxtral.py
* style: sort imports in processing_voxtral.py
* docs(voxtral): fix typo in voxtral.md
* make style
* doc update
---------
Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com >
Co-authored-by: Eustache Le Bihan <eulebihan@gmail.com >
2025-07-25 12:09:44 +00:00
Joao Gante
2a82cf06ad
make fixup ( #39661 )
2025-07-25 11:27:45 +00:00
Joao Gante
e3760501b0
[docs] fix ko cache docs ( #39644 )
...
fix ko docs
2025-07-25 10:06:03 +01:00
Quentin Lhoest
91f591f7bc
Make pytorch examples UV-compatible ( #39635 )
...
* update release.py
* add uv headers in some pytorch examples
* rest of pytorch examples
* style
2025-07-25 10:46:22 +02:00
Wing Lian
c46c17db57
revert change to cu_seqlen_k and max_k when preparing from position_ids ( #39653 )
2025-07-25 10:28:22 +02:00
Jeffrey Li
4600c27c4f
Fix: explicit not none check for tensors in flash attention ( #39639 )
...
fix: explicit not none check for tensors
2025-07-25 10:09:14 +02:00
Raushan Turganbay
c392d47c9b
[attention] fix test for packed padfree masking ( #39582 )
...
* fix most tests
* skip a few more tests
* address comments
* fix chameleon tests
* forgot to uncomment
* qwen has its own tests with images, rename it as well
2025-07-25 07:44:52 +00:00
lmarshall12
565c035a2e
Add owlv2 fast processor ( #39041 )
...
* add owlv2 fast image processor
* add Owlv2ImageProcessorFast to Owlv2Processor image_processor_class
* add Owlv2ImageProcessorFast to Owlv2Processor image_processor_class
* change references to owlVit to owlv2 in docstrings for post process methods
* change type hints from List, Dict, Tuple to list, dict, tuple
* remove unused typing imports
* add disable grouping argument to group images by shape
* run make quality and repo-consistency
* use modular
* fix auto_docstring
---------
Co-authored-by: Lewis Marshall <lewism@elderda.co.uk >
Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co >
2025-07-25 02:40:11 +00:00
Wing Lian
5a81d7e0b3
revert behavior of _prepare_from_posids ( #39622 )
...
* revert behavior of _prepare_from_posids
* add back cu_seqlens_k and max_k for inference
2025-07-24 20:31:00 +02:00
eustlb
ad6fd2da0e
[Voxtral] values for A10 runners ( #39605 )
...
* values for A10 runners
* make
* as for Llava
* does not apply to Voxtral
2025-07-24 18:52:35 +02:00
Joao Gante
4741e1f1b7
[timm] new timm pin ( #39640 )
2025-07-24 16:01:59 +00:00
StevenBucaille
12b612830d
[efficientloftr] fix model_id in tests ( #39621 )
...
fix: wrong EfficientLoFTR model id in tests
2025-07-24 10:41:06 +01:00
Raushan Turganbay
947a37e8f5
Update recent processors for vLLM backend ( #39583 )
...
* update recent models and make sure it runs withh vLLM
* delete!
2025-07-24 10:29:27 +02:00
Matthew Hernandez
7b897fe583
[Docs] Translate audio_classification.md from English to Spanish ( #39513 )
...
* Docs: translate audio_classification to Spanish
* Update audio_classification.md
* Remove space
* Normalize backticks
* Update audio_classification.md
* Apply corrections recommended by aaronjimv
* Update _toctree.yml
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-07-23 15:55:13 -07:00