HuggingFace_transformer

Files

Pavel Iakubovskii 9bec2654ed Add V-JEPA for video classification model (#38788 )

* adding model and conversion scripts

* add imports to test vjepa conversion

* fix imports and make conversion work

* fix computation for short side

* replace attention with library attention function

* cleanup more attention classes

* remove config overrides

* add test cases, fix some of the failing ones

* fix the model outputs

* fix outputs of the model per review

* fix too big model test case

* fix styling __init__.py

* fix initialization test

* remove all asserts per review

* update sorting unsorting logic as per feedback

* remove is_video per review

* remove another is_video segment

* remove unwanted stuff

* small fixes

* add docstrings for the model

* revert adding vjepa2 config here

* update styling

* add config docstrings (wip)

* fix dpr issue

* removed test failing issues

* update styles

* merge predictor configs into main config

* remove processing code, add video processor

* remove permute which is not necessary now

* fix styles

* updated vjepa2 to be in video_processing_auto

* update comment for preprocessing

* test integration test and fix the outputs

* update test values, change test to look at repeated frames for a given image

* add a simple video processing test

* refactoring pixel_values_videos and upload ckpts to original

* fix torch_fx test cases

* remove unused config

* add all config docstrings

* add more integration tests

* add basic doc

* revert unwanted styling changes

* working make fixup

* Fix model_type in config

* Add ForVideoClassification model

* update attention implementation to fit new hf standards

* fix the preprocessing logic, ensure it matches the original model

* remove use_rope logic, cleanup

* fix docstrings

* Further cleanup, update doc

* Fix model prefix

* fix get_vision_features

* VJEPA2Embeddings style refactor

* nit, style comment

* change modules default values

* Only `str` activation in config

* GradientCheckpointingLayer

* fixup

* fix conversion script

* Remove return_dict

* remove None return typehint

* Refactor VJEPA2Layer, remove use_SiLU

* Fix fx tests

* dpr -> drop_path_rates

* move *ModelOutput on top

* format docs bit

* update docs

* update docs

* update doc example

* remove prune_heads from model

* remove unused config params

* refactor embed signature

* Add vjepa to docs

* Fix config docstring

* attention head

* update defaults

* Update docs/source/en/model_doc/vjepa2.md

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update docs/source/en/model_doc/vjepa2.md

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Fix import

* Min refactoring

* Update HUB_SOURCE and HUB_REPO in conversion script

* Add missing headers

* VJEPA -> V-JEPA in docs

* Add image to doc

* fix style

* fix init weights

* change checkpoint name in modeling tests

* Initial cls head setup

* remove rop attention from head (not needed)

* remove swigluffn - not needed

* Add siglip layer

* Replace with siglip layer

* Rename Siglip - VJEPA2

* remove unused modules

* remove siglip mlp

* nit

* remove MLP

* Refactor head cross attention

* refactor VJEPA2HeadCrossAttentionLayer

* nit renaming

* fixup

* remove commented code

* Add cls head params to config

* depth from config

* move pooler + classifier to the model

* Update for cls model signature

* move layers, rename a bit

* fix docs

* update weights init

* remove typehint for init

* add to auto-mapping

* enable tests

* Add conversion script

* fixup

* add to docs

* fix docs

* nit

* refactor for mapping

* clean

* Add integration test

* Fixing multi gpu test

* update not-split-modules

* update video cls test tolerance

* Increase test_inference_image tolerance

* Update no-split modules for multi gpu

* Apply suggestions from code review

* fixing multi-gpu

* fix docstring

* Add cls snippet to docs

* Update checkpoint

2025-06-13 17:56:15 +01:00

albert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

align

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

altclip

🚨 🚨 Setup -> setupclass conversion (#37282 )

2025-04-08 17:15:37 +01:00

aria

Don't run AriaForConditionalGenerationModelTest on CircleCI (#38615 )

2025-06-06 11:30:31 +02:00

audio_spectrogram_transformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

auto

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

autoformer

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

aya_vision

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

bamba

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

bark

enable more test cases on xpu (#38572 )

2025-06-06 09:29:51 +02:00

bart

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

barthez

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

bartpho

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

beit

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

bert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

bert_generation

Fix typos in strings and comments (#37799 )

2025-04-28 11:39:11 +01:00

bert_japanese

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

bertweet

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

big_bird

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

bigbird_pegasus

[generation] bring back tests on vision models (#38603 )

2025-06-06 08:23:15 +00:00

biogpt

Bart: new cache format (#35314 )

2025-05-16 13:26:54 +02:00

bit

Add ImageProcessorFast to BiT processor (#37180 )

2025-04-14 17:07:48 +02:00

bitnet

Add Bitnet model (#37742 )

2025-04-28 15:08:46 +02:00

blenderbot

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

blenderbot_small

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

blip

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

blip_2

[generation] bring back tests on vision models (#38603 )

2025-06-06 08:23:15 +00:00

bloom

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

bridgetower

Add Optional to remaining types (#37808 )

2025-04-28 14:20:45 +01:00

bros

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

byt5

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

camembert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

canine

Skip torchscript tests for 2 models (#38643 )

2025-06-06 20:17:37 +02:00

chameleon

Update some tests for torch 2.7.1 (#38701 )

2025-06-10 11:46:52 +02:00

chinese_clip

Add Fast Chinese-CLIP Processor (#37012 )

2025-04-15 18:31:20 +02:00

clap

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

clip

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

clipseg

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

clvp

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

code_llama

remove unhandled parameter (#38145 )

2025-06-02 15:57:32 +02:00

codegen

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

cohere

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

cohere2

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

colpali

Add ColQwen2 to 🤗 transformers (#35778 )

2025-06-02 12:58:01 +00:00

colqwen2

Update some tests for torch 2.7.1 (#38701 )

2025-06-10 11:46:52 +02:00

conditional_detr

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

convbert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

convnext

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

convnextv2

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

cpm

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

cpmant

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

csm

Update CsmForConditionalGenerationIntegrationTest (#38424 )

2025-05-28 10:20:43 +02:00

ctrl

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

cvt

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

d_fine

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

dab_detr

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

dac

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

data2vec

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

dbrx

Refactor DBRX tests to use CausalLMModelTest base classes (#38475 )

2025-06-13 16:22:12 +01:00

deberta

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

deberta_v2

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

decision_transformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

deepseek_v3

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

deformable_detr

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

deit

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

depth_anything

Skip some export tests on torch 2.7 (#38677 )

2025-06-12 12:47:15 +02:00

depth_pro

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

detr

enable more test cases on xpu (#38572 )

2025-06-06 09:29:51 +02:00

diffllama

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

dinat

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

dinov2

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

dinov2_with_registers

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

distilbert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

dit

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

donut

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

dpr

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

dpt

Skip some export tests on torch 2.7 (#38677 )

2025-06-12 12:47:15 +02:00

efficientnet

Add EfficientNet Image PreProcessor (#37055 )

2025-04-16 21:59:24 +02:00

electra

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

emu3

update emu3 test (#38543 )

2025-06-03 11:02:01 +02:00

encodec

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

encoder_decoder

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

ernie

Remove old code for PyTorch, Accelerator and tokenizers (#37234 )

2025-04-10 20:54:21 +02:00

esm

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

falcon

🚨 🚨 Inherited CausalLM Tests (#37590 )

2025-05-23 18:29:31 +01:00

falcon_h1

[Falcon H1] Fix slow path forward pass (#38320 )

2025-05-26 15:30:35 +02:00

falcon_mamba

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

fastspeech2_conformer

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

flaubert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

flava

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

fnet

🚨 rm already deprecated pad_to_max_length arg (#37617 )

2025-05-01 15:21:55 +02:00

focalnet

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

fsmt

Fix typos in strings and comments (#37910 )

2025-05-01 14:58:58 +01:00

funnel

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

fuyu

🔴 [VLM] Add base model without head (#37033 )

2025-05-07 17:47:51 +02:00

gemma

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

gemma2

Unbreak optimum-executorch (#38646 )

2025-06-13 11:13:32 +02:00

gemma3

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

git

🚨 🚨 Setup -> setupclass conversion (#37282 )

2025-04-08 17:15:37 +01:00

glm

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

glm4

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

glpn

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

got_ocr2

[tests] expand flex-attn test for vision models (#38434 )

2025-06-03 07:40:44 +00:00

gpt2

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

gpt_bigcode

Remove head mask in generative models (#35786 )

2025-05-15 10:44:19 +02:00

gpt_neo

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

gpt_neox

🚨 🚨 Inherited CausalLM Tests (#37590 )

2025-05-23 18:29:31 +01:00

gpt_neox_japanese

Remove old code for PyTorch, Accelerator and tokenizers (#37234 )

2025-04-10 20:54:21 +02:00

gpt_sw3

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

gptj

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

granite

switch to device agnostic device calling for test cases (#38247 )

2025-05-26 10:18:53 +02:00

granite_speech

enable finegrained_fp8 and granite_speech cases on XPU (#38036 )

2025-05-14 08:58:40 +00:00

granitemoe

switch to device agnostic device calling for test cases (#38247 )

2025-05-26 10:18:53 +02:00

granitemoehybrid

switch to device agnostic device calling for test cases (#38247 )

2025-05-26 10:18:53 +02:00

granitemoeshared

switch to device agnostic device calling for test cases (#38247 )

2025-05-26 10:18:53 +02:00

grounding_dino

enable more test cases on xpu (#38572 )

2025-06-06 09:29:51 +02:00

groupvit

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

helium

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

herbert

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

hgnet_v2

Add D-FINE Model into Transformers (#36261 )

2025-04-29 12:17:55 +01:00

hiera

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

hubert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

ibert

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

idefics

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

idefics2

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

idefics3

[VLMs] fix flash-attention tests (#37603 )

2025-04-24 11:48:11 +02:00

ijepa

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

imagegpt

[test] update test_past_key_values_format (#37614 )

2025-04-22 11:07:34 +01:00

informer

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

instructblip

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

instructblipvideo

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

internvl

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

jamba

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

janus

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

jetmoe

🚨 🚨 Inherited CausalLM Tests (#37590 )

2025-05-23 18:29:31 +01:00

kosmos2

[VLMs] support attention backends (#37576 )

2025-05-08 18:18:54 +02:00

layoutlm

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

layoutlmv2

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

layoutlmv3

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

layoutxlm

🚨 rm already deprecated pad_to_max_length arg (#37617 )

2025-05-01 15:21:55 +02:00

led

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

levit

Add Fast LeViT Processor (#37154 )

2025-04-14 17:07:36 +02:00

lilt

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

llama

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

llama4

enable more test cases on xpu (#38572 )

2025-06-06 09:29:51 +02:00

llava

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

llava_next

Fix llava_next tests (#38813 )

2025-06-13 15:19:41 +02:00

llava_next_video

[video processor] fix tests (#38104 )

2025-05-14 10:24:07 +00:00

llava_onevision

Fix llava_onevision tests (#38791 )

2025-06-12 15:06:49 +02:00

longformer

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

longt5

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

luke

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

lxmert

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

m2m_100

🔴🔴🔴 [Attention] Refactor Attention Interface for Bart-based Models (#38108 )

2025-05-22 17:12:58 +02:00

mamba

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

mamba2

Mamba2 remove unecessary test parameterization (#38227 )

2025-05-20 13:54:04 +00:00

marian

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

markuplm

🚨 rm already deprecated pad_to_max_length arg (#37617 )

2025-05-01 15:21:55 +02:00

mask2former

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

maskformer

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

mbart

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

mbart50

Use lru_cache for tokenization tests (#36818 )

2025-03-28 15:09:35 +01:00

megatron_bert

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

megatron_gpt2

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

mgp_str

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

mimi

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

minimax

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

mistral

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

mistral3

🔴 Video processors as a separate class (#35206 )

2025-05-12 11:55:51 +02:00

mixtral

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

mlcd

Add MLCD model (#36182 )

2025-04-15 11:33:09 +01:00

mllama

Fix mllama (#38704 )

2025-06-12 16:15:35 +02:00

mluke

Fix typos in strings and comments (#37799 )

2025-04-28 11:39:11 +01:00

mobilebert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

mobilenet_v1

Add Fast Image Processor for MobileNetV1 (#37111 )

2025-04-23 15:55:41 -04:00

mobilenet_v2

Add Fast Mobilenet-V2 Processor (#37113 )

2025-04-14 17:08:47 +02:00

mobilevit

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

mobilevitv2

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

modernbert

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

moonshine

Skip torchscript tests for 2 models (#38643 )

2025-06-06 20:17:37 +02:00

moshi

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

mpnet

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

mpt

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

mra

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

mt5

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

musicgen

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

musicgen_melody

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

mvp

Fix typos in strings and comments (#37799 )

2025-04-28 11:39:11 +01:00

myt5

🚨 🚨 Setup -> setupclass conversion (#37282 )

2025-04-08 17:15:37 +01:00

nemotron

switch to device agnostic device calling for test cases (#38247 )

2025-05-26 10:18:53 +02:00

nllb

Use lru_cache for tokenization tests (#36818 )

2025-03-28 15:09:35 +01:00

nllb_moe

Fix typos in strings and comments (#37799 )

2025-04-28 11:39:11 +01:00

nougat

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

nystromformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

olmo

Unbreak optimum-executorch (#38646 )

2025-06-13 11:13:32 +02:00

olmo2

Make HF implementation match original OLMo 2 models for lower precisions (#38131 )

2025-05-19 15:35:23 +02:00

olmoe

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

omdet_turbo

enable more test cases on xpu (#38572 )

2025-06-06 09:29:51 +02:00

oneformer

Fix OneFormer integration test (#38016 )

2025-05-12 16:02:41 +02:00

openai

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

opt

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

owlv2

🚨 🚨 Setup -> setupclass conversion (#37282 )

2025-04-08 17:15:37 +01:00

owlvit

Add Fast owlvit Processor (#37164 )

2025-04-14 17:58:09 +02:00

paligemma

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

paligemma2

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

patchtsmixer

🔴🔴🔴 [Attention] Refactor Attention Interface for Bart-based Models (#38108 )

2025-05-22 17:12:58 +02:00

patchtst

Force torch>=2.6 with torch.load to avoid vulnerability issue (#37785 )

2025-04-25 16:57:09 +02:00

pegasus

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

pegasus_x

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

perceiver

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

persimmon

🚨 🚨 Inherited CausalLM Tests (#37590 )

2025-05-23 18:29:31 +01:00

phi

🚨 🚨 Inherited CausalLM Tests (#37590 )

2025-05-23 18:29:31 +01:00

phi3

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

phi4_multimodal

Fix incorrect batching audio index calculation for Phi-4-Multimodal (#38103 )

2025-05-26 12:41:31 +00:00

phimoe

Fix MoE gradient test (#38438 )

2025-05-28 16:44:20 +01:00

phobert

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

pix2struct

Fix typos in strings and comments (#37799 )

2025-04-28 11:39:11 +01:00

pixtral

Add args support for fast image processors (#37018 )

2025-05-16 12:01:46 -04:00

plbart

🔴🔴🔴 [Attention] Refactor Attention Interface for Bart-based Models (#38108 )

2025-05-22 17:12:58 +02:00

poolformer

Add Fast Image Processor for PoolFormer (#37182 )

2025-04-23 15:55:33 -04:00

pop2piano

Fix typos in strings and comments (#37799 )

2025-04-28 11:39:11 +01:00

prompt_depth_anything

Skip some export tests on torch 2.7 (#38677 )

2025-06-12 12:47:15 +02:00

prophetnet

[generation] bring back tests on vision models (#38603 )

2025-06-06 08:23:15 +00:00

pvt

Add Fast PVT Processor (#37204 )

2025-04-23 15:55:20 -04:00

pvt_v2

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

qwen2

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

qwen2_5_omni

Fix qwen_2_5 omni (#38658 )

2025-06-12 14:43:54 +02:00

qwen2_5_vl

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

qwen2_audio

[Tests] Clean up test cases for few models (#38315 )

2025-05-29 08:21:28 +00:00

qwen2_moe

Fix MoE gradient test (#38438 )

2025-05-28 16:44:20 +01:00

qwen2_vl

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

qwen3

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

qwen3_moe

Fix MoE gradient test (#38438 )

2025-05-28 16:44:20 +01:00

rag

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

recurrent_gemma

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

reformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

regnet

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

rembert

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

resnet

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

roberta

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

roberta_prelayernorm

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

roc_bert

Remove old code for PyTorch, Accelerator and tokenizers (#37234 )

2025-04-10 20:54:21 +02:00

roformer

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

rt_detr

enable more test cases on xpu (#38572 )

2025-06-06 09:29:51 +02:00

rt_detr_v2

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

rwkv

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

sam

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

sam_hq

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

seamless_m4t

[seamless_m4t] Skip some tests when speech is not available (#38430 )

2025-06-02 09:17:28 +00:00

seamless_m4t_v2

[seamless_m4t] Skip some tests when speech is not available (#38430 )

2025-06-02 09:17:28 +00:00

segformer

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

seggpt

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

sew

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

sew_d

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

shieldgemma2

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

siglip

[tests] expand flex-attn test for vision models (#38434 )

2025-06-03 07:40:44 +00:00

siglip2

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

smolvlm

[video processors] support frame sampling within processors (#38105 )

2025-06-12 09:34:30 +00:00

speech_encoder_decoder

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

speech_to_text

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

speecht5

[generation] bring back tests on vision models (#38603 )

2025-06-06 08:23:15 +00:00

splinter

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

squeezebert

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

stablelm

🚨 🚨 Inherited CausalLM Tests (#37590 )

2025-05-23 18:29:31 +01:00

starcoder2

🚨 🚨 Inherited CausalLM Tests (#37590 )

2025-05-23 18:29:31 +01:00

superglue

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

superpoint

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

swiftformer

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

swin

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

swin2sr

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

swinv2

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

switch_transformers

[generation] bring back tests on vision models (#38603 )

2025-06-06 08:23:15 +00:00

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

table_transformer

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

tapas

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

textnet

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

time_series_transformer

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

timesfm

[TimesFM] use the main revison instead of revision for integration test (#37558 )

2025-04-17 11:26:03 +02:00

timesformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

timm_backbone

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

timm_wrapper

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

trocr

🚨 🚨 Setup -> setupclass conversion (#37282 )

2025-04-08 17:15:37 +01:00

tvp

Add Optional to remaining types (#37808 )

2025-04-28 14:20:45 +01:00

udop

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

umt5

[generation] bring back tests on vision models (#38603 )

2025-06-06 08:23:15 +00:00

unispeech

🔴🔴🔴 [Attention] Refactor Attention Interface for Bart-based Models (#38108 )

2025-05-22 17:12:58 +02:00

unispeech_sat

🔴🔴🔴 [Attention] Refactor Attention Interface for Bart-based Models (#38108 )

2025-05-22 17:12:58 +02:00

univnet

chore: fix typos in the tests directory (#36813 )

2025-03-21 10:20:05 +01:00

upernet

Skip some export tests on torch 2.7 (#38677 )

2025-06-12 12:47:15 +02:00

video_llava

fix spelling errors (#38608 )

2025-06-05 13:57:23 +01:00

videomae

[tests] expand flex-attn test for vision models (#38434 )

2025-06-03 07:40:44 +00:00

vilt

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

vipllava

[tests] expand flex-attn test for vision models (#38434 )

2025-06-03 07:40:44 +00:00

vision_encoder_decoder

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

vision_text_dual_encoder

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

visual_bert

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

vit

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

vit_mae

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

vit_msn

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

vitdet

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

vitmatte

Skip some export tests on torch 2.7 (#38677 )

2025-06-12 12:47:15 +02:00

vitpose

Skip some export tests on torch 2.7 (#38677 )

2025-06-12 12:47:15 +02:00

vitpose_backbone

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

vits

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

vivit

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

vjepa2

Add V-JEPA for video classification model (#38788 )

2025-06-13 17:56:15 +01:00

wav2vec2

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

wav2vec2_bert

🚨 🚨 Setup -> setupclass conversion (#37282 )

2025-04-08 17:15:37 +01:00

wav2vec2_conformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

wav2vec2_phoneme

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

wav2vec2_with_lm

use torch.testing.assertclose instead to get more details about error in cis (#35659 )

2025-01-24 16:55:28 +01:00

wavlm

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

whisper

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

x_clip

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

xglm

Expectation fixes and added AMD expectations (#38729 )

2025-06-13 16:14:58 +02:00

xlm

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

xlm_roberta

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

xlm_roberta_xl

Remove old code for PyTorch, Accelerator and tokenizers (#37234 )

2025-04-10 20:54:21 +02:00

xlnet

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

xmod

Remove old code for PyTorch, Accelerator and tokenizers (#37234 )

2025-04-10 20:54:21 +02:00

yolos

🚨Early-error🚨 config will error out if output_attentions=True and the attn implementation is wrong (#38288 )

2025-05-23 17:17:38 +02:00

yoso

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

zamba

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

zamba2

Remove all traces of low_cpu_mem_usage (#38792 )

2025-06-12 16:39:33 +02:00

zoedepth

Skip some export tests on torch 2.7 (#38677 )

2025-06-12 12:47:15 +02:00

__init__.py

Move test model folders (#17034 )

2022-05-03 14:42:02 +02:00