HuggingFace_transformer

Files

efsotr 3ee72af6b6 Fix graph break in torch.compile when using FA2 with attention_mask=None and batch size > 1 (#37332 )

* Fix graph break in torch.compile when using FA2 with attention_mask=None and batch size > 1

* fix code format

* add test; replace position_ids with query_states becasue position_ids.shape[0] is always 1

* add assert loss is not nan

2025-06-25 07:58:34 +00:00

bettertransformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

deepspeed

Gaudi3 CI (#38790 )

2025-06-23 10:56:51 +02:00

extended

Add Optional to remaining types (#37808 )

2025-04-28 14:20:45 +01:00

fixtures

Implementation of SuperPoint and AutoModelForKeypointDetection (#28966 )

2024-03-19 14:43:02 +00:00

fsdp

Gaudi3 CI (#38790 )

2025-06-23 10:56:51 +02:00

generation

enable misc test cases on XPU (#38852 )

2025-06-18 09:20:49 +02:00

models

Skip sdpa dispatch on flash test due to unsupported head dims (#39010 )

2025-06-24 20:16:56 +02:00

optimization

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

peft_integration

FIX: Faulty PEFT tests (#37757 )

2025-04-28 15:10:46 +02:00

pipelines

[Feature] Support is_split_into_words in the TokenClassificationPipeline. (#38818 )

2025-06-23 15:31:32 +00:00

quantization

Fix HQQ model param device transfer issue (#38466 )

2025-06-18 15:09:00 +02:00

repo_utils

Use HF papers (#38184 )

2025-06-13 11:07:09 +00:00

sagemaker

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

tensor_parallel

[TP] Change command in tests to python3 (#38555 )

2025-06-03 11:03:33 +00:00

tokenization

Remove isort from dependencies (#38616 )

2025-06-05 16:42:49 +00:00

trainer

Gaudi3 CI (#38790 )

2025-06-23 10:56:51 +02:00

utils

Fix bugs in DynamicCache (#37880 )

2025-06-24 19:43:40 +02:00

__init__.py

…

causal_lm_tester.py

Refactor DBRX tests to use CausalLMModelTest base classes (#38475 )

2025-06-13 16:22:12 +01:00

test_backbone_common.py

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

test_configuration_common.py

Update composition flag usage (#36263 )

2025-04-09 11:48:49 +02:00

test_feature_extraction_common.py

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

test_image_processing_common.py

Add Idefics2/3 and SmolVLM Fast image processors + improvements for fast image processors (#38157 )

2025-06-23 14:17:25 +00:00

test_image_transforms.py

Fix pad image transform for batched inputs (#37544 )

2025-05-08 10:51:15 +01:00

test_modeling_common.py

Fix graph break in torch.compile when using FA2 with attention_mask=None and batch size > 1 (#37332 )

2025-06-25 07:58:34 +00:00

test_pipeline_mixin.py

No more Tuple, List, Dict (#38797 )

2025-06-17 19:37:18 +01:00

test_processing_common.py

[video processors] support frame sampling within processors (#38105 )

2025-06-12 09:34:30 +00:00

test_sequence_feature_extraction_common.py

No more Tuple, List, Dict (#38797 )

2025-06-17 19:37:18 +01:00

test_tokenization_common.py

🚨 rm already deprecated pad_to_max_length arg (#37617 )

2025-05-01 15:21:55 +02:00

test_training_args.py

Fix TrainingArguments.torch_empty_cache_steps post_init check (#36734 )

2025-03-17 16:09:46 +01:00

test_video_processing_common.py

[video processors] support frame sampling within processors (#38105 )

2025-06-12 09:34:30 +00:00