HuggingFace_transformer/tests at efceeaf2678678553e94dce78859f87776e633a7 - HuggingFace_transformer - Gitea: Git with SSUM

SUMIN/HuggingFace_transformer

Files

History

Arthur efceeaf267 Kernels flash attn (#39474 )

* use partial to wrap around `transformers` utils!

* try to refactor?

* revert one wrong change

* just a nit

* push

* reverter watever was wrong!

* some nits

* fixes when there is no attention mask

* bring the licence back

* some fixes

* nit

* style

* remove prints

* correct dtype

* fa flags for testing

* update

* use paged attention if requested!

* updates

* a clone was needed, not sure why

* automatically create cu seq lens when input is flash, this at least makes sure layers don't re-compute

* simplify and improve?

* flash attention is kinda broken on recent cuda version so allow the opportunity to use something else

* fix!

* protect kernels import

* update

* properly parse generation config being passed

* revert and update

* add two tests

* some fixes

* fix test FA2

* takes comment into account

* fixup

* revert changes

* revert the clone, it is only needed because the metal kernel is not doing it?

* [docs] update attention implementation and cache docs (#39547)

* update docs

* Apply suggestions from code review

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* applu suggestions

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix mps on our side for now

* Update src/transformers/integrations/flash_paged.py

* no qa

---------

Co-authored-by: Vasqu <antonprogamer@gmail.com>
Co-authored-by: Raushan Turganbay <raushan@huggingface.co>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

2025-07-22 15:41:06 +02:00

..

bettertransformer

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

[serve] Add speech to text (/v1/audio/transcriptions) (#39434 )

2025-07-17 14:29:57 +00:00

Remove script datasets in tests (#38940 )

2025-06-25 14:31:20 +00:00

Add Optional to remaining types (#37808 )

2025-04-28 14:20:45 +01:00

[tests] remove tests from libraries with deprecated support (flax, tensorflow_text, ...) (#39051 )

2025-06-26 16:25:00 +01:00

Gaudi3 CI (#38790 )

2025-06-23 10:56:51 +02:00

Rename _supports_flash_attn_2 in examples and tests (#39471 )

2025-07-21 14:02:57 +02:00

Add AMD expectations to Mistral3 tests (#39481 )

2025-07-22 15:40:16 +02:00

[tests] remove TF tests (uses of require_tf) (#38944 )

2025-06-25 17:29:10 +00:00

peft_integration

Enable some ruff checks for performance and readability (#39383 )

2025-07-17 13:21:59 +00:00

Raise TypeError instead of ValueError for invalid types (#38660 )

2025-07-21 12:42:00 +00:00

Remove residual quantization attribute from dequantized models (#39373 )

2025-07-15 17:16:10 +02:00

[modular] speedup check_modular_conversion with multiprocessing (#37456 )

2025-07-10 19:07:59 +01:00

Deprecate TF + JAX (#38758 )

2025-06-11 17:28:06 +01:00

tensor_parallel

enable static cache on TP model (#39164 )

2025-07-09 21:14:45 +00:00

[tests] remove tests from libraries with deprecated support (flax, tensorflow_text, ...) (#39051 )

2025-06-26 16:25:00 +01:00

Fix tests due to breaking change in accelerate (#39451 )

2025-07-17 13:51:50 +01:00

Refactor MambaCache to modeling_mamba.py (#38086 )

2025-07-21 14:59:36 +02:00

__init__.py

…

causal_lm_tester.py

[Ernie 4.5] Add ernie text models (#39228 )

2025-07-21 19:51:49 +02:00

test_backbone_common.py

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

test_configuration_common.py

Update composition flag usage (#36263 )

2025-04-09 11:48:49 +02:00

test_feature_extraction_common.py

Use Python 3.9 syntax in tests (#37343 )

2025-04-08 14:12:08 +02:00

test_image_processing_common.py

Fix overriding Fast Image/Video Processors instance attributes affect other instances (#39363 )

2025-07-12 23:39:06 +00:00

test_image_transforms.py

Raise TypeError instead of ValueError for invalid types (#38660 )

2025-07-21 12:42:00 +00:00

test_modeling_common.py

Kernels flash attn (#39474 )

2025-07-22 15:41:06 +02:00

test_pipeline_mixin.py

Enable some ruff checks for performance and readability (#39383 )

2025-07-17 13:21:59 +00:00

test_processing_common.py

[chat template] return assistant mask in processors (#38545 )

2025-07-18 12:23:20 +00:00

test_sequence_feature_extraction_common.py

[tests] remove TF tests (uses of require_tf) (#38944 )

2025-06-25 17:29:10 +00:00

test_tokenization_common.py

Fix pylint warnings (#39477 )

2025-07-21 12:38:05 +00:00

test_tokenization_mistral_common.py

Add voxtral (#39429 )

2025-07-18 00:02:04 +00:00

test_training_args.py

Fix TrainingArguments.torch_empty_cache_steps post_init check (#36734 )

2025-03-17 16:09:46 +01:00

test_video_processing_common.py

Fix overriding Fast Image/Video Processors instance attributes affect other instances (#39363 )

2025-07-12 23:39:06 +00:00