HuggingFace_transformer/tests at be3fd8a262fb1bfdbe2aaf1b00ab78e243632cba

bytebarde be3fd8a262 [Flash Attention 2] Add flash attention 2 for GPT-J (#28295)

* initial implementation of flash attention for gptj

* modify flash attention and overwrite test_flash_attn_2_generate_padding_right

* update flash attention support list

* remove the copy line in the `CodeGenBlock`

* address copy mechanism

* Update src/transformers/models/gptj/modeling_gptj.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Add GPTJ attention classes

* add expected outputs in the gptj test

* Ensure repo consistency with 'make fix-copies'

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

2024-03-13 08:43:00 +01:00

[Test refactor 1/5] Per-folder tests reorganization (#15725)

2022-02-23 15:46:28 -05:00

bettertransformer

Fixed malapropism error (#26660)

2023-10-09 11:04:57 +02:00

fix failing trainer ds tests (#29057)

2024-02-16 17:18:45 +05:30

Device agnostic trainer testing (#27131)

2023-10-30 18:16:40 +00:00

[WIP] add SpeechT5 model (#18922)

2023-02-03 12:43:46 -05:00

Update all references to canonical models (#29001)

2024-02-16 08:16:58 +01:00

[tests] use torch_device instead of auto for model testing (#29531)

2024-03-08 11:21:43 +00:00

[Flash Attention 2] Add flash attention 2 for GPT-J (#28295)

2024-03-13 08:43:00 +01:00

Make schedulers picklable by making lr_lambda fns global (#21768)

2023-03-02 12:08:43 -05:00

peft_integration

FIX [CI]: Fix failing tests for peft integration (#29330)

2024-02-29 03:56:16 +01:00

fix image-to-text batch incorrect output issue (#29342)

2024-03-08 11:11:10 +00:00

Exllama kernels support for AWQ models (#28634)

2024-03-05 03:22:48 +01:00

Allow # Ignore copy (#27328)

2023-12-07 10:00:08 +01:00

Update all references to canonical models (#29001)

2024-02-16 08:16:58 +01:00

Update all references to canonical models (#29001)

2024-02-16 08:16:58 +01:00

Add support for for loops in python interpreter (#24429)

2023-06-26 09:58:14 -04:00

[tests] use the correct n_gpu in TrainerIntegrationTest::test_train_and_eval_dataloaders for XPU (#29307)

2024-03-08 10:52:25 -05:00

Update all references to canonical models (#29001)

2024-02-16 08:16:58 +01:00

__init__.py

…

test_backbone_common.py

Align backbone stage selection with out_indices & out_features (#27606)

2023-12-20 18:33:17 +00:00

test_cache_utils.py

Generate: add tests for caches with pad_to_multiple_of (#29462)

2024-03-06 10:57:04 +00:00

test_configuration_common.py

[PretrainedConfig] Improve messaging (#27438)

2023-11-15 14:10:39 +01:00

test_configuration_utils.py

Update all references to canonical models (#29001)

2024-02-16 08:16:58 +01:00

test_feature_extraction_common.py

Split common test from core tests (#24284)

2023-06-15 07:30:24 -04:00

test_feature_extraction_utils.py

Remove-auth-token (#27060)

2023-11-13 14:20:54 +01:00

test_image_processing_common.py

Raise unused kwargs image processor (#29063)

2024-02-20 16:20:20 +01:00

test_image_processing_utils.py

Remove-auth-token (#27060)

2023-11-13 14:20:54 +01:00

test_image_transforms.py

Normalize floating point cast (#27249)

2023-11-10 15:35:27 +00:00

test_modeling_common.py

Add tests for batching support (#29297)

2024-03-12 17:46:19 +00:00

test_modeling_flax_common.py

[Flax] Update no init test for Flax v0.7.1 (#28735)

2024-01-26 18:20:39 +00:00

test_modeling_flax_utils.py

Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599)

2024-01-23 10:28:23 +01:00

test_modeling_tf_common.py

Add tf_keras imports to prepare for Keras 3 (#28588)

2024-01-30 17:26:36 +00:00

test_modeling_tf_utils.py

Add tf_keras imports to prepare for Keras 3 (#28588)

2024-01-30 17:26:36 +00:00

test_modeling_utils.py

Experimental loading of MLX files (#29511)

2024-03-11 18:42:06 +00:00

test_pipeline_mixin.py

Image Feature Extraction pipeline (#28216)

2024-02-05 14:50:07 +00:00

test_processing_common.py

Don't save processor_config.json if a processor has no extra attribute (#28584)

2024-01-19 09:59:14 +00:00

test_sequence_feature_extraction_common.py

Fix typo (#25966)

2023-09-05 10:12:25 +02:00

test_tokenization_common.py

Update all references to canonical models (#29001)

2024-02-16 08:16:58 +01:00

test_tokenization_utils.py

Update all references to canonical models (#29001)

2024-02-16 08:16:58 +01:00