HuggingFace_transformer/utils at e314395277d784a34ee99526f48155d4d62cff3d

SUMIN/HuggingFace_transformer

Arthur e314395277 Refactor flash attention implementation in transformers (#31446)

* dumb commit

* nit

* update

* something like this

* unpack in modeling utils

* safe import

* oups

* update

* nits

* diff convert gemma

* update

* start propagating

* udpate other modeling code as well

* update for sliding window models

* nits

* more init cleanups

* styling

* fixup

* noice

* pass fixup

* typo typing_extension -> typing_extensions

* torch.nn.functionnal -> torch.nn.functional

* add to import structure

* unpack

* simplify a bit more for this first version

* nut

* update

* update

* nit

* ease the import of `Unpack`

* remove useless `use_sliding_window`

* no qua please

* protect import?

* style

* [run-slow]

* [run slow] llama,gemma,mistral,mixtral

* remove extra kwargs

* fix llama

* address review comments

* apply diff_model_converter to modeling_gemma.py

* remove cache_position 1

* remove cache_position 2

* some cleaning

* refactor gemma2 as well

* apply review comments

* rename file to modeling_flash_attention_utils.py

* siglip refactor

* remove dead code

* is the hub down?

* still down?

* fix siglip

* fix gemma2

* fatal: Could not read from remote repository.

* fix typo in softcap implem

* flacky

* Failed: Timeout >120.0s

---------

Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

2024-07-11 20:37:31 +08:00

| Name | Last commit | Date |
|---|---|---|
| | AutoImageProcessor (#20111) | 2022-11-08 19:54:41 +00:00 |
| | Check TF ops for ONNX compliance (#10025) | 2021-02-15 07:55:10 -05:00 |
| add_pipeline_model_mapping_to_test.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| check_build.py | Clean up CUDA kernels (#23455) | 2023-05-18 14:14:43 -04:00 |
| check_config_attributes.py | Refactor flash attention implementation in transformers (#31446) | 2024-07-11 20:37:31 +08:00 |
| check_config_docstrings.py | Update all references to canonical models (#29001) | 2024-02-16 08:16:58 +01:00 |
| check_copies.py | Improve error message for mismatched copies in code blocks (#31535) | 2024-06-25 13:55:11 +02:00 |
| check_doc_toc.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| check_docstrings.py | Remove ConversationalPipeline and Conversation object (#31165) | 2024-06-07 17:50:18 +01:00 |
| check_doctest_list.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| check_dummies.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| check_inits.py | Loading GGUF files support (#30391) | 2024-05-15 14:28:20 +02:00 |
| check_model_tester.py | Add a new script to check model testers' config (#22063) | 2023-03-13 19:11:19 +01:00 |
| check_repo.py | Add video modality for InstrucBLIP (#30182) | 2024-06-25 15:45:39 +05:00 |
| check_self_hosted_runner.py | Tiny fix for check_self_hosted_runner.py (#24052) | 2023-06-06 18:17:41 +02:00 |
| check_support_list.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| check_table.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| check_tf_ops.py | Check TF ops for ONNX compliance (#10025) | 2021-02-15 07:55:10 -05:00 |
| create_dummy_models.py | Pass datasets trust_remote_code (#31406) | 2024-06-17 17:29:13 +01:00 |
| custom_init_isort.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| deprecate_models.py | Remove copied froms for deprecated models (#31153) | 2024-06-03 09:42:53 +01:00 |
| diff_model_converter.py | Fix typos (#31819) | 2024-07-08 11:52:47 +01:00 |
| download_glue_data.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| extract_warnings.py | update github actions packages' version to suppress warnings (#30249) | 2024-04-15 15:08:09 +02:00 |
| get_ci_error_statistics.py | Add artifact name in job step to maintain job / artifact correspondence (#28682) | 2024-01-31 15:58:17 +01:00 |
| get_github_job_time.py | Make Slack CI reporting stronger (#21823) | 2023-02-28 17:12:44 +01:00 |
| get_modified_files.py | exclude deleted files in the fixup script (#21436) | 2023-02-03 12:57:02 -05:00 |
| get_previous_daily_ci.py | Update workflow_id in utils/get_previous_daily_ci.py (#30695) | 2024-05-07 16:58:50 +02:00 |
| get_test_info.py | Add an utility file to get information from test files (#21856) | 2023-03-01 17:53:29 +01:00 |
| important_models.txt | ENH: [CI] Add new workflow to run slow tests of important models on push main if they are modified (#29235) | 2024-04-12 10:01:28 +02:00 |
| models_to_deprecate.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| not_doctested.txt | Remove ConversationalPipeline and Conversation object (#31165) | 2024-06-07 17:50:18 +01:00 |
| notification_service_doc_tests.py | Refactor doctest (#30210) | 2024-04-15 13:20:36 +02:00 |
| notification_service_quantization.py | Revive Nightly/Past CI (#31159) | 2024-06-20 18:57:24 +02:00 |
| notification_service.py | Revive Nightly/Past CI (#31159) | 2024-06-20 18:57:24 +02:00 |
| past_ci_versions.py | (Re-)Enable Nightly + Past CI (#22393) | 2023-03-30 21:06:35 +02:00 |
| patch_helper.py | helper (#31152) | 2024-05-31 08:49:33 +02:00 |
| pr_slow_ci_models.py | Avoid duplication in PR slow CI model list (#30634) | 2024-05-03 18:19:30 +02:00 |
| print_env.py | Print more library versions in CI (#17384) | 2022-06-02 10:24:16 +02:00 |
| release.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| set_cuda_devices_for_ci.py | Fix Cohere CI (#31263) | 2024-06-10 15:16:58 +02:00 |
| slow_documentation_tests.txt | Update CodeLlama references (#30218) | 2024-05-09 22:57:52 +02:00 |
| sort_auto_mappings.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| split_doctest_jobs.py | Refactor doctest (#30210) | 2024-04-15 13:20:36 +02:00 |
| split_model_tests.py | consistent job / pytest report / artifact name correspondence (#30392) | 2024-04-24 22:32:42 +02:00 |
| tests_fetcher.py | Fix test fetcher (doctest) + Idefics2's doc example (#30274) | 2024-04-16 21:25:06 +02:00 |
| update_metadata.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |
| update_tiny_models.py | update ruff version (#30932) | 2024-05-22 06:40:15 +02:00 |