Files
HuggingFace_transformer/docs/source/en
Joao Gante 7ffe25f2b9 Generate: end-to-end compilation (#30788)
* mvp

* added test (a few models need fixes)

* fix a few test cases

* test nits

* harder test 😈

* revert changes in stablelm

* test with improved condition

* add todo

* tmp commit

* merged with main

* nits

* add todo

* final corrections

* add docs for generation compilation

* docs nits

* add  tip

* PR suggestions

* add more details to the compilation docs

* fix cache positions

* cache is now init in generate; update docs

* tag test as flaky

* docs

* post rebase make fixup and other nits

* remove unintended changes

* whisper (encoder-decoder) not supported

* move token default updates to ; add tests for token defaults

* push changes

* manual rebase

* chameleon doesn't support this

* fix test_static_cache_mha_mqa_gqa (broken in another PR)

* docs: dynamic is better with end-to-end compilation
2024-07-29 10:52:13 +01:00
..
2024-07-22 20:21:59 +02:00
2023-09-04 11:15:12 +01:00
2024-07-08 11:52:47 +01:00
2023-12-20 10:37:23 -08:00
2024-07-08 11:52:47 +01:00
2023-11-13 14:20:54 +01:00
2024-07-17 10:41:43 +05:00
2024-04-18 12:49:43 -04:00
2024-07-08 11:52:47 +01:00
2024-06-12 11:33:00 +01:00