Files
HuggingFace_transformer/docs/source/en
Jonathan Mamou e27465c801 Adaptive dynamic number of speculative tokens (#34156)
* initial commit

* update strategy

* add tradeoff FPR TPR with cost

* all probs

* fix

* fix

* fix style

* Update src/transformers/generation/configuration_utils.py

shorter docstring

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* import guard

* fix style

* add is_sklearn_available condition

* vectorizing to flatten the for-loop

* fix style

* disable adaptation for UAG

* update doc

* add TestAssistedCandidateGeneratorUpdateStrategy

* fix style

* protect import

* fix style

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2024-12-05 17:07:33 +01:00
..
2024-12-05 16:14:46 +01:00
2024-12-05 16:14:46 +01:00
2022-04-04 10:25:46 -04:00
2024-07-08 11:52:47 +01:00
2023-12-20 10:37:23 -08:00
2024-07-08 11:52:47 +01:00
2024-12-05 16:14:46 +01:00
2024-10-02 09:12:21 +01:00
2022-04-04 10:25:46 -04:00
2024-12-05 16:14:46 +01:00
2024-11-18 18:41:12 +00:00