Daniel Korat
fb0c6b521d
Universal Assisted Generation: Assisted generation with any assistant model (by Intel Labs) (#33383)
* Update candidate_generator.py
* Update utils.py
* add lookbehind params to _get_candidate_generator
* make fixup
* add unit tests
* fix failing tests
* add docstrings
* fix docstrings; remove non-optimized AnyTokenizer
* added any tokenizer generation correctness test
* make fixup
* fix assertion syntax
* PR review fixes
* address additional PR comments
* fix tests
* remove stopping criteria arg
* make fixup
* add AssistantConfig
* fix prev_tokens branching
* pass tokenizers through `generate()` kwargs
* fix lookbehind values; tokenizer params WIP
* fixup
* AssistantConfig
* remove AssistantConfig; apply PR suggestions
* restructure tests
* fixup
* fix assistant_tokenizer arg validation
* fixup
* fix tests in TestAssistedCandidateGeneratorDifferentTokenizers
* fix class docstring
* PR suggestions
* doc
* doc update and improvements to `_validate_assistant()`
---------
Co-authored-by: mosheber <moshe.berchansky@intel.com>
2024-10-10 14:41:53 +02:00