Generate: inner decoding methods are no longer public (#29437)

This commit is contained in:
Joao Gante
2024-03-05 10:27:36 +00:00
committed by GitHub
parent 4d892b7297
commit 87a0783dde
11 changed files with 117 additions and 104 deletions

View File

@@ -389,3 +389,6 @@ just like in multinomial sampling. However, in assisted decoding, reducing the t
>>> tokenizer.batch_decode(outputs, skip_special_tokens=True)
['Alice and Bob are going to the same party. It is a small party, in a small']
```
Alternativelly, you can also set the `prompt_lookup_num_tokens` to trigger n-gram based assisted decoding, as opposed
to model based assisted decoding. You can read more about it [here](https://twitter.com/joao_gante/status/1747322413006643259).