Joao Gante
3c2383b1c6
Generate: general test for decoder-only generation from inputs_embeds ( #25687 )
...
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-08-23 19:17:01 +01:00
Joao Gante
3f9cb33504
Generate: fix default max length warning ( #25539 )
2023-08-16 15:30:54 +01:00
Joao Gante
123ad5363f
Generation: strict generation config validation at save time ( #25411 )
...
* strict gen config save; Add tests
* add note that the warning will be an exception in v4.34
2023-08-10 10:42:34 +01:00
hukuda222
cb3c821cb7
aligned sample_beam output selection with beam_search ( #25375 )
...
* aligned sample_beam specs with beam_search
* pull origin main
* Revert "pull origin main"
This reverts commit 06d356f1137bb52272e120a03636598c44449cf3.
* update test_utils.py
* fix format
* remove comment
---------
Co-authored-by: Shogo Fujita <shogo.fujita@legalontech.jp>
2023-08-09 18:28:57 +02:00
Guillaume "Vermeille" Sanchez
d533465150
add CFG for .generate() ( #24654 )
2023-08-06 20:15:24 +01:00
Yih-Dar
224da5df69
update use_auth_token -> token ( #25083 )
...
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-07-26 15:09:59 +02:00
Benjamin Badger
caf5e369fc
Contrastive Search peak memory reduction ( #24120 )
...
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2023-07-20 18:46:53 +01:00
Joao Gante
89136ff7f8
Generate: sequence bias can handle same terminations ( #24822 )
2023-07-20 12:23:17 +01:00
Xiaoli Wang
239ace152b
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
...
* Fix TypeError: Object of type int64 is not JSON serializable
* Convert numpy.float64 and numpy.int64 to float and int for json serialization
* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py
* make style
2023-06-27 12:15:49 +01:00
Joao Gante
5f3efdf762
Generate: group_beam_search requires diversity_penalty>0.0 ( #24456 )
...
* add exception
* update docs
2023-06-27 10:46:39 +01:00
Bowen Bao
a28325e25e
Replace python random with torch.rand to enable dynamo.export ( #24434 )
...
* Replace python random with torch.rand to enable dynamo.export
* revert changes to flax model code
* Remove unused random import
* Fix torch template
* Move torch.manual_seed(0) to right location
2023-06-23 08:17:21 -04:00
Joao Gante
5f0801d174
Generate: add SequenceBiasLogitsProcessor ( #24334 )
2023-06-21 11:14:41 +01:00
Joao Gante
b1ea6b4bf5
Generate: GenerationConfig can overwrite attributes at from_pretrained time ( #24238 )
...
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-06-13 17:59:21 +01:00
Joao Gante
612b2a1a6d
Generate: increase left-padding test atol ( #23448 )
...
increase atol
2023-06-07 11:56:57 +01:00
Matt
f8b2574416
Better TF docstring types ( #23477 )
...
* Rework TF type hints to use | None instead of Optional[] for tf.Tensor
* Rework TF type hints to use | None instead of Optional[] for tf.Tensor
* Don't forget the imports
* Add the imports to tests too
* make fixup
* Refactor tests that depended on get_type_hints
* Better test refactor
* Fix an old hidden bug in the test_keras_fit input creation code
* Fix for the Deit tests
2023-05-24 13:52:52 +01:00
Yih-Dar
2406dbdcfa
Less flaky test_assisted_decoding_matches_greedy_search ( #23451 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-05-18 17:28:22 +02:00
Joao Gante
aea7b23b57
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
Joao Gante
918a06e25d
Generate: add test to check KV format ( #23403 )
...
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-05-16 19:28:19 +01:00
Joao Gante
bbfb9fc22b
Generate: starcoder 🤜 🤛 assisted generation ( #23182 )
...
* starcoder has joined the chat
* indexing that works for all
2023-05-08 10:45:40 +01:00
Mayank Agarwal
c4e32e206f
Add support for beam search's num_return_sequences flag in flax ( #23082 )
...
* add code for numReturnSeq
* add flax support for num return sequences
* Make Fix up for changes
* add test for num return sequences
* lint
2023-05-03 10:50:34 -04:00
Joao Gante
ce31e3c8bf
Generate: slow assisted generation test ( #23125 )
2023-05-03 14:24:50 +01:00
Joao Gante
849367ccf7
Generate: prepare assisted generation for release ( #23052 )
2023-04-29 10:53:30 +01:00
Joao Gante
e4a97f82bf
Generate: assisted generation with sample (take 2) ( #22949 )
...
* temperature controls speed
2023-04-24 19:54:55 +01:00
Joao Gante
78cda46f17
Generate: Add assisted generation ( #22211 )
...
* working mvp
* remove breakpoint
* fix commit
* standardize outputs
* tmp commit
* tests almost ready
* tmp commit
* skip a few models
* Add streaming; Docs and examples
* document limitations
* PR commits
* Amy PR comments
2023-04-18 17:36:56 +01:00
Yih-Dar
90247d3e01
Fix test_eos_token_id_int_and_list_top_k_top_sampling ( #22826 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-18 16:04:51 +02:00
Joao Gante
9dfd6a4baa
Generate: handle text conditioning with multimodal encoder-decoder models ( #22748 )
2023-04-13 19:51:13 +01:00
Joao Gante
861ff890d6
Generate: TextIteratorStreamer timeout ( #22576 )
2023-04-05 09:57:46 +01:00
Joao Gante
1905384fd5
Generate: Add text streamer decoding options ( #22544 )
2023-04-04 09:03:13 +01:00
Joao Gante
a55a822adf
Generate: TextIteratorStreamer (streamer for gradio) ( #22501 )
...
* haha text go brrr (but in gradio)
2023-04-03 15:04:37 +01:00
Joao Gante
228792a9dc
Generate: basic token streaming ( #22449 )
...
* haha tokens go brrrr
2023-03-30 12:00:12 +01:00
Joao Gante
502fec779b
Generate: add test for left-padding support ( #22322 )
2023-03-23 17:00:22 +00:00
Joao Gante
fd3eb3e3cd
Beef up Llama tests ( #22314 )
...
* tmp commit
* beef up llama tests
2023-03-22 15:20:48 +00:00
Joao Gante
12febc20db
Generate: Export TF generate with a TF tokenizer ( #22310 )
...
* Export TF generate with a TF tokenizer
* remove unused lines
2023-03-22 15:00:20 +00:00
Yih-Dar
67c2dbdb54
Time to Say Goodbye, torch 1.7 and 1.8 ( #22291 )
...
* time to say goodbye, torch 1.7 and 1.8
* clean up torch_int_div
* clean up is_torch_less_than_1_8-9
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-21 19:22:01 +01:00
Yih-Dar
5110e5748e
🔥 py38 + torch 2 🔥 🔥 🔥 🚀 ( #22204 )
...
* py38 + torch 2
* increment cache versions
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-03-16 22:59:23 +01:00
Lucain
923110b74f
Remove set_access_token usage + fail tests if FutureWarning ( #22051 )
...
* Remove set_access_token usage + fail tests if FutureWarning
* do not fail on FutureWarning in CI
---------
Co-authored-by: testbot <lucainp@hf.co>
2023-03-09 09:23:48 -05:00
Elad Segal
eec46b4f75
Fix MinNewTokensLengthLogitsProcessor when used with a list of eos tokens ( #21959 )
...
* Fix MinNewTokensLengthLogitsProcessor when used with a list of eos tokens
* fix docs
* Empty commit
* formatting
2023-03-07 11:59:22 +00:00
Arthur
44e3e3fb49
prepare for "__floordiv__ is deprecated and its behavior will change in a future version of pytorch" ( #20211 )
...
* rounding_mode = "floor" instead of // to prevent behavioral change
* add other TODO
* use `torch_int_div` from pytorch_utils
* same for tests
* fix copies
* style
* use relative imports when needed
* Co-authored-by: sgugger <sylvain.gugger@gmail.com>
2023-03-01 10:49:21 +01:00
Aaron Gokaslan
5e8c8eb5ba
Apply ruff flake8-comprehensions ( #21694 )
2023-02-22 09:14:54 +01:00
Joao Gante
13e03e619d
Generate: filter encoder inputs when its signature does not accept wildcards ( #21603 )
2023-02-14 10:46:46 +00:00
Joao Gante
fa4bdb0a40
Generate: correct default model input creation for decoder-only models ( #21580 )
2023-02-13 17:04:49 +00:00
Joao Gante
24273268b7
Generate: Fix flaky indexing error in test_constrained_beam_search_generate_dict_output ( #21561 )
2023-02-13 15:12:07 +00:00
Joao Gante
eb6c59bc78
Generate: TF supports multiple eos tokens ( #21571 )
2023-02-13 12:24:22 +00:00
Joao Gante
2edf9a857b
Generate: TF .generate() can now be exported with dynamic length ( #21474 )
2023-02-09 12:52:30 +00:00
Joao Gante
e69f9715eb
Generate: make TF .generate() signature == PT .generate() signature ( #21525 )
2023-02-09 11:10:13 +00:00
Motoki Wu
9960506cbe
Fix multiple eos_token_ids in model.generate(...) ( #21461 )
...
* add tests with multiple eos_token_ids
* make math.prod instead of sum
* make fixup
* fix long and also use np.prod since math.prod does not exist <python 3.8
* make fixup
* add prod util
* use prod util instead of np.prod
* make fixup
* previous .long location
* use tensor ops
* remove prod
* remove prod
* update device
* make fixup
* fix none
2023-02-08 13:48:46 -05:00
Joao Gante
1d9c26a4b8
Generate: TF compute_transition_scores ( #21341 )
2023-02-08 16:36:43 +00:00
Joao Gante
1e4cf8bb44
Generate: TF can now generate from embeddings in encoder-decoder models ( #21475 )
2023-02-07 11:18:23 +00:00
Sylvain Gugger
6f79d26442
Update quality tooling for formatting ( #21480 )
...
* Result of black 23.1
* Update target to Python 3.7
* Switch flake8 to ruff
* Configure isort
* Configure isort
* Apply isort with line limit
* Put the right black version
* adapt black in check copies
* Fix copies
2023-02-06 18:10:56 -05:00
Joao Gante
4943331015
Generate: TF can now accept custom logits processors ( #21454 )
2023-02-06 15:44:47 +00:00