HuggingFace_transformer

Author	SHA1	Message	Date
Patrick von Platen	e33ed12c3b	uncomment expression	2020-03-05 13:41:04 +01:00
Patrick von Platen	4220fd52b9	remove ipdb	2020-03-05 13:36:21 +01:00
Patrick von Platen	c47394b0c9	refactoring and bug fixing beam search generate	2020-03-05 13:12:50 +01:00
Patrick von Platen	006097f8ad	rename variables named 'word' to 'token' in generate fn (#3119 ) * fix conflits * fixed naming bug * make style	2020-03-04 12:01:17 -05:00
Patrick von Platen	6701fb7859	fix beam_search behavior when sampling (#3106 ) * fix beam_search behavior when sampling * delete print * make correct style	2020-03-04 09:30:51 -05:00
Patrick von Platen	2fdc7f6ce8	correct greedy generation when doing beam search (#3078 ) * correct greedy generation when doing beam search * improve comment	2020-03-02 12:00:09 -05:00
Sam Shleifer	b54ef78d0c	Bart-CNN (#3059 ) `generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.	2020-03-02 10:35:53 -05:00
Sam Shleifer	6a37588041	spelling: strictly (#3042 )	2020-02-27 10:22:35 -05:00
Bram Vanroy	a143d9479e	Add local_files_only parameter to pretrained items (#2930 ) * Add disable_outgoing to pretrained items Setting disable_outgoing=True disables outgonig traffic: - etags are not looked up - models are not downloaded * parameter name change * Remove forgotten print	2020-02-24 14:58:15 -05:00
Patrick von Platen	fc38d4c86f	Improve special_token_id logic in run_generation.py and add tests (#2885 ) * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed * delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed * delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * changed fast random lm generation testing design to more general one * delete in old testing design in gpt2 * correct old variable name * temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed * adapted all fast random generate tests to new design * better warning description in modeling_utils * better comment * better comment and error message Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-02-21 12:09:59 -05:00
Sam Shleifer	53ce3854a1	New BartModel (#2745 ) * Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs	2020-02-20 18:11:13 -05:00
Sam Shleifer	ef74b0f07a	get_activation('relu') provides a simple mapping from strings i… (#2807 ) * activations.py contains a mapping from string to activation function * resolves some `gelu` vs `gelu_new` ambiguity	2020-02-13 08:28:33 -05:00
thomwolf	c6c5c3fd4e	style and quality	2020-02-07 08:58:06 +01:00
thomwolf	961c69776f	@julien-c proposal for TF/PT compat in hf_buckets	2020-02-07 08:53:17 +01:00
Lysandre	6c1b23554f	Sample instead of greedy decoding by default in generate	2020-02-03 17:23:53 -05:00
Julien Chaumond	b85c59f997	config.architectures	2020-01-30 19:26:59 -05:00
Julien Chaumond	11b13e94a3	Add type to help my IDE out	2020-01-24 14:00:57 -05:00
Lysandre	24d5ad1dcc	Run the examples in slow	2020-01-23 09:38:45 -05:00
Lysandre	0e9899f451	Fixes	2020-01-23 09:38:45 -05:00
Lysandre	00df3d4de0	ALBERT Modeling + required changes to utilities	2020-01-23 09:38:45 -05:00
Julien Chaumond	83a41d39b3	💄 super	2020-01-15 18:33:50 -05:00
Julien Chaumond	2f32dfd33b	Convention: name mixins mixins	2020-01-11 01:24:29 +00:00
Julien Chaumond	84c0aa1868	num_parameters helper	2020-01-10 17:40:02 +00:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	0ffc8eaf53	Enforce target version for black. This should stabilize formatting.	2020-01-05 12:52:14 -05:00
Thomas Wolf	492bea9aa0	Merge pull request #2292 from patrickvonplaten/add_cached_past_for_language_generation Add cached past for language generation	2019-12-27 10:33:27 +01:00
patrickvonplaten	0f6017bee3	improve comments for examples	2019-12-26 00:35:11 +01:00
patrickvonplaten	87c8fca9bc	add example for ctrl text generation in docs	2019-12-26 00:29:19 +01:00
patrickvonplaten	88def24c45	merge conflicts - renamed to previous_token singular	2019-12-26 00:27:16 +01:00
patrickvonplaten	822f725a07	duplicated line for repeating_words_penalty_for_language_generation	2019-12-26 00:25:29 +01:00
patrickvonplaten	fc84bd5254	adapt style to predefined style layout	2019-12-25 23:32:44 +01:00
patrickvonplaten	deff792bb6	add prepare inputs for transfo_xl and xlnet	2019-12-25 23:17:24 +01:00
patrickvonplaten	9398058e19	add easy tensor shape match test	2019-12-25 23:17:24 +01:00
patrickvonplaten	90cda45e9e	add past re-ordering for beam search	2019-12-25 23:17:24 +01:00
patrickvonplaten	6bca56fdb0	check for self.config.mem_len instead of self.mem_len in _do_output_past	2019-12-25 23:17:24 +01:00
patrickvonplaten	365ccd0af2	make if statements cleaner for prepare_inputs_for_generation	2019-12-25 23:17:24 +01:00
patrickvonplaten	d039c679d2	better naming for if statement	2019-12-25 23:17:24 +01:00
patrickvonplaten	7e0c5c731a	changed do_output_past function to check for self.config.output_past instead of self.output_past	2019-12-25 23:17:24 +01:00
patrickvonplaten	eeaa402cd4	rename comments	2019-12-25 23:17:24 +01:00
patrickvonplaten	7bb4271291	remove ipdb debugging statements	2019-12-25 23:17:24 +01:00
patrickvonplaten	267587c258	add and improve comments	2019-12-25 23:17:24 +01:00
patrickvonplaten	d891fd0ae0	add past hidden key states for more efficient language generation & add prepare_inputs for gpt2 and ctrl model	2019-12-25 23:17:24 +01:00
Thomas Wolf	aeef4823ab	Merge pull request #2303 from patrickvonplaten/fix_error_with_repetition_penalty fix repetition penalty error in modeling_utils.py	2019-12-25 22:39:20 +01:00
James Noeckel	e1844d9a45	use positional arguments due to inconsistent API	2019-12-25 01:34:02 -08:00
James Noeckel	9fb7addd4d	revert erroneous fix	2019-12-24 22:26:09 -08:00
patrickvonplaten	18e5bdbec5	fix repetition penalty error in modeling_utils.py	2019-12-24 17:18:05 +01:00
Aymeric Augustin	4c09a96096	Simplify re-raising exceptions. Most module use the simpler `raise` version. Normalize those that don't.	2019-12-23 21:20:54 +01:00
James Noeckel	398bb03f98	fix out-of-place call to scatter, whose named argument name is source, not src	2019-12-22 23:30:52 -08:00
Aymeric Augustin	c824d15aa1	Remove __future__ imports.	2019-12-22 17:47:54 +01:00

1 2

51 Commits