HuggingFace_transformer

Author	SHA1	Message	Date
Patrick von Platen	c47394b0c9	refactoring and bug fixing beam search generate	2020-03-05 13:12:50 +01:00
Patrick von Platen	006097f8ad	rename variables named 'word' to 'token' in generate fn (#3119) * fix conflicts * fixed naming bug * make style	2020-03-04 12:01:17 -05:00
Patrick von Platen	6701fb7859	fix beam_search behavior when sampling (#3106) * fix beam_search behavior when sampling * delete print * make correct style	2020-03-04 09:30:51 -05:00
Patrick von Platen	2fdc7f6ce8	correct greedy generation when doing beam search (#3078) * correct greedy generation when doing beam search * improve comment	2020-03-02 12:00:09 -05:00
Sam Shleifer	b54ef78d0c	Bart-CNN (#3059) `generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.	2020-03-02 10:35:53 -05:00
Sam Shleifer	6a37588041	spelling: strictly (#3042)	2020-02-27 10:22:35 -05:00
Bram Vanroy	a143d9479e	Add local_files_only parameter to pretrained items (#2930) * Add disable_outgoing to pretrained items Setting disable_outgoing=True disables outgoing traffic: - etags are not looked up - models are not downloaded * parameter name change * Remove forgotten print	2020-02-24 14:58:15 -05:00
Patrick von Platen	fc38d4c86f	Improve special_token_id logic in run_generation.py and add tests (#2885) * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomly initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed * delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * changed fast random lm generation testing design to more general one * delete in old testing design in gpt2 * correct old variable name * temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed * adapted all fast random generate tests to new design * better warning description in modeling_utils * better comment * better comment and error message Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-02-21 12:09:59 -05:00
Sam Shleifer	53ce3854a1	New BartModel (#2745) * Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs	2020-02-20 18:11:13 -05:00
Sam Shleifer	ef74b0f07a	get_activation('relu') provides a simple mapping from strings i… (#2807) * activations.py contains a mapping from string to activation function * resolves some `gelu` vs `gelu_new` ambiguity	2020-02-13 08:28:33 -05:00
thomwolf	c6c5c3fd4e	style and quality	2020-02-07 08:58:06 +01:00
thomwolf	961c69776f	@julien-c proposal for TF/PT compat in hf_buckets	2020-02-07 08:53:17 +01:00
Lysandre	6c1b23554f	Sample instead of greedy decoding by default in generate	2020-02-03 17:23:53 -05:00
Julien Chaumond	b85c59f997	config.architectures	2020-01-30 19:26:59 -05:00
Julien Chaumond	11b13e94a3	Add type to help my IDE out	2020-01-24 14:00:57 -05:00
Lysandre	24d5ad1dcc	Run the examples in slow	2020-01-23 09:38:45 -05:00
Lysandre	0e9899f451	Fixes	2020-01-23 09:38:45 -05:00
Lysandre	00df3d4de0	ALBERT Modeling + required changes to utilities	2020-01-23 09:38:45 -05:00
Julien Chaumond	83a41d39b3	💄 super	2020-01-15 18:33:50 -05:00
Julien Chaumond	2f32dfd33b	Convention: name mixins mixins	2020-01-11 01:24:29 +00:00
Julien Chaumond	84c0aa1868	num_parameters helper	2020-01-10 17:40:02 +00:00
alberduris	81d6841b4b	GPU text generation: moved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	0ffc8eaf53	Enforce target version for black. This should stabilize formatting.	2020-01-05 12:52:14 -05:00
Thomas Wolf	492bea9aa0	Merge pull request #2292 from patrickvonplaten/add_cached_past_for_language_generation Add cached past for language generation	2019-12-27 10:33:27 +01:00
patrickvonplaten	0f6017bee3	improve comments for examples	2019-12-26 00:35:11 +01:00
patrickvonplaten	87c8fca9bc	add example for ctrl text generation in docs	2019-12-26 00:29:19 +01:00
patrickvonplaten	88def24c45	merge conflicts - renamed to previous_token singular	2019-12-26 00:27:16 +01:00
patrickvonplaten	822f725a07	duplicated line for repeating_words_penalty_for_language_generation	2019-12-26 00:25:29 +01:00
patrickvonplaten	fc84bd5254	adapt style to predefined style layout	2019-12-25 23:32:44 +01:00
patrickvonplaten	deff792bb6	add prepare inputs for transfo_xl and xlnet	2019-12-25 23:17:24 +01:00
patrickvonplaten	9398058e19	add easy tensor shape match test	2019-12-25 23:17:24 +01:00
patrickvonplaten	90cda45e9e	add past re-ordering for beam search	2019-12-25 23:17:24 +01:00
patrickvonplaten	6bca56fdb0	check for self.config.mem_len instead of self.mem_len in _do_output_past	2019-12-25 23:17:24 +01:00
patrickvonplaten	365ccd0af2	make if statements cleaner for prepare_inputs_for_generation	2019-12-25 23:17:24 +01:00
patrickvonplaten	d039c679d2	better naming for if statement	2019-12-25 23:17:24 +01:00
patrickvonplaten	7e0c5c731a	changed do_output_past function to check for self.config.output_past instead of self.output_past	2019-12-25 23:17:24 +01:00
patrickvonplaten	eeaa402cd4	rename comments	2019-12-25 23:17:24 +01:00
patrickvonplaten	7bb4271291	remove ipdb debugging statements	2019-12-25 23:17:24 +01:00
patrickvonplaten	267587c258	add and improve comments	2019-12-25 23:17:24 +01:00
patrickvonplaten	d891fd0ae0	add past hidden key states for more efficient language generation & add prepare_inputs for gpt2 and ctrl model	2019-12-25 23:17:24 +01:00
Thomas Wolf	aeef4823ab	Merge pull request #2303 from patrickvonplaten/fix_error_with_repetition_penalty fix repetition penalty error in modeling_utils.py	2019-12-25 22:39:20 +01:00
James Noeckel	e1844d9a45	use positional arguments due to inconsistent API	2019-12-25 01:34:02 -08:00
James Noeckel	9fb7addd4d	revert erroneous fix	2019-12-24 22:26:09 -08:00
patrickvonplaten	18e5bdbec5	fix repetition penalty error in modeling_utils.py	2019-12-24 17:18:05 +01:00
Aymeric Augustin	4c09a96096	Simplify re-raising exceptions. Most module use the simpler `raise` version. Normalize those that don't.	2019-12-23 21:20:54 +01:00
James Noeckel	398bb03f98	fix out-of-place call to scatter, whose named argument name is source, not src	2019-12-22 23:30:52 -08:00
Aymeric Augustin	c824d15aa1	Remove __future__ imports.	2019-12-22 17:47:54 +01:00
Aymeric Augustin	6be7cdda66	Move source code inside a src subdirectory. This prevents transformers from being importable simply because the CWD is the root of the git repository, while not being importable from other directories. That led to inconsistent behavior, especially in examples. Once you fetch this commit, in your dev environment, you must run: $ pip uninstall transformers $ pip install -e .	2019-12-22 14:15:13 +01:00

49 Commits