HuggingFace_transformer

Author	SHA1	Message	Date
Patrick von Platen	6047f46b19	re-add eos token to get good bart results	2020-03-12 20:17:50 +01:00
Patrick von Platen	c11160114a	small clean-up	2020-03-12 20:02:35 +01:00
Patrick von Platen	a332cc9f7f	finalize generation merge	2020-03-11 11:53:36 +01:00
Patrick von Platen	d997ac7810	fix typo	2020-03-11 11:06:56 +01:00
Patrick von Platen	7351a8dbaf	re-add scoring filtering	2020-03-11 11:06:56 +01:00
Patrick von Platen	374deef48d	fixed typo	2020-03-11 11:06:56 +01:00
Patrick von Platen	ca2047bc35	refactor variable naming and improve tf generate in line with torch generate	2020-03-11 11:06:56 +01:00
patrickvonplaten	41b437ea3a	add draft version of propsoed changes for ROGUE score	2020-03-11 11:06:56 +01:00
patrickvonplaten	629aac92ec	do not allow do_sample and weird force bos token things	2020-03-11 11:06:56 +01:00
patrickvonplaten	d880a5fbde	finalized PR	2020-03-11 11:06:56 +01:00
patrickvonplaten	2acfe63964	best current version and make style	2020-03-11 11:06:56 +01:00
patrickvonplaten	c62444da39	fix conflicts	2020-03-11 11:06:56 +01:00
Patrick von Platen	333affcb81	add current changes	2020-03-11 11:06:56 +01:00
Patrick von Platen	7a11e925cf	work in progress	2020-03-11 11:06:56 +01:00
Patrick von Platen	7cba11fb9b	better naming	2020-03-11 11:06:56 +01:00
Patrick von Platen	ff648221bd	fix conflicts	2020-03-11 11:06:56 +01:00
Patrick von Platen	c0d9dd3ba9	refactored code a bit and made more generic	2020-03-11 11:06:56 +01:00
Patrick von Platen	d8e2b3c547	fix conflicts	2020-03-11 11:06:56 +01:00
Lysandre Debut	146c521235	Merge branch 'master' into add_models_special_tokens_to_specific_configs	2020-03-05 17:24:42 -05:00
Lysandre Debut	0001d05686	Correct missing keys + test (#3143)	2020-03-05 17:01:54 -05:00
Patrick von Platen	e33ed12c3b	uncomment expression	2020-03-05 13:41:04 +01:00
Patrick von Platen	4220fd52b9	remove ipdb	2020-03-05 13:36:21 +01:00
Patrick von Platen	c47394b0c9	refactoring and bug fixing beam search generate	2020-03-05 13:12:50 +01:00
Patrick von Platen	006097f8ad	rename variables named 'word' to 'token' in generate fn (#3119) * fix conflits * fixed naming bug * make style	2020-03-04 12:01:17 -05:00
Patrick von Platen	6701fb7859	fix beam_search behavior when sampling (#3106) * fix beam_search behavior when sampling * delete print * make correct style	2020-03-04 09:30:51 -05:00
Patrick von Platen	2fdc7f6ce8	correct greedy generation when doing beam search (#3078) * correct greedy generation when doing beam search * improve comment	2020-03-02 12:00:09 -05:00
Sam Shleifer	b54ef78d0c	Bart-CNN (#3059) `generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.	2020-03-02 10:35:53 -05:00
Sam Shleifer	6a37588041	spelling: strictly (#3042)	2020-02-27 10:22:35 -05:00
Patrick von Platen	ec16142ee5	add special tokens to pretrain configs of respective lm head models	2020-02-25 16:37:59 +01:00
Bram Vanroy	a143d9479e	Add local_files_only parameter to pretrained items (#2930) * Add disable_outgoing to pretrained items Setting disable_outgoing=True disables outgonig traffic: - etags are not looked up - models are not downloaded * parameter name change * Remove forgotten print	2020-02-24 14:58:15 -05:00
Patrick von Platen	fc38d4c86f	Improve special_token_id logic in run_generation.py and add tests (#2885 ) * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed * delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed 
* delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * changed fast random lm generation testing design to more general one * delete in old testing design in gpt2 * correct old variable name * temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed * adapted all fast random generate tests to new design * better warning description in modeling_utils * better comment * better comment and error message Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-02-21 12:09:59 -05:00
Sam Shleifer	53ce3854a1	New BartModel (#2745) * Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs	2020-02-20 18:11:13 -05:00
Sam Shleifer	ef74b0f07a	get_activation('relu') provides a simple mapping from strings i… (#2807) * activations.py contains a mapping from string to activation function * resolves some `gelu` vs `gelu_new` ambiguity	2020-02-13 08:28:33 -05:00
thomwolf	c6c5c3fd4e	style and quality	2020-02-07 08:58:06 +01:00
thomwolf	961c69776f	@julien-c proposal for TF/PT compat in hf_buckets	2020-02-07 08:53:17 +01:00
Lysandre	6c1b23554f	Sample instead of greedy decoding by default in generate	2020-02-03 17:23:53 -05:00
Julien Chaumond	b85c59f997	config.architectures	2020-01-30 19:26:59 -05:00
Julien Chaumond	11b13e94a3	Add type to help my IDE out	2020-01-24 14:00:57 -05:00
Lysandre	24d5ad1dcc	Run the examples in slow	2020-01-23 09:38:45 -05:00
Lysandre	0e9899f451	Fixes	2020-01-23 09:38:45 -05:00
Lysandre	00df3d4de0	ALBERT Modeling + required changes to utilities	2020-01-23 09:38:45 -05:00
Julien Chaumond	83a41d39b3	💄 super	2020-01-15 18:33:50 -05:00
Julien Chaumond	2f32dfd33b	Convention: name mixins mixins	2020-01-11 01:24:29 +00:00
Julien Chaumond	84c0aa1868	num_parameters helper	2020-01-10 17:40:02 +00:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	0ffc8eaf53	Enforce target version for black. This should stabilize formatting.	2020-01-05 12:52:14 -05:00
Thomas Wolf	492bea9aa0	Merge pull request #2292 from patrickvonplaten/add_cached_past_for_language_generation Add cached past for language generation	2019-12-27 10:33:27 +01:00
patrickvonplaten	0f6017bee3	improve comments for examples	2019-12-26 00:35:11 +01:00
patrickvonplaten	87c8fca9bc	add example for ctrl text generation in docs	2019-12-26 00:29:19 +01:00


72 Commits