HuggingFace_transformer

Author	SHA1	Message	Date
Julien Chaumond	d4c2cb402d	Kill model archive maps (#4636 ) * Kill model archive maps * Fixup * Also kill model_archive_map for MaskedBertPreTrainedModel * Unhook config_archive_map * Tokenizers: align with model id changes * make style && make quality * Fix CI	2020-06-02 09:39:33 -04:00
Patrick von Platen	a27c795908	fix (#4419 )	2020-05-18 15:51:40 +02:00
Jared T Nielsen	64070cbb88	Fix TF input docstrings to refer to tf.Tensor rather than torch.FloatTensor. (#4051 )	2020-04-30 14:28:56 +02:00
Julien Chaumond	455c639093	CDN urls (#4030 ) * [file_utils] use_cdn + documentation * Move to cdn. urls for weights * [urls] Hotfix for bert-base-japanese	2020-04-28 20:27:14 -04:00
Patrick von Platen	38f7461df3	[TFT5, Cache] Add cache to TFT5 (#3772 ) * correct gpt2 test inputs * make style * delete modeling_gpt2 change in test file * translate from pytorch * correct tests * fix conflicts * fix conflicts * fix conflicts * fix conflicts * make tensorflow t5 caching work * make style * clean reorder cache * remove unnecessary spaces * fix test	2020-04-16 16:14:52 +02:00
Patrick von Platen	01c37dcdb5	[Config, Caching] Remove `output_past` everywhere and replace by `use_cache` argument (#3734 ) * remove output_past from pt * make style * add optional input length for gpt2 * add use cache to prepare input * save memory in gpt2 * correct gpt2 test inputs * make past input optional for gpt2 * finish use_cache for all models * make style * delete modeling_gpt2 change in test file * correct docstring * correct is true statements for gpt2	2020-04-14 14:40:28 -04:00
Patrick von Platen	092cf881a5	[Generation, EncoderDecoder] Apply Encoder Decoder 1.5GB memory… (#3778 )	2020-04-13 22:29:28 -04:00
Patrick von Platen	f68d22850c	delete bogus print statement (#3595 )	2020-04-02 21:49:34 +02:00
Patrick von Platen	b815edf69f	[T5, Testst] Add extensive hard-coded integration tests and make sure PT and TF give equal results (#3550 ) * add some t5 integration tests * finish summarization and translation integration tests for T5 - results loook good * add tf test * fix == vs is bug * fix tf beam search error and make tf t5 tests pass	2020-04-01 18:01:33 +02:00
Patrick von Platen	b38d552a92	[Generate] Add bad words list argument to the generate function (#3367 ) * add bad words list * make style * add bad_words_tokens * make style * better naming * make style * fix typo	2020-03-31 18:42:31 +02:00
Patrick von Platen	75ec6c9e3a	[T5] make decoder input ids optional for t5 training (#3521 ) * make decoder input ids optional for t5 training * lm_lables should not be shifted in t5 * add tests * finish shift right functionality for PT T5 * move shift right to correct class * cleaner code * replace -100 values with pad token id * add assert statement * remove unnecessary for loop * make style	2020-03-30 13:45:26 +02:00
LysandreJik	e2c05f06ef	Correct indentation in docstring For some reason Sphinx extremely dislikes this and crashes.	2020-03-27 09:28:52 -04:00
Patrick von Platen	9c683ef01e	Add t5 to pipeline(task='summarization') (#3413 ) * solve conflicts * move warnings below * incorporate changes * add pad_to_max_length to pipelines * add bug fix for T5 beam search * add prefix patterns * make style * fix conflicts * adapt pipelines for task specific parameters * improve docstring * remove unused patterns	2020-03-26 11:03:13 +01:00
Patrick von Platen	ffa17fe322	Extend config with task specific configs. (#3433 ) * add new default configs * change prefix default to None	2020-03-25 21:32:04 +01:00
Patrick von Platen	95e00d0808	Clean special token init in modeling_....py (#3264 ) * make style * fix conflicts	2020-03-20 21:41:04 +01:00
Patrick von Platen	bbf26c4e61	Support T5 Generation (#3228 ) * fix conflicts * update bart max length test * correct spelling mistakes * implemented model specific encode function * fix merge conflicts * better naming * save intermediate state -> need to rethink strucuture a bit * leave tf problem as it is for now * current version * add layers.pop * remove ipdb * make style * clean return cut decoding * remove ipdbs * Fix restoring layers in the decoders that doesnt exists. * push good intermediate solution for now * fix conflicts * always good to refuse to merge conflicts when rebasing * fix small bug * improve function calls * remove unused file * add correct scope behavior for t5_generate Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-03-19 23:18:23 +01:00
Patrick von Platen	ddb10c6447	improve doctstring (#3327 )	2020-03-18 13:24:09 +01:00
Patrick von Platen	e8f44af5bf	[generate] do_sample default back to False (#3298 ) * change do_samples back * None better default as boolean * adapt do_sample to True in test example * make style	2020-03-17 10:52:37 -04:00
Patrick von Platen	1ba21f96ca	fix bug in tf no_repeat_ngram_size	2020-03-11 11:06:56 +01:00
Patrick von Platen	9b8ee8cea0	delete print and make style	2020-03-11 11:06:56 +01:00
Patrick von Platen	ca1330f0b2	do not mess with the negative sign	2020-03-11 11:06:56 +01:00
Patrick von Platen	10989715d0	rename variable	2020-03-11 11:06:56 +01:00
Patrick von Platen	cf06290565	remove ipdb	2020-03-11 11:06:56 +01:00
Patrick von Platen	a2c8e516c2	fix torch to tf translation	2020-03-11 11:06:56 +01:00
Patrick von Platen	ca2047bc35	refactor variable naming and improve tf generate in line with torch generate	2020-03-11 11:06:56 +01:00
Patrick von Platen	3e624c64ca	fix repetition penalty mask in tf	2020-03-09 14:55:11 +01:00
Thomas Wolf	0416d437fb	Merge pull request #3148 from patrickvonplaten/refactoring_beam_search_for_tf_2 refactored beam search according to torch implementation	2020-03-06 22:01:46 +01:00
Thomas Wolf	9499a3778e	Merge pull request #3103 from gthb/keras-serialization Support keras JSON/HDF5 serialization of main layers	2020-03-06 12:59:13 +01:00
patrickvonplaten	9362eb4a07	refactored beam search according to torch implementation	2020-03-06 00:46:29 +01:00
Gunnlaugur Thor Briem	4c91a3af94	Document keras_serializable decorator	2020-03-05 11:48:10 +00:00
Gunnlaugur Thor Briem	4be01e5cbf	Use name transformers_config in Keras serialization Be explicit that this is config for the transformers package (as these layers may coexist with other custom stuff in a Keras model, plus the Keras container itself is called config, and config["config"] is not great) Add explicit error handling for initializer calls that have neither the `config` nor the `transformers_config` argument, or have both.	2020-03-05 11:47:35 +00:00
Gunnlaugur Thor Briem	a355f4f0fc	Add functools.wraps for wrapper initializer Preserve the original initializer function's metadata. See https://docs.python.org/3/library/functools.html#functools.update_wrapper	2020-03-05 11:18:50 +00:00
Gunnlaugur Thor Briem	4f338ed407	Explicit config_class instead of module inspection	2020-03-04 23:45:29 +00:00
Gunnlaugur Thor Briem	18f4b9274f	fix: work with Tensorflow < 2.1.0 tf.keras.utils.register_keras_serializable was added in TF 2.1.0, so don't rely on it being there; just decorate the class with it if it exists.	2020-03-04 16:57:29 +00:00
Patrick von Platen	7a89a3e493	correct beam search sampling	2020-03-04 17:27:47 +01:00
Patrick von Platen	c4c4c9998a	make GPT2 and CTRL shape consistent between torch and TF	2020-03-04 17:27:47 +01:00
patrickvonplaten	2529b2d37e	set redorder past sort dimension to its default	2020-03-04 17:27:47 +01:00
patrickvonplaten	61fef6e957	added beam_search generation for tf 2.0	2020-03-04 17:27:47 +01:00
Gunnlaugur Thor Briem	470753bcf5	Put @keras_serializable only on layers it works on And only run the test on TF*MainLayer classes so marked.	2020-03-03 22:44:45 +00:00
Gunnlaugur Thor Briem	0c716ede8c	Use class decorator instead of superclass When supplied by Keras deserialization, the config parameter to initializers will be a dict. So intercept it and convert to PretrainedConfig object (and store in instance attribute for get_config to get at it) before passing to the actual initializer. To accomplish this, and repeat as little code as possible, use a class decorator on TF*MainLayer classes.	2020-03-03 22:31:42 +00:00
Gunnlaugur Thor Briem	ba28170717	Support keras JSON/HDF5 serialization of main layers Fixes #3101	2020-03-03 15:21:41 +00:00
Patrick von Platen	4134100363	Add generate() functionality to TF 2.0 (#3063 ) * add first copy past test to tf 2 generate * add tf top_k_top_p_filter fn * add generate function for TF * add generate function for TF * implemented generate for all models expect transfoXL * implemented generate for all models expect transfoXL * implemented generate for all models expect transfoXL * make style * change permission of test file to correct ones * delete ipdb * delete ipdb * fix bug and finish simple gpt2 integration test * clean test file * clean test file * make style * make style * make style * make style * change import style * change import style * make style * make style * add decorators * add decorators * fix tf ctrl bug dim => axis in TF * make style * make style * refactored test file * refactored test file * take out test_torch_tf_conversion if nothing is defined * take out test_torch_tf_conversion if nothing is defined * remove useless files * remove useless files * fix conflicts * fix conflicts * fix conflicts * fix conflicts * fix conflicts * solve conflicts * solve conflicts * fix conflicts * fix conflicts * merge conflicts * delete ipdb * exposed top_k_top_p_filtering fns * delete weirdly created w! file * add comment to test tf common modeling * fix conflicts * fix conflicts * make style * merge conflicts * make style * change tf.tensor.shape to shape_list(tensor)	2020-03-03 09:42:15 -05:00
Bram Vanroy	5211d333bb	Update modeling_tf_utils.py (#2924 ) Tensorflow does not use .eval() vs .train(). closes https://github.com/huggingface/transformers/issues/2906	2020-02-21 11:28:32 -05:00
thomwolf	c6c5c3fd4e	style and quality	2020-02-07 08:58:06 +01:00
thomwolf	961c69776f	@julien-c proposal for TF/PT compat in hf_buckets	2020-02-07 08:53:17 +01:00
Lysandre	24d5ad1dcc	Run the examples in slow	2020-01-23 09:38:45 -05:00
Lysandre	3922a2497e	TF ALBERT + TF Utilities + Fix warnings	2020-01-23 09:38:45 -05:00
Julien Chaumond	83a41d39b3	💄 super	2020-01-15 18:33:50 -05:00
Julien Chaumond	2f32dfd33b	Convention: name mixins mixins	2020-01-11 01:24:29 +00:00
Julien Chaumond	84c0aa1868	num_parameters helper	2020-01-10 17:40:02 +00:00

1 2

56 Commits