HuggingFace_transformer

Author	SHA1	Message	Date
Patrick von Platen	06dd597552	fix bug in warnings T5 pipelines (#3545 )	2020-04-01 21:59:12 +02:00
Anirudh Srinivasan	9de9ceb6c5	Correct output shape for Bert NSP models in docs (#3482 )	2020-04-01 15:04:38 -04:00
Patrick von Platen	b815edf69f	[T5, Testst] Add extensive hard-coded integration tests and make sure PT and TF give equal results (#3550 ) * add some t5 integration tests * finish summarization and translation integration tests for T5 - results loook good * add tf test * fix == vs is bug * fix tf beam search error and make tf t5 tests pass	2020-04-01 18:01:33 +02:00
Patrick von Platen	b38d552a92	[Generate] Add bad words list argument to the generate function (#3367 ) * add bad words list * make style * add bad_words_tokens * make style * better naming * make style * fix typo	2020-03-31 18:42:31 +02:00
Patrick von Platen	55bcae7f25	remove useless and confusing lm_labels line (#3531 )	2020-03-31 09:32:25 -04:00
dougian	1f72865726	[BART] Update encoder and decoder on set_input_embedding (#3501 ) Co-authored-by: Ioannis Douratsos <ioannisd@amazon.com>	2020-03-30 12:20:37 -04:00
Julien Chaumond	cc598b312b	[InputExample] Unfreeze for now, cf. #3423	2020-03-30 10:41:49 -04:00
Julien Plu	d38bbb225f	Update the NER TF script (#3511 ) * Update the NER TF script to remove the softmax and make the pad token label id to -1 * Reformat the quality and style Co-authored-by: Julien Plu <julien.plu@adevinta.com>	2020-03-30 09:50:12 -04:00
LysandreJik	6f5a12a583	Release: v2.7.0 Some checks failed GitHub-hosted runner / check_code_quality (push) Has been cancelled Details	2020-03-30 08:49:24 -04:00
Patrick von Platen	296252c49e	fix lm lables in docstring (#3529 )	2020-03-30 14:26:24 +02:00
Patrick von Platen	75ec6c9e3a	[T5] make decoder input ids optional for t5 training (#3521 ) * make decoder input ids optional for t5 training * lm_lables should not be shifted in t5 * add tests * finish shift right functionality for PT T5 * move shift right to correct class * cleaner code * replace -100 values with pad token id * add assert statement * remove unnecessary for loop * make style	2020-03-30 13:45:26 +02:00
Patrick von Platen	5b44e0a31b	[T5] Add training documenation (#3507 ) * Add clear description of how to train T5 * correct docstring in T5 * correct typo * correct docstring format * update t5 model docs * implement collins feedback * fix typo and add more explanation for sentinal tokens * delete unnecessary todos	2020-03-30 13:35:53 +02:00
Sam Shleifer	f6a23d1911	[BART] add bart-large-xsum weights (#3422 )	2020-03-29 10:51:13 -04:00
Patrick von Platen	fa9af2468a	Add T5 to docs (#3461 ) * add t5 docs basis * improve docs * add t5 docs * improve t5 docstring * add t5 tokenizer docstring * finish docstring * make style * add pretrained models * correct typo * make examples work * finalize docs	2020-03-27 10:57:16 -04:00
LysandreJik	e2c05f06ef	Correct indentation in docstring For some reason Sphinx extremely dislikes this and crashes.	2020-03-27 09:28:52 -04:00
Sam Shleifer	3ee431dd4c	[Bart/Memory] Two separate, smaller decoder attention masks (#3371 )	2020-03-26 21:34:15 -04:00
Sam Shleifer	c10decf7a0	[Bart: example] drop columns that are exclusively pad_token_id… (#3400 ) * trim seq_len below 1024 if there are columns full of pad_token_id * Centralize trim_batch so SummarizationDataset can use it too	2020-03-26 19:33:54 -04:00
Sam Shleifer	63f4d8cad0	[Bart/Memory] SelfAttention only returns weights if config.outp… (#3369 )	2020-03-26 18:42:39 -04:00
Sam Shleifer	2b2a2f8df2	[Bart] Fix: put dummy_inputs on correct device (#3398 ) * Dummy inputs to model.device * Move self.device to ModuleUtilsMixin	2020-03-26 18:42:09 -04:00
Sam Shleifer	1a5aefc95c	[Seq2Seq Generation] Call encoder before expanding input_ids (#3370 )	2020-03-26 18:41:19 -04:00
Sam Shleifer	39371ee454	[Bart/Memory] don't create lm_head (#3323 ) * delete lm_head, skips weight tying * Fixed s3	2020-03-26 18:40:39 -04:00
sakares saengkaew	1a6c546c6f	Add missing token classification for XLM (#3277 ) * Add the missing token classification for XLM * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add the missing token classification for XLM * fix styling * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add missing description for AlbertForTokenClassification * fix styling * Add missing docstring for AlBert * Slow tests should be slow Co-authored-by: Sakares Saengkaew <s.sakares@gmail.com> Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-03-26 10:22:13 -04:00
Patrick von Platen	311970546f	rename string in pipeline	2020-03-26 14:59:49 +01:00
Patrick von Platen	022e8fab97	Adds translation pipeline (#3419 ) * fix merge conflicts * add t5 summarization example * change parameters for t5 summarization * make style * add first code snippet for translation * only add prefixes * add prefix patterns * make style * renaming * fix conflicts * remove unused patterns * solve conflicts * fix merge conflicts * remove translation example * remove summarization example * make sure tensors are in numpy for float comparsion * re-add t5 config * fix t5 import config typo * make style * remove unused numpy statements * update doctstring * import translation pipeline	2020-03-26 13:50:58 +01:00
Patrick von Platen	9c683ef01e	Add t5 to pipeline(task='summarization') (#3413 ) * solve conflicts * move warnings below * incorporate changes * add pad_to_max_length to pipelines * add bug fix for T5 beam search * add prefix patterns * make style * fix conflicts * adapt pipelines for task specific parameters * improve docstring * remove unused patterns	2020-03-26 11:03:13 +01:00
Lysandre Debut	ffcffebe85	Force the return of token type IDs (#3439 )	2020-03-26 09:41:36 +01:00
Patrick von Platen	ffa17fe322	Extend config with task specific configs. (#3433 ) * add new default configs * change prefix default to None	2020-03-25 21:32:04 +01:00
Julien Chaumond	83272a3853	Experiment w/ dataclasses (including Py36) (#3423 ) * [ci] Also run test_examples in py37 (will revert at the end of the experiment) * InputExample: use immutable dataclass * [deps] Install dataclasses for Py<3.7 * [skip ci] Revert "[ci] Also run test_examples in py37" This reverts commit d29afd9959786b77759b0b8fa4e6b4335b952015.	2020-03-25 11:10:20 -04:00
Julien Chaumond	f8823bad9a	Expose missing mappings (see #3415 )	2020-03-24 17:46:25 -04:00
LysandreJik	471cce24b3	Release: v2.6.0	2020-03-24 10:37:32 -04:00
Julien Chaumond	a8e3336a85	[examples] Use AutoModels in more examples	2020-03-23 20:11:14 -04:00
Julien Chaumond	e25c4f4027	[ALBERT] move things around for more consistent naming see #3359 cc @lysandrejik	2020-03-23 13:58:21 -04:00
Patrick von Platen	95e00d0808	Clean special token init in modeling_....py (#3264 ) * make style * fix conflicts	2020-03-20 21:41:04 +01:00
Julien Chaumond	ecfd336318	Simpler Error message when loading config/model with .from_pretrained() (#3341 )	2020-03-19 23:23:03 +01:00
Patrick von Platen	bbf26c4e61	Support T5 Generation (#3228 ) * fix conflicts * update bart max length test * correct spelling mistakes * implemented model specific encode function * fix merge conflicts * better naming * save intermediate state -> need to rethink strucuture a bit * leave tf problem as it is for now * current version * add layers.pop * remove ipdb * make style * clean return cut decoding * remove ipdbs * Fix restoring layers in the decoders that doesnt exists. * push good intermediate solution for now * fix conflicts * always good to refuse to merge conflicts when rebasing * fix small bug * improve function calls * remove unused file * add correct scope behavior for t5_generate Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-03-19 23:18:23 +01:00
Lysandre Debut	f049be7ad4	Export ALBERT main layer in TensorFlow (#3354 )	2020-03-19 13:53:05 -04:00
Serkan Karakulak	b2c2c31c60	Minor Bug Fix for Running Roberta on Glue (#3240 ) * added return_token_type_ids argument for tokenizers which do not generate return_type_ids by default * fixed styling * Style Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-03-19 12:08:31 -04:00
Sam Shleifer	4e4403c9b4	[BART] torch 1.0 compatibility (#3322 ) * config.activation_function	2020-03-19 11:56:54 -04:00
Sam Shleifer	ad7233fc01	[BART] cleanup: remove redundant kwargs, improve docstrings (#3319 )	2020-03-19 11:16:51 -04:00
Mohamed El-Geish	cd21d8bc00	Typo in warning message (#3219 ) `T5Tokenizer` instead of `XLNetTokenizer`	2020-03-19 09:49:25 -04:00
Matthew Goldey	8d3e218ea6	fix typo in docstring demonstrating usage (#3213 )	2020-03-19 09:47:54 -04:00
Patrick von Platen	cec3cdda15	Fix input ids can be none attn mask (#3345 ) * fix issue 3289 * fix attention mask if input_ids None behavior	2020-03-19 09:55:17 +01:00
Lysandre Debut	d6afbd323d	XLM-R Tokenizer now passes common tests + Integration tests (#3198 ) * XLM-R now passes common tests + Integration tests * Correct mask index * Model input names * Style * Remove text preprocessing * Unneccessary import	2020-03-18 09:52:49 -04:00
Patrick von Platen	292186a3e7	Adding LM Head to Transfo-XL and first step to fixing problem with Adaptive Embeddings in TransfoXL (#3286 ) * first commit * work in progress * make language generation task pass * update to working version for LM * delete print * remove dead code * make style	2020-03-18 09:24:27 -04:00
Patrick von Platen	ddb10c6447	improve doctstring (#3327 )	2020-03-18 13:24:09 +01:00
Sam Shleifer	38a555a83c	Add Summarization to Pipelines (#3128 ) * passing * Undo stupid chg * docs * undo rename * delete-cruft * only import if you have torch * Dont rely on dict ordering * Fix dict ordering upstream * docstring link * docstring link * remove trailing comma for 3.5 compat * new name * delegate kwarging * Update kwargs	2020-03-17 18:04:21 -04:00
Patrick von Platen	e8f44af5bf	[generate] do_sample default back to False (#3298 ) * change do_samples back * None better default as boolean * adapt do_sample to True in test example * make style	2020-03-17 10:52:37 -04:00
Thomas Wolf	2187c49f5c	CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186 ) * memory benchmark rss * have both forward pass and line-by-line mem tracing * cleaned up tracing * refactored and cleaning up API * no f-strings yet... * add GPU mem logging * fix GPU memory monitoring * style and quality * clean up and doc * update with comments * Switching to python 3.6+ * fix quality	2020-03-17 10:17:11 -04:00
Patrick von Platen	4759176313	add camembert for Question answering for examples	2020-03-16 14:42:11 -04:00
Sam Shleifer	11573231c6	[BART] generation_mode as a kwarg not a class attribute (#3278 )	2020-03-16 12:47:53 -04:00

1 2 3 4 5 ...

426 Commits