HuggingFace_transformer

Author	SHA1	Message	Date
Lysandre	569897ce2c	Fix a few issues regarding the language modeling script	2020-02-12 13:23:14 -05:00
VictorSanh	ee5a6856ca	distilbert-base-cased weights + Readmes + omissions	2020-02-07 15:28:13 -05:00
Julien Chaumond	42f08e596f	[examples] rename run_lm_finetuning to run_language_modeling	2020-02-07 09:15:28 -05:00
Julien Chaumond	4f7bdb0958	[examples] Fix broken markdown	2020-02-07 09:15:28 -05:00
Peter Izsak	6fc3d34abd	Fix multi-gpu evaluation in run_glue.py	2020-02-06 16:38:55 -05:00
Julien Chaumond	ada24def22	[run_lm_finetuning] Tweak fix for non-long tensor, close #2728 see `1ebfeb7946` and #2728 Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2020-02-05 12:49:18 -05:00
Yuval Pinter	d1ab1fab1b	pass langs parameter to certain XLM models (#2734 ) * pass langs parameter to certain XLM models Adding an argument that specifies the language the SQuAD dataset is in so language-sensitive XLMs (e.g. `xlm-mlm-tlm-xnli15-1024`) don't default to language `0`. Allows resolution of issue #1799 . * fixing from `make style` * fixing style (again)	2020-02-04 17:12:42 -05:00
Lysandre	3bf5417258	Revert erroneous fix	2020-02-04 16:31:07 -05:00
Lysandre	1ebfeb7946	Cast to long when masking tokens	2020-02-04 15:56:16 -05:00
Lysandre	239dd23f64	[Follow up 213] Masked indices should have -1 and not -100. Updating documentation + scripts that were forgotten	2020-02-03 16:08:05 -05:00
Antonio Carlos Falcão Petri	2ba147ecff	Fix typo in examples/utils_ner.py "%s-%d".format() -> "{}-{}".format()	2020-02-01 11:10:57 -05:00
Lysandre	d18d47be67	run_generation style	2020-01-31 12:05:48 -05:00
Lysandre	7365f01d43	do_sample should be set to True in run_generation.py	2020-01-31 11:49:32 -05:00
Jared Nielsen	71a382319f	Correct documentation	2020-01-30 18:41:24 -05:00
Hang Le	f0a4fc6cd6	Add Flaubert	2020-01-30 10:04:18 -05:00
Jared Nielsen	adb8c93134	Remove lines causing a KeyError	2020-01-29 14:01:16 -05:00
Lysandre	335dd5e68a	Default save steps 50 to 500 in all scripts	2020-01-28 09:42:11 -05:00
Julien Chaumond	6b4c3ee234	[run_lm_finetuning] GPT2 tokenizer doesn't have a pad_token ping @lysandrejik	2020-01-27 20:14:02 -05:00
VictorSanh	1ce3fb5cc7	update correct eval metrics (distilbert & co)	2020-01-24 11:45:22 -05:00
Julien Chaumond	1a8e87be4e	Line-by-line text dataset (including padding)	2020-01-21 16:57:38 -05:00
Julien Chaumond	b94cf7faac	change order	2020-01-21 16:57:38 -05:00
Julien Chaumond	2eaa8b6e56	Easier to not support this, as it could be confusing cc @lysandrejik	2020-01-21 16:57:38 -05:00
Julien Chaumond	801aaa5508	make style	2020-01-21 16:57:38 -05:00
Julien Chaumond	56d4ba8ddb	[run_lm_finetuning] Train from scratch	2020-01-21 16:57:38 -05:00
jiyeon_baek	6d5049a24d	Fix typo in examples/run_squad.py Rul -> Run	2020-01-17 11:22:51 -05:00
Lysandre	6e2c28a14a	Run SQuAD warning when the doc stride may be too high	2020-01-16 13:59:26 -05:00
thomwolf	258ed2eaa8	adding details in readme	2020-01-16 13:21:30 +01:00
thomwolf	50ee59578d	update formating - make flake8 happy	2020-01-16 13:21:30 +01:00
thomwolf	1c9333584a	formating	2020-01-16 13:21:30 +01:00
thomwolf	e25b6fe354	updating readme	2020-01-16 13:21:30 +01:00
thomwolf	27c7b99015	adding details in readme - moving file	2020-01-16 13:21:30 +01:00
Nafise Sadat Moosavi	99d4515572	HANS evaluation	2020-01-16 13:21:30 +01:00
Julien Chaumond	83a41d39b3	💄 super	2020-01-15 18:33:50 -05:00
Julien Chaumond	715fa638a7	Merge branch 'master' into from_scratch_training	2020-01-14 18:58:21 +00:00
Julien Chaumond	b803b067bf	Config to Model mapping	2020-01-13 20:05:20 +00:00
IWillPull	a3085020ed	Added repetition penalty to PPLM example (#2436 ) * Added repetition penalty * Default PPLM repetition_penalty to neutral * Minor modifications to comply with reviewer's suggestions. (j -> token_idx) * Formatted code with `make style`	2020-01-10 23:00:07 -05:00
VictorSanh	e83d9f1c1d	cleaning - change ' to " (black requirements)	2020-01-10 19:34:25 -05:00
VictorSanh	ebba9e929d	minor spring cleaning - missing configs + processing	2020-01-10 19:14:58 -05:00
Victor SANH	331065e62d	missing import	2020-01-10 11:42:53 +01:00
Victor SANH	414e9e7122	indents test	2020-01-10 11:42:53 +01:00
Victor SANH	3cdb38a7c0	indents	2020-01-10 11:42:53 +01:00
Victor SANH	ebd45980a0	Align with `run_squad` + fix some errors	2020-01-10 11:42:53 +01:00
Victor SANH	45634f87f8	fix Sampler in distributed training - evaluation	2020-01-10 11:42:53 +01:00
Victor SANH	af1ee9e648	Move `torch.nn.utils.clip_grad_norm_`	2020-01-10 11:42:53 +01:00
Lysandre	164c794eb3	New SQuAD API for distillation script	2020-01-10 11:42:53 +01:00
Lysandre	16ce15ed4b	DistilBERT token type ids removed from inputs in run_squad	2020-01-08 13:18:30 +01:00
Lysandre Debut	f24232cd1b	Fix error with global step in run_squad.py	2020-01-08 11:39:00 +01:00
Oren Amsalem	43114b89ba	spelling correction (#2434 )	2020-01-07 17:25:25 +01:00
Lysandre Debut	27c1b656cc	Fix error with global step in run_lm_finetuning.py	2020-01-07 16:16:12 +01:00
Simone Primarosa	176d3b3079	Add support for Albert and XLMRoberta for the Glue example (#2403 ) * Add support for Albert and XLMRoberta for the Glue example	2020-01-07 14:55:55 +01:00

1 2 3 4 5 ...

848 Commits