Commit Graph

1591 Commits

Author SHA1 Message Date
thomwolf
62df4ba59a add dilbert tokenizer and tests 2019-08-28 12:22:56 +02:00
thomwolf
4ce5f36f78 update readmes 2019-08-28 12:14:31 +02:00
erenup
ec4b1c659f logging truth error 2019-08-28 16:50:40 +08:00
erenup
df52abe373 add sep_toekn between question and choice 2019-08-28 16:36:21 +08:00
erenup
43c243254a avoid invalid labels of truth 2019-08-28 16:03:17 +08:00
erenup
3c7e676f8b add test related code: test the best dev acc model when model is training 2019-08-28 15:57:29 +08:00
VictorSanh
a5fe16687b fix typo 2019-08-28 07:22:54 +00:00
VictorSanh
497f73c964 add DilBERT to master REAME 2019-08-28 07:16:30 +00:00
VictorSanh
93e82ab424 Write README for DilBERT 2019-08-28 06:26:09 +00:00
VictorSanh
19b7c9b0b7 add DilBert model for squad 2019-08-28 06:25:44 +00:00
VictorSanh
fea921d382 add licensing 2019-08-28 04:45:39 +00:00
VictorSanh
da1e4e53fc some fixes in train.py for loading previous checkpoint 2019-08-28 04:01:03 +00:00
VictorSanh
0d8f8848d5 add scripts/extract_for_distil.py 2019-08-28 04:00:19 +00:00
VictorSanh
7f2c384c80 add scripts/token_counts.py 2019-08-28 04:00:03 +00:00
VictorSanh
4d16b279e5 add scripts/binarized_data.py 2019-08-28 03:59:48 +00:00
LysandreJik
c513415b19 Dilbert tests from CommonTests 2019-08-27 23:59:00 -04:00
LysandreJik
778a263f09 GilBert added to AutoModels 2019-08-27 23:14:00 -04:00
VictorSanh
74d78beeb4 fix: add qa_dropout and seq_classif_dropout 2019-08-28 03:13:11 +00:00
VictorSanh
7f5d85347e fix small typo 2019-08-28 02:44:51 +00:00
VictorSanh
906581ae3c add s3 links for dilbert (+fix small typo) 2019-08-28 02:43:33 +00:00
VictorSanh
b247b0d880 add train.py for distillation 2019-08-28 02:12:47 +00:00
VictorSanh
780f183e55 add requirements 2019-08-28 01:39:52 +00:00
VictorSanh
e424d2e45d add README 2019-08-28 01:10:10 +00:00
VictorSanh
1ae81e4aa1 add dataset. distiller, utils 2019-08-28 01:10:05 +00:00
VictorSanh
5d29f8e99b fix bugs 2019-08-28 00:57:16 +00:00
VictorSanh
a8ad83040d fix bugs 2019-08-28 00:45:33 +00:00
Shijie Wu
ca4baf8ca1 Match order of casing in OSS XLM; Improve document; Clean up dependency 2019-08-27 20:03:18 -04:00
VictorSanh
60c984da6c fix bugs 2019-08-27 22:25:55 +00:00
VictorSanh
42968138c8 wip wouf 2019-08-27 22:00:38 +00:00
VictorSanh
1d23240068 wip 2019-08-27 14:27:47 +00:00
Thomas Wolf
d06c5a2a0a Merge pull request #1120 from CrafterKolyan/patch-3
Change attention mask dtype to be bool. Fix #1119
2019-08-27 15:01:01 +02:00
Thomas Wolf
edc5222fc3 Merge pull request #1118 from CrafterKolyan/patch-2
Documentation fix #1117
2019-08-27 14:58:50 +02:00
Thomas Wolf
9cf298dfc1 Merge pull request #1116 from CrafterKolyan/patch-1
Delete nonexistent parameter from documentation fix #1115
2019-08-27 14:56:43 +02:00
thomwolf
0d288727b8 fix #1106 2019-08-27 14:50:22 +02:00
thomwolf
447afe9cdf updating docstring for AutoModel 2019-08-27 14:42:03 +02:00
thomwolf
a175a9dc01 add kwargs to base encode function 2019-08-27 14:05:59 +02:00
Nikolay Korolev
53282b5bd0 Change attention mask dtype to be bool. Fix #1119 2019-08-27 14:19:03 +03:00
Nikolay Korolev
26bda77225 Fix documentation #1117
Rename parameter in documentation + Delete its second occurrence.
2019-08-27 12:22:42 +03:00
Nikolay Korolev
c8933bb2d9 Delete nonexistent parameter from documentation
Changed documentation of GPT2Model, GPT2LMHeadModel and GPT2DoubleHeadsModel
2019-08-27 12:10:36 +03:00
LysandreJik
e08c01aa1a fix #1102 2019-08-26 18:13:06 -04:00
LysandreJik
84a3a9689d Pytorch Hub & AutoModels 2019-08-26 16:08:43 -04:00
LysandreJik
f68339639a Tests for added AutoModels 2019-08-26 16:02:23 -04:00
LysandreJik
cb60ce59dd Added multiple AutoModel classes: AutoModelWithLMHead, AutoModelForQuestionAnswering and AutoModelForSequenceClassification 2019-08-26 15:44:30 -04:00
LysandreJik
529a16dec6 Generic encoding implementation. 2019-08-26 15:00:43 -04:00
Shijie Wu
f1b018740c Add use_lang_emb to config 2019-08-23 20:33:01 -04:00
Shijie Wu
e85123d398 Add custom tokenizer for zh and ja 2019-08-23 20:27:52 -04:00
thomwolf
06510ccb53 typo 2019-08-23 22:08:10 +02:00
thomwolf
3bcbebd440 max_len_single_sentence & max_len_sentences_pair as attributes so they can be modified 2019-08-23 22:07:26 +02:00
Shijie Wu
436ce07218 Tokenization behave the same as original XLM proprocessing for most languages except zh, ja and th; Change API to allow specifying language in tokenize 2019-08-23 14:40:17 -04:00
thomwolf
ab7bd5ef98 fixing tokenization and training 2019-08-23 17:31:21 +02:00