Commit Graph

51 Commits

Author SHA1 Message Date
thomwolf
a6bcfb8015 fix tests 2019-09-25 21:14:12 +02:00
thomwolf
8a618e0af5 clean up __init__ 2019-09-25 21:04:52 +02:00
thomwolf
0f091062d4 Merge branch 'glue-example' into tf2 2019-09-25 10:21:52 +02:00
thomwolf
c4acc3a8e9 let encode accept tensor inputs 2019-09-25 10:19:14 +02:00
thomwolf
99a90e43d4 update data processors __init__ 2019-09-24 17:16:46 +02:00
thomwolf
b5ec526f85 updated data processor and metrics 2019-09-24 17:10:50 +02:00
thomwolf
f5397ffc3b update loading logics 2019-09-24 14:03:58 +02:00
thomwolf
e9a103c17a bidirectional conversion TF <=> PT - extended tests 2019-09-24 13:25:50 +02:00
thomwolf
a7e01a248b converting distilled/fine-tuned models 2019-09-24 10:58:52 +02:00
thomwolf
8ba44ced95 fix roberta conversion script 2019-09-24 09:48:23 +02:00
thomwolf
2b11fa5174 update __init__ and conversion script 2019-09-23 22:35:45 +02:00
erenup
84b9d1c423 Merge remote-tracking branch 'huggingface/master'
# Conflicts:
#	pytorch_transformers/__init__.py
2019-09-16 19:06:12 +08:00
thomwolf
b97af8cce9 skip finetuned checkpoints 2019-09-13 16:43:49 +02:00
thomwolf
4356f791a2 XLM passing tests 2019-09-11 11:49:54 +02:00
thomwolf
465870c33f Xlnet working - also added simple question answering model for XLNet 2019-09-10 16:44:41 +02:00
thomwolf
32aabe8c33 WIP XLNet 2019-09-10 12:17:18 +02:00
thomwolf
33cb00f41a add GPT2 to init - fix weights loading - remove tf.function 2019-09-09 14:29:24 +02:00
thomwolf
b7175a2701 fixed imports in tests and gpt2 config test 2019-09-09 11:04:03 +02:00
thomwolf
01597e5b90 add tf auto models + tests 2019-09-08 15:02:06 +03:00
thomwolf
518307dfcd test suite independent of framework 2019-09-08 15:02:06 +03:00
thomwolf
ad0ab9afe9 fix test when tf is not here 2019-09-08 15:02:06 +03:00
thomwolf
59fe641b8b also gathering file names in file_utils 2019-09-08 15:02:06 +03:00
thomwolf
7ae642b72d update conversion scripts 2019-09-08 15:02:06 +03:00
thomwolf
1efb1f1660 split configuration and modeling files 2019-09-08 15:02:06 +03:00
thomwolf
85df4f7cca also gathering file names in file_utils 2019-09-05 02:34:09 +02:00
thomwolf
121f88cae3 update conversion scripts 2019-09-05 02:17:50 +02:00
thomwolf
2a667b1eb9 split configuration and modeling files 2019-09-05 00:27:11 +02:00
thomwolf
89fd3450a6 Release: 1.2.0 2019-09-04 13:32:18 +02:00
LysandreJik
dc43215c01 Added multiple AutoModel classes: AutoModelWithLMHead, AutoModelForQuestionAnswering and AutoModelForSequenceClassification 2019-08-30 12:48:55 -04:00
erenup
6e1ac34e2b Merge remote-tracking branch 'huggingface/master' 2019-08-30 15:50:11 +08:00
thomwolf
912a377e90 dilbert -> distilbert 2019-08-28 13:59:42 +02:00
thomwolf
62df4ba59a add dilbert tokenizer and tests 2019-08-28 12:22:56 +02:00
VictorSanh
60c984da6c fix bugs 2019-08-27 22:25:55 +00:00
VictorSanh
42968138c8 wip wouf 2019-08-27 22:00:38 +00:00
erenup
5582bc4b23 add multiple choice to robreta and xlnet, test on swag, roberta=0.82.28
, xlnet=0.80
2019-08-18 16:01:48 +08:00
LysandreJik
fe02e45e48 Release: 1.1.0 2019-08-15 11:15:08 -04:00
LysandreJik
6c41a8f5dc Encode and Decode are back in the superclass. They now handle sentence pairs special tokens. 2019-08-08 18:20:32 -04:00
LysandreJik
d2cc6b101e Merge branch 'master' into RoBERTa 2019-08-08 09:42:05 -04:00
LysandreJik
770043eea2 Sentence-pair tasks handling. Using common tests on RoBERTa. Forced push to fix indentation. 2019-08-07 12:53:19 -04:00
Thomas Wolf
d43dc48b34 Merge branch 'master' into auto_models 2019-08-05 19:17:35 +02:00
thomwolf
0b524b0848 remove derived classes for now 2019-08-05 19:08:19 +02:00
thomwolf
13936a9621 update doc and tests 2019-08-05 18:48:16 +02:00
thomwolf
ed4e542260 adding tests 2019-08-05 18:14:07 +02:00
thomwolf
328afb7097 cleaning up tokenizer tests structure (at last) - last remaining ppb refs 2019-08-05 14:08:56 +02:00
thomwolf
009273dbdd big doc update [WIP] 2019-08-04 12:14:57 +02:00
thomwolf
632d711411 fix #908 2019-07-26 21:14:37 +02:00
thomwolf
ed7549bb1a release version 1.0 2019-07-16 16:10:58 +02:00
thomwolf
ec07cf5a66 rewamp optimization 2019-07-11 14:48:22 +02:00
thomwolf
b19786985d unified tokenizer api and serialization + tests 2019-07-09 10:25:18 +02:00
thomwolf
36bca545ff tokenization abstract class - tests for examples 2019-07-05 15:02:59 +02:00