180 Commits

Author SHA1 Message Date
thomwolf
6a72d9aa52 updated examples in readme 2019-07-16 16:09:29 +02:00
thomwolf
b59043bf8f update readme 2019-07-16 16:03:48 +02:00
thomwolf
edc79acb3b simpler quick tour 2019-07-16 16:02:32 +02:00
thomwolf
5c82d3488f indicate default evaluation in breaking changes 2019-07-16 15:45:58 +02:00
thomwolf
4acaa65068 model in evaluation mode by default after from_pretrained 2019-07-16 15:41:57 +02:00
thomwolf
1849aa7d39 update readme and pretrained model weight files 2019-07-16 15:11:29 +02:00
thomwolf
43e0e8fa04 updates to readme and doc 2019-07-16 13:56:47 +02:00
thomwolf
352e3ff998 added migration guide to readme 2019-07-16 09:03:49 +02:00
thomwolf
8ad7e5b4f2 indeed 2019-07-16 00:29:15 +02:00
thomwolf
064d0a0b76 update readme 2019-07-16 00:21:33 +02:00
thomwolf
3b8b0e01bb update readme 2019-07-16 00:12:55 +02:00
thomwolf
2397f958f9 updating examples and doc 2019-07-14 23:20:10 +02:00
thomwolf
6135de2fa3 readme update 2019-07-11 15:39:49 +02:00
thomwolf
e468192e2f Merge branch 'pytorch-transformers' into xlnet 2019-07-09 17:05:37 +02:00
LysandreJik
ab30651802 Hugging Face theme. 2019-07-08 16:05:26 -04:00
thomwolf
eb91f6437e update readme and setup 2019-07-05 12:30:15 +02:00
thomwolf
0231ba291e circle-ci 2019-07-05 11:59:04 +02:00
thomwolf
0bab55d5d5 [BIG] name change 2019-07-05 11:55:36 +02:00
thomwolf
93e9971c54 fix tests 2019-06-26 10:02:45 +02:00
thomwolf
e55d4c4ede various updates to conversion, models and examples 2019-06-26 00:57:53 +02:00
thomwolf
603c513b35 update main conversion script and readme 2019-06-25 10:45:07 +02:00
thomwolf
62d78aa37e updating GLUE utils for compatibility with XLNet 2019-06-24 14:36:11 +02:00
thomwolf
c304593d8f BERTology details in readme 2019-06-20 10:05:06 +02:00
thomwolf
34d706a0e1 pruning in bertology 2019-06-19 15:25:49 +02:00
thomwolf
dc8e0019b7 updating examples 2019-06-19 13:23:20 +02:00
thomwolf
68ab9599ce small fix and updates to readme 2019-06-19 09:38:38 +02:00
thomwolf
4d8c4337ae test barrier in distrib training 2019-06-18 22:41:28 +02:00
thomwolf
15ebd67d4e cache in run_classifier + various fixes to the examples 2019-06-18 15:58:22 +02:00
thomwolf
d82e5deeb1 set find_unused_parameters=True in DDP 2019-06-18 12:13:14 +02:00
thomwolf
f964753090 explanation on the current location of the caching folder 2019-06-18 11:36:28 +02:00
thomwolf
382e2d1e50 spliting config and weight files for bert also 2019-06-18 10:37:16 +02:00
thomwolf
4447f270b2 updating hub 2019-06-17 16:21:28 +02:00
thomwolf
33d3db5c43 updating head masking, readme and docstrings 2019-06-17 15:51:28 +02:00
thomwolf
34858ae1d9 adding bert whole words, bertgerman and gpt-2 medium models, head masking 2019-06-17 11:02:39 +02:00
timoeller
16af9ff7b0 Add German Bert model to code, update readme 2019-06-14 17:42:46 +02:00
Colanim
1eba8b9d96 Fix link in README 2019-05-30 14:01:46 +09:00
lukovnikov
331a46ff04 - replaced OpenAIGPTAdam with OpenAIAdam in docs 2019-04-25 16:04:37 +02:00
lukovnikov
704037ad51 updated docs for new LR API; added some images for illustration; updated comments in optimization 2019-04-25 15:59:39 +02:00
thomwolf
18a8a15f78 improving GPT2 tokenization and adding tests 2019-04-16 17:00:55 +02:00
thomwolf
1135f2384a clean up logger in examples for distributed case 2019-04-15 15:22:40 +02:00
thomwolf
cc43307023 update readme 2019-04-15 15:06:10 +02:00
thomwolf
60ea6c59d2 added best practices for serialization in README and examples 2019-04-15 15:00:33 +02:00
thomwolf
20577d8a7c add configuration serialization to readme 2019-04-15 14:21:41 +02:00
thomwolf
b17963d82f update readme 2019-04-15 13:44:30 +02:00
Weixin Wang
f26ce6992e Fix links in README 2019-04-02 17:20:32 +08:00
Sepehr Sameni
b588ff362a fix lm_finetuning's link 2019-03-29 12:39:24 +04:30
Thomas Wolf
694e2117f3 Merge pull request #388 from ananyahjha93/master: Added remaining GLUE tasks to 'run_classifier.py' 2019-03-28 09:06:53 +01:00
Thomas Wolf
bbff03fbfc Merge pull request #394 from desireevl/master: Minor change in README 2019-03-27 12:03:00 +01:00
thomwolf
34561e61a5 update main readme also 2019-03-27 12:00:04 +01:00
Ananya Harsh Jha
f471979167 added GLUE dev set results and details on how to run GLUE tasks 2019-03-21 15:38:30 -04:00