jinoobaek-qz
|
69629c4f0f
|
Improve naming and only do regex when necessary
|
2019-10-09 08:48:40 -04:00 |
|
jinoobaek-qz
|
bf34a252b8
|
Golden path
|
2019-10-09 08:48:40 -04:00 |
|
jinoobaek-qz
|
528d3f327b
|
Improve readability and improve make less assumptions about checkpoint format
|
2019-10-09 08:48:40 -04:00 |
|
jinoobaek-qz
|
56301bd9e8
|
Extract method
|
2019-10-09 08:48:40 -04:00 |
|
jinoobaek-qz
|
d6c5469712
|
Delete older checkpoint after saving new checkpoint
|
2019-10-09 08:48:40 -04:00 |
|
jinoobaek-qz
|
54a31f50fb
|
Add save_total_limit
|
2019-10-09 08:48:40 -04:00 |
|
Thomas Wolf
|
439fac723a
|
Merge pull request #1409 from brian41005/master
Evaluation result.txt path changing #1286
|
2019-10-09 03:14:34 +02:00 |
|
VictorSanh
|
7ce83b4931
|
update weights for distilgpt2
|
2019-10-07 12:30:27 -04:00 |
|
LysandreJik
|
f3e0218fbb
|
Correct device assignment in run_generation
|
2019-10-05 21:05:16 -04:00 |
|
VictorSanh
|
0820bb0555
|
unecessary carriage return
|
2019-10-04 17:23:15 -04:00 |
|
VictorSanh
|
f5891c3821
|
run_squad --> run_squad_w_distillation
|
2019-10-04 17:23:15 -04:00 |
|
VictorSanh
|
764a7923ec
|
add distillation+finetuning option in run_squad
|
2019-10-04 17:23:15 -04:00 |
|
Lysandre Debut
|
d3f24dfad7
|
Merge branch 'master' into master
|
2019-10-03 22:43:09 +00:00 |
|
LysandreJik
|
ecc4f1bdfa
|
XLM use_lang_embedding flag in run_generation
|
2019-10-03 17:42:16 -04:00 |
|
LysandreJik
|
c2c2ca0fdb
|
Added XLM to run_generation, with prompt language selection.
|
2019-10-03 17:18:48 -04:00 |
|
Brian Ma
|
7af0777910
|
Update run_glue.py
add DistilBert model shortcut into ALL_MODELS
|
2019-10-03 15:31:11 +00:00 |
|
VictorSanh
|
5f07d8f11a
|
prepare release
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
35071007cb
|
incoming release 🔥 update links to arxiv preprint
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
2a91f6071f
|
upddate README - TODO updadte link to paper
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
c51e533a5f
|
update train.py
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
a76c3f9cb0
|
update requirements
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
bb9c5ead54
|
update distiller
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
a12ab0a8db
|
update binarized_data
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
4d6dfbd376
|
update extract
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
23edebc079
|
update extract_distilbert
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
cbfcfce205
|
update token_counts
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
19e4ebbe3f
|
grouped_batch_sampler
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
594202a934
|
lm_seqs_dataset
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
38084507c4
|
add distillation_configs
|
2019-10-03 10:27:11 -04:00 |
|
Brian Ma
|
2195c0d5f9
|
Evaluation result.txt path changing #1286
|
2019-10-03 12:49:12 +08:00 |
|
Thomas Wolf
|
963529e29b
|
Merge pull request #1288 from echan00/master
Typo with LM Fine tuning script
|
2019-10-01 18:46:07 -04:00 |
|
thomwolf
|
f7978f70ec
|
use format instead of f-strings
|
2019-10-01 18:45:38 -04:00 |
|
Denny
|
9478590630
|
Update run_lm_finetuning.py
The previous method, just as phrased, did not exist in the class.
|
2019-09-27 15:18:42 -03:00 |
|
Thomas Wolf
|
d83d295763
|
Merge pull request #1337 from mgrankin/fastdataset
faster dataset building
|
2019-09-27 10:35:12 +02:00 |
|
thomwolf
|
da2e47ad15
|
clean up a little run_tf_glue
|
2019-09-27 09:41:15 +02:00 |
|
thomwolf
|
528c288fa9
|
clean up run_tf_glue
|
2019-09-27 09:40:29 +02:00 |
|
VictorSanh
|
702f589848
|
fix input in run_glue for distilbert
|
2019-09-27 00:20:14 -04:00 |
|
mgrankin
|
f71a4577b8
|
faster dataset building
|
2019-09-26 16:53:13 +03:00 |
|
thomwolf
|
481d9c4fb5
|
Merge branch 'master' into tf2
|
2019-09-26 12:02:54 +02:00 |
|
thomwolf
|
31c23bd5ee
|
[BIG] pytorch-transformers => transformers
|
2019-09-26 10:15:53 +02:00 |
|
thomwolf
|
5705333441
|
add initialization for everybody
|
2019-09-26 10:06:20 +02:00 |
|
thomwolf
|
7c9f8f93f9
|
fix tests
|
2019-09-26 01:59:53 +02:00 |
|
thomwolf
|
d6dde438ea
|
add batch dimension in encode
|
2019-09-26 01:45:55 +02:00 |
|
thomwolf
|
4a21c4d88d
|
add warning if neither pt nor tf are found
|
2019-09-26 01:30:06 +02:00 |
|
thomwolf
|
3b7fb48c3b
|
fix loading from tf/pt
|
2019-09-25 17:46:16 +02:00 |
|
thomwolf
|
a049c8043b
|
push fix to training
|
2019-09-25 17:33:16 +02:00 |
|
thomwolf
|
5def3302f4
|
update run_glue
|
2019-09-25 12:38:08 +02:00 |
|
thomwolf
|
f71758f7a4
|
update internal glue processors
|
2019-09-25 12:00:50 +02:00 |
|
thomwolf
|
b5ec526f85
|
updated data processor and metrics
|
2019-09-24 17:10:50 +02:00 |
|
LysandreJik
|
f09e5ecef0
|
[Proposal] GLUE processors included in library
|
2019-09-24 09:47:34 -04:00 |
|