Victor SANH
|
fa735208c9
|
update readme - fix example command distil*
|
2019-10-30 14:27:28 -04:00 |
|
Thomas Wolf
|
c7058d8224
|
Merge pull request #1608 from focox/master
Error raised by "tmp_eval_loss += tmp_eval_loss.item()" when using multi-gpu
|
2019-10-30 17:14:07 +01:00 |
|
Thomas Wolf
|
04c69db399
|
Merge pull request #1628 from huggingface/tfglue
run_tf_glue works with all tasks
|
2019-10-30 17:04:03 +01:00 |
|
Thomas Wolf
|
3df4367244
|
Merge pull request #1601 from huggingface/clean-roberta
Clean roberta model & all tokenizers now add special tokens by default (breaking change)
|
2019-10-30 17:00:40 +01:00 |
|
Thomas Wolf
|
36174696cc
|
Merge branch 'master' into clean-roberta
|
2019-10-30 16:51:06 +01:00 |
|
Thomas Wolf
|
228cdd6a6e
|
Merge branch 'master' into conditional-generation
|
2019-10-30 16:40:35 +01:00 |
|
Rémi Louf
|
070507df1f
|
format utils for summarization
|
2019-10-30 11:24:12 +01:00 |
|
Rémi Louf
|
da10de8466
|
fix bug with padding mask + add corresponding test
|
2019-10-30 11:19:58 +01:00 |
|
Rémi Louf
|
3b0d2fa30e
|
rename seq2seq to encoder_decoder
|
2019-10-30 10:54:46 +01:00 |
|
Rémi Louf
|
9c1bdb5b61
|
revert renaming of lm_labels to ltr_lm_labels
|
2019-10-30 10:43:13 +01:00 |
|
Rémi Louf
|
098a89f312
|
update docstrings; rename lm_labels to more explicit ltr_lm_labels
|
2019-10-29 20:08:03 +01:00 |
|
Rémi Louf
|
dfce409691
|
resolve PR comments
|
2019-10-29 17:10:20 +01:00 |
|
altsoph
|
079bfb32fb
|
Evaluation fixed.
|
2019-10-28 10:18:58 -04:00 |
|
altsoph
|
438f2730a0
|
Evaluation code fixed.
|
2019-10-28 10:18:58 -04:00 |
|
Rémi Louf
|
4c3ac4a7d8
|
here's one big commit
|
2019-10-28 10:49:50 +01:00 |
|
Rémi Louf
|
932543f77e
|
fix test of truncation function
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
a67413ccc8
|
extend works in-place
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
b915ba9dfe
|
pad sequence with 0, mask with -1
|
2019-10-28 10:49:49 +01:00 |
|
Lysandre
|
bab6ad01aa
|
run_tf_glue works with all tasks
|
2019-10-24 21:41:45 +00:00 |
|
Matt Maybeno
|
ae1d03fc51
|
Add roberta to doc
|
2019-10-24 14:32:48 -04:00 |
|
Matt Maybeno
|
4e5f88b74f
|
Add Roberta to run_ner.py
|
2019-10-24 14:32:48 -04:00 |
|
VictorSanh
|
5b6cafb11b
|
[release] fix table weirdness
|
2019-10-23 10:35:16 -04:00 |
|
VictorSanh
|
8ad5c591cd
|
[RELEASE] DistilRoBERTa
|
2019-10-23 10:29:47 -04:00 |
|
focox@qq.com
|
bd847ce7d7
|
fixed the bug raised by "tmp_eval_loss += tmp_eval_loss.item()" when parallelly using multi-gpu.
|
2019-10-23 20:27:13 +08:00 |
|
Julien Chaumond
|
ef1b8b2ae5
|
[CTRL] warn if generation prompt does not start with a control code
see also https://github.com/salesforce/ctrl/pull/50
|
2019-10-22 21:30:32 +00:00 |
|
Lysandre
|
7d709e55ed
|
Remove
|
2019-10-22 14:12:33 -04:00 |
|
Lysandre
|
1cfd974868
|
Option to benchmark only one of the two libraries
|
2019-10-22 13:32:23 -04:00 |
|
Pasquale Minervini
|
abd7110e21
|
gradient norm clipping should be done right before calling the optimiser - fixing run_glue and run_ner as well
|
2019-10-21 19:56:52 +01:00 |
|
Pasquale Minervini
|
3775550c4b
|
gradient norm clipping should be done right before calling the optimiser
|
2019-10-20 22:33:56 +01:00 |
|
LysandreJik
|
7dd29ed2f1
|
Benchmarks example script
|
2019-10-18 10:53:04 -04:00 |
|
leo-du
|
ecd15667f3
|
fix repetition penalty
|
2019-10-17 14:47:14 -04:00 |
|
thomwolf
|
8cd56e3036
|
fix data processing in script
|
2019-10-17 16:33:26 +02:00 |
|
Rémi Louf
|
578d23e061
|
add training pipeline (formatting temporary)
|
2019-10-17 14:02:27 +02:00 |
|
Rémi Louf
|
47a06d88a0
|
use two different tokenizers for storyand summary
|
2019-10-17 13:04:26 +02:00 |
|
Rémi Louf
|
bfb9b540d4
|
add Model2Model to __init__
|
2019-10-17 12:59:51 +02:00 |
|
Rémi Louf
|
c1bc709c35
|
correct the truncation and padding of dataset
|
2019-10-17 10:41:53 +02:00 |
|
Rémi Louf
|
e4e0ee14bd
|
add separator between data import and train
|
2019-10-16 20:05:32 +02:00 |
|
Rémi Louf
|
0d81fc853e
|
specify in readme that both datasets are required
|
2019-10-15 15:26:33 +02:00 |
|
Rémi Louf
|
1aec940587
|
test the full story processing
|
2019-10-15 15:18:07 +02:00 |
|
Rémi Louf
|
22e1af6859
|
truncation function is fully tested
|
2019-10-15 14:43:50 +02:00 |
|
Rémi Louf
|
260ac7d9a8
|
wip commit, switching computers
|
2019-10-15 12:24:35 +02:00 |
|
thomwolf
|
be916cb3fb
|
Merge branch 'master' of https://github.com/huggingface/transformers
|
2019-10-15 10:37:13 +02:00 |
|
thomwolf
|
5875aaf762
|
install tensorboard
|
2019-10-15 10:36:46 +02:00 |
|
Thomas Wolf
|
40f14ff545
|
Merge pull request #1513 from slayton58/amp_fp16_einsum
Force einsum to run in fp16
|
2019-10-15 10:25:00 +02:00 |
|
Thomas Wolf
|
d147671c6c
|
Merge pull request #1508 from tlkh/master
Added performance enhancements (XLA, AMP) to examples
|
2019-10-15 09:57:18 +02:00 |
|
thomwolf
|
2c1d5564ad
|
add readme information
|
2019-10-15 09:56:52 +02:00 |
|
thomwolf
|
c55badcee0
|
Add NER finetuning details by @stefan-it in example readme
|
2019-10-15 09:33:52 +02:00 |
|
Julien Chaumond
|
788e632622
|
[ner] Honor args.overwrite_cache
|
2019-10-15 09:17:31 +02:00 |
|
thomwolf
|
0f9ebb0b43
|
add seqeval as requirement for examples
|
2019-10-15 09:17:31 +02:00 |
|
thomwolf
|
66adb71734
|
update to transformers
|
2019-10-15 09:17:31 +02:00 |
|