Ralph Tang
a2c8c8ef00
Fix hanging when loading pretrained models
...
- Fix hanging when loading pretrained models from the cache without having internet access. This is a widespread issue on supercomputers whose internal compute nodes are firewalled.
2019-10-19 16:19:20 -04:00
VictorSanh
fd97761c5a
soft launch distilroberta
2019-10-17 15:28:58 -04:00
Thomas Wolf
e703e4dfe1
Merge pull request #1509 from julian-pani/patch-3
...
remove leftover usage of DUMMY_INPUTS
2019-10-15 10:24:13 +02:00
thomwolf
898ce064f8
add tests on TF2.0 & PT checkpoint => model conversion functions
2019-10-15 10:04:19 +02:00
Thomas Wolf
8aa3b753bd
Merge pull request #1434 from bryant1410/patch-1
...
Remove unnecessary use of FusedLayerNorm in XLNet
2019-10-15 09:44:19 +02:00
Thomas Wolf
80889a0226
Merge pull request #1512 from louismartin/fix-roberta-convert
...
Fix import error in script to convert fairseq roberta checkpoints
2019-10-14 17:40:32 +02:00
Thomas Wolf
f62f992cf7
Merge pull request #1502 from jeffxtang/master
...
the working example code to use BertForQuestionAnswering
2019-10-14 16:14:52 +02:00
Louis MARTIN
49cba6e543
Fix import error in script to convert fairseq roberta checkpoints
2019-10-14 01:38:57 -07:00
JulianPani
0993586758
remove usage of DUMMY_INPUTS
...
Hey @thomwolf
This change da26bae61b (diff-8ddce309e88e8eb5b4d02228fd8881daL28-L29) removed the constant, but one usage of that constant remains in the code.
2019-10-14 02:09:53 +03:00
jeffxtang
e76d71521c
the working example code to use BertForQuestionAnswering and get an answer from a text and a question
2019-10-11 17:04:02 -07:00
Lysandre
a701c9b321
CTRL to tf automodels
2019-10-11 16:05:30 -04:00
Lysandre
3ddce1d74c
Release: 2.1.1
2019-10-11 06:37:49 -04:00
Thomas Wolf
3b43b01872
Merge pull request #1482 from huggingface/tf2_integration_tests
...
Integration of TF 2.0 models with other Keras modules
2019-10-11 16:25:43 +02:00
thomwolf
18a3cef7d5
no nans
2019-10-11 16:09:42 +02:00
thomwolf
1f5d9513d8
fix test
2019-10-11 15:55:01 +02:00
thomwolf
0f9fc4fbde
adding option to deactivate past/memory outputs
2019-10-11 15:47:08 +02:00
Thomas Wolf
700331b5ec
Merge pull request #1492 from stefan-it/bert-german-dbmdz-models
...
Add new BERT models for German (cased and uncased)
2019-10-11 13:01:52 +02:00
Thomas Wolf
573dde9b44
Merge pull request #1405 from slayton58/xlnet_layer_reorder
...
Re-order XLNet attention head outputs for better perf
2019-10-11 12:10:58 +02:00
Stefan Schweter
5f25a5f367
model: add support for new German BERT models (cased and uncased) from @dbmdz
2019-10-11 10:20:33 +02:00
thomwolf
751e246087
using tf.print in roberta
2019-10-10 15:47:20 +02:00
thomwolf
c9e8c51946
fixing SequenceSummary head in TF 2.0
2019-10-10 15:16:05 +02:00
thomwolf
da26bae61b
adding more tests on TF and pytorch serialization - updating configuration for better serialization
2019-10-10 14:30:48 +02:00
thomwolf
bb04edb45b
Add tests that TF 2.0 model can be integrated with other Keras modules
2019-10-10 13:08:24 +02:00
thomwolf
177a721205
move back to simple space splitting
2019-10-10 11:45:47 +02:00
thomwolf
a5997dd81a
better error messages
2019-10-10 11:31:01 +02:00
thomwolf
43a237f15e
switching to moses tokenizer
2019-10-10 10:11:16 +02:00
LysandreJik
036483fae5
Temporary CTRL tokenizer fix
2019-10-09 16:33:15 -04:00
LysandreJik
9c2e0a4acf
Release: 2.1.0
2019-10-09 12:14:03 -04:00
LysandreJik
7fe98d8c18
Update CTRL documentation
2019-10-09 12:12:36 -04:00
Lysandre Debut
2431fea98a
Merge pull request #1383 from keskarnitish/master
...
Adding CTRL
2019-10-09 11:31:05 -04:00
thomwolf
d9e60f4f0d
Merge branch 'master' into pr/1383
2019-10-09 17:25:08 +02:00
Lysandre Debut
e84470ef81
Merge pull request #1384 from huggingface/encoding-qol
...
Quality of life enhancements in encoding + patch MLM masking
2019-10-09 11:18:24 -04:00
thomwolf
07d055f849
higher tolerance
2019-10-09 17:10:04 +02:00
thomwolf
48b438ff2a
doc and conversion
2019-10-09 17:06:30 +02:00
thomwolf
c19b8e4ae0
fixing CTRL tests and OpenAI GPT tests
2019-10-09 13:51:05 +02:00
thomwolf
6dce6dda1b
fixing TF 2.0 model - adding more severe test on pt/tf equivalence
2019-10-09 11:57:55 +02:00
thomwolf
c56d921dda
adding TF 2.0 model
2019-10-09 11:07:43 +02:00
thomwolf
1c5079952f
simpler distilbert mask - fix tf tests
2019-10-09 04:26:20 +02:00
thomwolf
23b7138ab4
fix #1378 and #1453
2019-10-09 01:54:44 +02:00
thomwolf
45dc04f33d
tf model [WIP]
2019-10-08 17:37:17 +02:00
thomwolf
248314772f
fix tokenization
2019-10-08 17:19:28 +02:00
thomwolf
03c2c762a6
update tokenizer
2019-10-08 17:12:03 +02:00
thomwolf
3edfa1d6aa
update model to use past
2019-10-08 17:11:58 +02:00
VictorSanh
9f81f1cba8
fix convert pt_to_tf2 for custom weights
2019-10-07 12:30:19 -04:00
thomwolf
bd5363cc83
update CTRL configuration
2019-10-07 15:37:30 +02:00
thomwolf
dc89441167
update CTRL pytorch model
2019-10-07 15:37:25 +02:00
thomwolf
320b7a7e01
fix #1416
2019-10-07 14:26:59 +02:00
Santiago Castro
1dea291a02
Remove unnecessary use of FusedLayerNorm in XLNet
2019-10-06 13:35:01 -04:00
thomwolf
78ef1a9930
fixes
2019-10-04 17:59:44 -04:00
thomwolf
6c1d0bc066
update encode_plus - add truncation strategies
2019-10-04 17:38:38 -04:00