Thomas Wolf
80889a0226
Merge pull request #1512 from louismartin/fix-roberta-convert
...
Fix import error in script to convert faisreq roberta checkpoints
2019-10-14 17:40:32 +02:00
Thomas Wolf
f62f992cf7
Merge pull request #1502 from jeffxtang/master
...
the working example code to use BertForQuestionAnswering
2019-10-14 16:14:52 +02:00
Louis MARTIN
49cba6e543
Fix import error in script to convert faisreq roberta checkpoints
2019-10-14 01:38:57 -07:00
jeffxtang
e76d71521c
the working example code to use BertForQuestionAnswering and get an answer from a text and a question
2019-10-11 17:04:02 -07:00
Lysandre
a701c9b321
CTRL to tf automodels
2019-10-11 16:05:30 -04:00
Lysandre
3ddce1d74c
Release: 2.1.1
2019-10-11 06:37:49 -04:00
Thomas Wolf
3b43b01872
Merge pull request #1482 from huggingface/tf2_integration_tests
...
Integration of TF 2.0 models with other Keras modules
2019-10-11 16:25:43 +02:00
thomwolf
18a3cef7d5
no nans
2019-10-11 16:09:42 +02:00
thomwolf
1f5d9513d8
fix test
2019-10-11 15:55:01 +02:00
thomwolf
0f9fc4fbde
adding option to desactivate past/memory outputs
2019-10-11 15:47:08 +02:00
Thomas Wolf
700331b5ec
Merge pull request #1492 from stefan-it/bert-german-dbmdz-models
...
Add new BERT models for German (cased and uncased)
2019-10-11 13:01:52 +02:00
Thomas Wolf
573dde9b44
Merge pull request #1405 from slayton58/xlnet_layer_reorder
...
Re-order XLNet attention head outputs for better perf
2019-10-11 12:10:58 +02:00
Stefan Schweter
5f25a5f367
model: add support for new German BERT models (cased and uncased) from @dbmdz
2019-10-11 10:20:33 +02:00
thomwolf
751e246087
using tf.print in roberta
2019-10-10 15:47:20 +02:00
thomwolf
c9e8c51946
fixing SequenceSummary head in TF 2.0
2019-10-10 15:16:05 +02:00
thomwolf
da26bae61b
adding more tests on TF and pytorch serialization - updating configuration for better serialization
2019-10-10 14:30:48 +02:00
thomwolf
bb04edb45b
Add tests that TF 2.0 model can be integrated with other Keras modules
2019-10-10 13:08:24 +02:00
thomwolf
177a721205
move back to simple space spliting
2019-10-10 11:45:47 +02:00
thomwolf
a5997dd81a
better error messages
2019-10-10 11:31:01 +02:00
thomwolf
43a237f15e
switching to moses tokenizer
2019-10-10 10:11:16 +02:00
LysandreJik
036483fae5
Temporary CTRL tokenizer fix
2019-10-09 16:33:15 -04:00
LysandreJik
9c2e0a4acf
Release: 2.1.0
2019-10-09 12:14:03 -04:00
LysandreJik
7fe98d8c18
Update CTRL documentation
2019-10-09 12:12:36 -04:00
Lysandre Debut
2431fea98a
Merge pull request #1383 from keskarnitish/master
...
Adding CTRL
2019-10-09 11:31:05 -04:00
thomwolf
d9e60f4f0d
Merge branch 'master' into pr/1383
2019-10-09 17:25:08 +02:00
Lysandre Debut
e84470ef81
Merge pull request #1384 from huggingface/encoding-qol
...
Quality of life enhancements in encoding + patch MLM masking
2019-10-09 11:18:24 -04:00
thomwolf
07d055f849
higher tolerance
2019-10-09 17:10:04 +02:00
thomwolf
48b438ff2a
doc and conversion
2019-10-09 17:06:30 +02:00
thomwolf
c19b8e4ae0
fixing CTRL tests and OpenAI GPT tests
2019-10-09 13:51:05 +02:00
thomwolf
6dce6dda1b
fixing TF 2.0 model - adding more severe test on pt/tf equivalence
2019-10-09 11:57:55 +02:00
thomwolf
c56d921dda
adding TF 2.0 model
2019-10-09 11:07:43 +02:00
thomwolf
1c5079952f
simpler distilbert mask - fix tf tests
2019-10-09 04:26:20 +02:00
thomwolf
23b7138ab4
fix #1378 and #1453
2019-10-09 01:54:44 +02:00
thomwolf
45dc04f33d
tf model [WIP]
2019-10-08 17:37:17 +02:00
thomwolf
248314772f
fix tokenization
2019-10-08 17:19:28 +02:00
thomwolf
03c2c762a6
update tokenizer
2019-10-08 17:12:03 +02:00
thomwolf
3edfa1d6aa
update model to use past
2019-10-08 17:11:58 +02:00
VictorSanh
9f81f1cba8
fix convert pt_to_tf2 for custom weights
2019-10-07 12:30:19 -04:00
thomwolf
bd5363cc83
update CTRL configuration
2019-10-07 15:37:30 +02:00
thomwolf
dc89441167
update CTRL pytorch model
2019-10-07 15:37:25 +02:00
thomwolf
320b7a7e01
fix #1416
2019-10-07 14:26:59 +02:00
thomwolf
78ef1a9930
fixes
2019-10-04 17:59:44 -04:00
thomwolf
6c1d0bc066
update encode_plus - add truncation strategies
2019-10-04 17:38:38 -04:00
thomwolf
92c0f2fb90
Merge remote-tracking branch 'origin/julien_multiple-choice' into encoding-qol
2019-10-04 15:48:06 -04:00
LysandreJik
7bddb45a6f
Decode documentaton
2019-10-04 14:27:38 -04:00
keskarnitish
dbed1c5d94
Adding CTRL (squashed commit)
...
adding conversion script
adding first draft of modeling & tokenization
adding placeholder for test files
bunch of changes
registering the tokenizer/model/etc
tests
change link; something is very VERY wrong here
weird end-of-word thingy going on
i think the tokenization works now ; wrote the unit tests
overall structure works;load w next
the monster is alive!
works after some cleanup as well
adding emacs autosave to gitignore
currently only supporting the 48 layer one; seems to infer fine on my macbook
cleanup
fixing some documentation
fixing some documentation
tests passing?
now works on CUDA also
adding greedy?
adding greedy sampling
works well
2019-10-03 22:29:03 -07:00
Thomas Wolf
1569610f2d
Merge pull request #1296 from danai-antoniou/add-duplicate-tokens-error
...
Added ValueError for duplicates in list of added tokens
2019-10-03 17:06:17 -04:00
drc10723
e1b2949ae6
DistillBert Documentation Code Example fixes
2019-10-03 15:51:33 -04:00
Simon Layton
899883644f
Fix test fails and warnings
...
Attention output was in bnij ordering instead of ijbn which everything
else will expect. This was an oversight on my part, and keeps the
attention inputs/outputs identical to the original code.
Also moved back from tensor slicing to index_select in rel_shift_bnij to
make the tracer happy.
2019-10-03 12:05:15 -04:00
LysandreJik
aebd83230f
Update naming + remove f string in run_lm_finetuning example
2019-10-03 11:31:36 -04:00