HuggingFace_transformer

Author	SHA1	Message	Date
Thomas Wolf	80889a0226	Merge pull request #1512 from louismartin/fix-roberta-convert Fix import error in script to convert faisreq roberta checkpoints	2019-10-14 17:40:32 +02:00
Thomas Wolf	f62f992cf7	Merge pull request #1502 from jeffxtang/master the working example code to use BertForQuestionAnswering	2019-10-14 16:14:52 +02:00
Louis MARTIN	49cba6e543	Fix import error in script to convert faisreq roberta checkpoints	2019-10-14 01:38:57 -07:00
jeffxtang	e76d71521c	the working example code to use BertForQuestionAnswering and get an answer from a text and a question	2019-10-11 17:04:02 -07:00
Lysandre	a701c9b321	CTRL to tf automodels	2019-10-11 16:05:30 -04:00
Lysandre	3ddce1d74c	Release: 2.1.1	2019-10-11 06:37:49 -04:00
Thomas Wolf	3b43b01872	Merge pull request #1482 from huggingface/tf2_integration_tests Integration of TF 2.0 models with other Keras modules	2019-10-11 16:25:43 +02:00
thomwolf	18a3cef7d5	no nans	2019-10-11 16:09:42 +02:00
thomwolf	1f5d9513d8	fix test	2019-10-11 15:55:01 +02:00
thomwolf	0f9fc4fbde	adding option to desactivate past/memory outputs	2019-10-11 15:47:08 +02:00
Thomas Wolf	700331b5ec	Merge pull request #1492 from stefan-it/bert-german-dbmdz-models Add new BERT models for German (cased and uncased)	2019-10-11 13:01:52 +02:00
Thomas Wolf	573dde9b44	Merge pull request #1405 from slayton58/xlnet_layer_reorder Re-order XLNet attention head outputs for better perf	2019-10-11 12:10:58 +02:00
Stefan Schweter	5f25a5f367	model: add support for new German BERT models (cased and uncased) from @dbmdz	2019-10-11 10:20:33 +02:00
thomwolf	751e246087	using tf.print in roberta	2019-10-10 15:47:20 +02:00
thomwolf	c9e8c51946	fixing SequenceSummary head in TF 2.0	2019-10-10 15:16:05 +02:00
thomwolf	da26bae61b	adding more tests on TF and pytorch serialization - updating configuration for better serialization	2019-10-10 14:30:48 +02:00
thomwolf	bb04edb45b	Add tests that TF 2.0 model can be integrated with other Keras modules	2019-10-10 13:08:24 +02:00
thomwolf	177a721205	move back to simple space spliting	2019-10-10 11:45:47 +02:00
thomwolf	a5997dd81a	better error messages	2019-10-10 11:31:01 +02:00
thomwolf	43a237f15e	switching to moses tokenizer	2019-10-10 10:11:16 +02:00
LysandreJik	036483fae5	Temporary CTRL tokenizer fix	2019-10-09 16:33:15 -04:00
LysandreJik	9c2e0a4acf	Release: 2.1.0	2019-10-09 12:14:03 -04:00
LysandreJik	7fe98d8c18	Update CTRL documentation	2019-10-09 12:12:36 -04:00
Lysandre Debut	2431fea98a	Merge pull request #1383 from keskarnitish/master Adding CTRL	2019-10-09 11:31:05 -04:00
thomwolf	d9e60f4f0d	Merge branch 'master' into pr/1383	2019-10-09 17:25:08 +02:00
Lysandre Debut	e84470ef81	Merge pull request #1384 from huggingface/encoding-qol Quality of life enhancements in encoding + patch MLM masking	2019-10-09 11:18:24 -04:00
thomwolf	07d055f849	higher tolerance	2019-10-09 17:10:04 +02:00
thomwolf	48b438ff2a	doc and conversion	2019-10-09 17:06:30 +02:00
thomwolf	c19b8e4ae0	fixing CTRL tests and OpenAI GPT tests	2019-10-09 13:51:05 +02:00
thomwolf	6dce6dda1b	fixing TF 2.0 model - adding more severe test on pt/tf equivalence	2019-10-09 11:57:55 +02:00
thomwolf	c56d921dda	adding TF 2.0 model	2019-10-09 11:07:43 +02:00
thomwolf	1c5079952f	simpler distilbert mask - fix tf tests	2019-10-09 04:26:20 +02:00
thomwolf	23b7138ab4	fix #1378 and #1453	2019-10-09 01:54:44 +02:00
thomwolf	45dc04f33d	tf model [WIP]	2019-10-08 17:37:17 +02:00
thomwolf	248314772f	fix tokenization	2019-10-08 17:19:28 +02:00
thomwolf	03c2c762a6	update tokenizer	2019-10-08 17:12:03 +02:00
thomwolf	3edfa1d6aa	update model to use past	2019-10-08 17:11:58 +02:00
VictorSanh	9f81f1cba8	fix convert pt_to_tf2 for custom weights	2019-10-07 12:30:19 -04:00
thomwolf	bd5363cc83	update CTRL configuration	2019-10-07 15:37:30 +02:00
thomwolf	dc89441167	update CTRL pytorch model	2019-10-07 15:37:25 +02:00
thomwolf	320b7a7e01	fix #1416	2019-10-07 14:26:59 +02:00
thomwolf	78ef1a9930	fixes	2019-10-04 17:59:44 -04:00
thomwolf	6c1d0bc066	update encode_plus - add truncation strategies	2019-10-04 17:38:38 -04:00
thomwolf	92c0f2fb90	Merge remote-tracking branch 'origin/julien_multiple-choice' into encoding-qol	2019-10-04 15:48:06 -04:00
LysandreJik	7bddb45a6f	Decode documentaton	2019-10-04 14:27:38 -04:00
keskarnitish	dbed1c5d94	Adding CTRL (squashed commit) adding conversion script adding first draft of modeling & tokenization adding placeholder for test files bunch of changes registering the tokenizer/model/etc tests change link; something is very VERY wrong here weird end-of-word thingy going on i think the tokenization works now ; wrote the unit tests overall structure works;load w next the monster is alive! works after some cleanup as well adding emacs autosave to gitignore currently only supporting the 48 layer one; seems to infer fine on my macbook cleanup fixing some documentation fixing some documentation tests passing? now works on CUDA also adding greedy? adding greedy sampling works well	2019-10-03 22:29:03 -07:00
Thomas Wolf	1569610f2d	Merge pull request #1296 from danai-antoniou/add-duplicate-tokens-error Added ValueError for duplicates in list of added tokens	2019-10-03 17:06:17 -04:00
drc10723	e1b2949ae6	DistillBert Documentation Code Example fixes	2019-10-03 15:51:33 -04:00
Simon Layton	899883644f	Fix test fails and warnings Attention output was in bnij ordering instead of ijbn which everything else will expect. This was an oversight on my part, and keeps the attention inputs/outputs identical to the original code. Also moved back from tensor slicing to index_select in rel_shift_bnij to make the tracer happy.	2019-10-03 12:05:15 -04:00
LysandreJik	aebd83230f	Update naming + remove f string in run_lm_finetuning example	2019-10-03 11:31:36 -04:00

1 2

77 Commits