HuggingFace_transformer

Author	SHA1	Message	Date
Sam Shleifer	07dd7c2fd8	[cleanup] test_tokenization_common.py (#4390 )	2020-05-19 10:46:55 -04:00
Iz Beltagy	8f1d047148	Longformer (#4352 ) * first commit * bug fixes * better examples * undo padding * remove wrong VOCAB_FILES_NAMES * License * make style * make isort happy * unit tests * integration test * make `black` happy by undoing `isort` changes!! * lint * no need for the padding value * batch_size not bsz * remove unused type casting * seqlen not seq_len * staticmethod * `bert` selfattention instead of `n2` * uint8 instead of bool + lints * pad inputs_embeds using embeddings not a constant * black * unit test with padding * fix unit tests * remove redundant unit test * upload model weights * resolve todo * simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_ * increase unittest coverage	2020-05-19 16:04:43 +02:00
Shaoyen	384f0eb2f9	Map optimizer to correct device after loading from checkpoint. (#4403 ) * Map optimizer to correct device after loading from checkpoint. * Make style test pass Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-18 23:16:05 -04:00
Julien Chaumond	bf14ef75f1	[Trainer] move model to device before setting optimizer (#4450 )	2020-05-18 23:13:33 -04:00
Julien Chaumond	5e7fe8b585	Distributed eval: SequentialDistributedSampler + gather all results (#4243 ) * Distributed eval: SequentialDistributedSampler + gather all results * For consistency only write to disk from world_master Close https://github.com/huggingface/transformers/issues/4272 * Working distributed eval * Hook into scripts * Fix #3721 again * TPU.mesh_reduce: stay in tensor space Thanks @jysohn23 * Just a small comment * whitespace * torch.hub: pip install packaging * Add test scenarii	2020-05-18 22:02:39 -04:00
Julien Chaumond	4c06893610	Fix nn.DataParallel compatibility in PyTorch 1.5 (#4300 ) * Test case for #3936 * multigpu tests pass on pytorch 1.4.0 * Fixup * multigpu tests pass on pytorch 1.5.0 * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * rename multigpu to require_multigpu * mode doc	2020-05-18 20:34:50 -04:00
Rakesh Chada	9de4afa897	Make get_last_lr in trainer backward compatible (#4446 ) * makes fetching last learning late in trainer backward compatible * split comment to multiple lines * fixes black styling issue * uses version to create a more explicit logic	2020-05-18 20:17:36 -04:00
Funtowicz Morgan	ca4a3f4da9	Adding optimizations block from ONNXRuntime. (#4431 ) * Adding optimizations block from ONNXRuntime. * Turn off external data format by default for PyTorch export. * Correct the way use_external_format is passed through the cmdline args.	2020-05-18 20:32:33 +02:00
Patrick von Platen	d39bf0ac2d	better naming in tf t5 (#4401 )	2020-05-18 11:34:00 -04:00
Patrick von Platen	590adb130b	improve docstring (#4422 )	2020-05-18 11:31:35 -04:00
Patrick von Platen	026a5d0888	[T5 fp16] Fix fp16 in T5 (#4436 ) * fix fp16 in t5 * make style * refactor invert_attention_mask fn * fix typo	2020-05-18 17:25:58 +02:00
Patrick von Platen	a27c795908	fix (#4419 )	2020-05-18 15:51:40 +02:00
Mehrad Moradshahi	8581a670e3	[MbartTokenizer] save to sentencepiece.bpe.model (#4335 )	2020-05-18 08:54:04 -04:00
Lorenzo Ampil	18d233d525	Allow the creation of "entity groups" for NerPipeline #3548 (#3957 ) * Add index to be returned by NerPipeline to allow for the creation of * Add entity groups * Convert entity list to dict * Add entity to entity_group_disagg atfter updating entity gorups * Change 'group' parameter to 'grouped_entities' * Add unit tests for grouped NER pipeline case * Correct variable name typo for NER_FINETUNED_MODELS * Sync grouped tests to recent test updates	2020-05-17 09:25:17 +02:00
Julien Chaumond	3e0f062106	Fix addcmul_	2020-05-15 17:44:17 -04:00
Julien Chaumond	fc2a4c88ce	Fix: one more try	2020-05-15 17:38:48 -04:00
Julien Chaumond	55bda52555	Same fix for `addcmul_`	2020-05-15 17:23:48 -04:00
Julien Chaumond	ad02c961c6	Fix UserWarning: This overload of add_ is deprecated in pytorch==1.5.0	2020-05-15 17:09:11 -04:00
Julien Chaumond	15550ce0d1	[skip ci] remove local rank	2020-05-15 17:08:38 -04:00
Jared T Nielsen	34706ba050	Allow for None gradients in GradientAccumulator. (#4372 )	2020-05-15 09:52:00 -04:00
Lysandre Debut	7defc6670f	p_mask in SQuAD pre-processing (#4049 ) * Better p_mask building * Adressing @mfuntowicz comments	2020-05-14 17:07:52 -04:00
Funtowicz Morgan	db0076a9df	Conversion script to export transformers models to ONNX IR. (#4253 ) * Added generic ONNX conversion script for PyTorch model. * WIP initial TF support. * TensorFlow/Keras ONNX export working. * Print framework version info * Add possibility to check the model is correctly loading on ONNX runtime. * Remove quantization option. * Specify ONNX opset version when exporting. * Formatting. * Remove unused imports. * Make functions more generally reusable from other part of the code. * isort happy. * flake happy * Export only feature-extraction for now * Correctly check inputs order / filter before export. * Removed task variable * Fix invalid args call in load_graph_from_args. * Fix invalid args call in convert. * Fix invalid args call in infer_shapes. * Raise exception and catch in caller function instead of exit. * Add 04-onnx-export.ipynb notebook * More WIP on the notebook * Remove unused imports * Simplify & remove unused constants. * Export with constant_folding in PyTorch * Let's try to put function args in the right order this time ... * Disable external_data_format temporary * ONNX notebook draft ready. * Updated notebooks charts + wording * Correct error while exporting last chart in notebook. * Adressing @LysandreJik comment. * Set ONNX opset to 11 as default value. * Set opset param mandatory * Added ONNX export unittests * Quality. * flake8 happy * Add keras2onnx dependency on extras["tf"] * Pin keras2onnx on github master to v1.6.5 * Second attempt. * Third attempt. * Use the right repo URL this time ... * Do the same for onnxconverter-common * Added keras2onnx and onnxconveter-common to 1.7.0 to supports TF2.2 * Correct commit hash. * Addressing PR review: Optimization are enabled by default. * Addressing PR review: small changes in the notebook * setup.py comment about keras2onnx versioning.	2020-05-14 16:35:52 -04:00
Suraj Patil	2d05480174	Fix trainer evaluation (#4363 ) * fix loss calculation in evaluation * fix evaluation on TPU when prediction_loss_only is True	2020-05-14 14:39:44 -04:00
Sam Shleifer	9535bf1977	Tokenizer.batch_decode convenience method (#4159 )	2020-05-14 13:50:47 -04:00
Sam Shleifer	7822cd38a0	[tests] make pipelines tests faster with smaller models (#4238 ) covers torch and tf. Also fixes a failing @slow test	2020-05-14 13:36:02 -04:00
Julien Chaumond	448c467256	Fix: unpin flake8 and fix cs errors (#4367 ) * Fix: unpin flake8 and fix cs errors * Ok we still need to quote those	2020-05-14 13:14:26 -04:00
Julien Chaumond	c547f15a17	Use Filelock to ensure distributed barriers see context in https://github.com/huggingface/transformers/pull/4223	2020-05-14 11:58:32 -04:00
Lysandre Debut	ef46ccb05c	TPU needs a rendezvous (#4339 )	2020-05-14 08:59:52 -04:00
Lysandre	7cb203fae4	Release: v2.9.1 Some checks failed GitHub-hosted runner / check_code_quality (push) Has been cancelled Details	2020-05-13 17:38:50 -04:00
Sam Shleifer	9a687ebb77	[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290 )	2020-05-13 17:29:41 -04:00
Julien Plu	ca13618681	Question Answering for TF trainer (#4320 ) * Add QA trainer example for TF * Make data_dir optional * Fix parameter logic * Fix feature convert * Update the READMEs to add the question-answering task * Apply style * Change 'sequence-classification' to 'text-classification' and prefix with 'eval' all the metric names * Apply style * Apply style	2020-05-13 09:22:31 -04:00
Denis	1e51bb717c	Fix for #3865 . PretrainedTokenizer mapped " do not" into " don't" when .decode(...) is called. Removed the " do not" --> " don't" mapping from clean_up_tokenization(...). (#4024 )	2020-05-13 14:32:57 +02:00
Julien Chaumond	241759101e	(v2) Improvements to the wandb integration (#4324 ) * Improvements to the wandb integration * small reorg + no global necessary * feat(trainer): log epoch and final metrics * Simplify logging a bit * Fixup * Fix crash when just running eval Co-authored-by: Chris Van Pelt <vanpelt@gmail.com> Co-authored-by: Boris Dayma <boris.dayma@gmail.com>	2020-05-12 21:52:01 -04:00
Funtowicz Morgan	7d7fe4997f	Allow BatchEncoding to be initialized empty. (#4316 ) * Allow BatchEncoding to be initialized empty. This is required by recent changes introduced in TF 2.2. * Attempt to unpin Tensorflow to 2.2 with the previous commit.	2020-05-12 15:02:46 -04:00
Julien Chaumond	4bf5042240	Fix BART tests on GPU (#4298 )	2020-05-12 09:11:50 -04:00
Viktor Alm	e4512aab3b	Add MultipleChoice to TFTrainer [WIP] (#4270 ) * catch gpu len 1 set to gpu0 * Add mpc to trainer * Add MPC for TF * fix TF automodel for MPC and add Albert * Apply style * Fix import * Note to self: double check * Make shape None, None for datasetgenerator output shapes * Add from_pt bool which doesnt seem to work * Original checkpoint dir * Fix docstrings for automodel * Update readme and apply style * Colab should probably not be from users * Colabs should probably not be from users * Add colab * Update README.md * Update README.md * Cleanup __intit__ * Cleanup flake8 trailing comma * Update src/transformers/training_args_tf.py * Update src/transformers/modeling_tf_auto.py Co-authored-by: Viktor Alm <viktoralm@pop-os.localdomain> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-12 08:48:48 -04:00
Jangwon Park	31e67dd19f	Remove hard-coded pad token id in distilbert and albert (#3965 )	2020-05-12 08:32:44 -04:00
Bram Vanroy	61d22f9cc7	Simplify cache vars and allow for TRANSFORMERS_CACHE env (#4226 ) * simplify cache vars and allow for TRANSFORMERS_CACHE env As it currently stands, "TRANSFORMERS_CACHE" is not an accepted variable. It seems that the these variables were not updated when moving from version pytorch_transformers to transformers. In addition, the fallback procedure could be improved. and simplified. Pathlib seems redundant here. * Update file_utils.py	2020-05-11 15:24:02 -04:00
Lysandre Debut	cd40cb8879	Fix special token doc (#4292 )	2020-05-11 15:05:36 -04:00
Tianlei Wu	82601f4c1a	Allow gpt2 to be exported to valid ONNX (#4244 ) * allow gpt2 to be exported to valid ONNX model * cast size from int to float explictly	2020-05-11 14:55:55 -04:00
Lysandre Debut	051dcb2a07	CamemBERT does not make use of Token Type IDs (#4289 )	2020-05-11 13:31:03 -04:00
fgaim	41e8291217	Add ALBERT to the Tensorflow to Pytorch model conversion cli (#3933 ) * Add ALBERT to convert command of transformers-cli * Document ALBERT tf to pytorch model conversion	2020-05-11 13:10:00 -04:00
Funtowicz Morgan	8fdb7997c6	Align sentiment-analysis' tokenizer (currently uncased) to the model (uncased). (#4264 )	2020-05-11 12:45:53 -04:00
Sam Shleifer	4658896ee1	[Marian] Fix typo in docstring (#4284 )	2020-05-11 11:47:51 -04:00
Julien Plu	94b57bf796	[TF 2.2 compat] use tf.VariableAggregation.ONLY_FIRST_REPLICA (#4283 ) * Fix the issue to properly run the accumulator with TF 2.2 * Apply style * Fix training_args_tf for TF 2.2 * Fix the TF training args when only one GPU is available * Remove the fixed version of TF in setup.py	2020-05-11 11:28:37 -04:00
theblackcat102	7751be7cee	fix reformer apex scaling issue (#4242 )	2020-05-11 16:53:42 +02:00
Patrick von Platen	ac7d5f67a2	[Reformer] Add Enwiki8 Reformer Model - Adapt convert script (#4282 ) * adapt convert script * update convert script * finish * fix marian pretrained docs	2020-05-11 16:38:07 +02:00
flozi00	b290c32e16	[docs] fix typo (#4249 )	2020-05-10 14:07:08 -04:00
Sam Shleifer	3487be75ef	[Marian] documentation and AutoModel support (#4152 ) - MarianSentencepieceTokenizer - > MarianTokenizer - Start using unk token. - add docs page - add better generation params to MarianConfig - more conversion utilities	2020-05-10 13:54:57 -04:00
Julien Chaumond	7b75aa9fa5	[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223 ) * [TPU] Doc, fix xla_spawn.py, only preprocess dataset once * Update examples/README.md * [xla_spawn] Add `_mp_fn` to other Trainer scripts * [TPU] Fix: eval dataloader was None	2020-05-08 14:10:05 -04:00

1 2 3 4 5 ...

574 Commits