HuggingFace_transformer

Author	SHA1	Message	Date
Patrick von Platen	7fb2a8b3d9	up (#14008 )	2021-10-14 15:46:22 +02:00
Lysandre Debut	7604557e44	Fix FNet tokenizer tests (#13995 )	2021-10-14 09:07:51 -04:00
Sylvain Gugger	f2002fea11	Add strong test for configuration attributes (#14000 ) * Add strong test for configuration attributes * Add fake modif to trigger all tests * Add a better fake modif * Ignore is_encoder_decoder * Fix faulty configs * Remove fake modif	2021-10-14 09:07:08 -04:00
Patrick von Platen	cc36064960	up (#13988 )	2021-10-14 10:54:20 +02:00
NielsRogge	408b2d2bd0	Add TrOCR + VisionEncoderDecoderModel (#13874 ) * First draft * Update self-attention of RoBERTa as proposition * Improve conversion script * Add TrOCR decoder-only model * More improvements * Make forward pass with pretrained weights work * More improvements * Some more improvements * More improvements * Make conversion work * Clean up print statements * Add documentation, processor * Add test files * Small improvements * Some more improvements * Make fix-copies, improve docs * Make all vision encoder decoder model tests pass * Make conversion script support other models * Update URL for OCR image * Update conversion script * Fix style & quality * Add support for the large-printed model * Fix some issues * Add print statement for debugging * Add print statements for debugging * Make possible fix for sinusoidal embedding * Further debugging * Potential fix v2 * Add more print statements for debugging * Add more print statements for debugging * Deubg more * Comment out print statements * Make conversion of large printed model possible, address review comments * Make it possible to convert the stage1 checkpoints * Clean up code, apply suggestions from code review * Apply suggestions from code review, use Microsoft models in tests * Rename encoder_hidden_size to cross_attention_hidden_size * Improve docs	2021-10-13 10:28:56 +02:00
Yih-Dar	8b240a0661	Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222 ) * Add cross attentions to TFGPT2Model * Add TFEncoderDecoderModel * Add TFBaseModelOutputWithPoolingAndCrossAttentions * Add cross attentions to TFBertModel * Fix past or past_key_values argument issue * Fix generation * Fix save and load * Add some checks and comments * Clean the code that deals with past keys/values * Add kwargs to processing_inputs * Add serving_output to TFEncoderDecoderModel * Some cleaning + fix use_cache value issue * Fix tests + add bert2bert/bert2gpt2 tests * Fix more tests * Ignore crossattention.bias when loading GPT2 weights into TFGPT2 * Fix return_dict_in_generate in tf generation * Fix is_token_logit_eos_token bug in tf generation * Finalize the tests after fixing some bugs * Fix another is_token_logit_eos_token bug in tf generation * Add/Update docs * Add TFBertEncoderDecoderModelTest * Clean test script * Add TFEncoderDecoderModel to the library * Add cross attentions to TFRobertaModel * Add TFRobertaEncoderDecoderModelTest * make style * Change the way of position_ids computation * bug fix * Fix copies in tf_albert * Remove some copied from and apply some fix-copies * Remove some copied * Add cross attentions to some other TF models * Remove encoder_hidden_states from TFLayoutLMModel.call for now * Make style * Fix TFRemBertForCausalLM * Revert the change to longformer + Remove copies * Revert the change to albert and convbert + Remove copies * make quality * make style * Add TFRembertEncoderDecoderModelTest * make quality and fix-copies * test TFRobertaForCausalLM * Fixes for failed tests * Fixes for failed tests * fix more tests * Fixes for failed tests * Fix Auto mapping order * Fix TFRemBertEncoder return value * fix tf_rembert * Check copies are OK * Fix missing TFBaseModelOutputWithPastAndCrossAttentions is not defined * Add TFEncoderDecoderModelSaveLoadTests * fix tf weight loading * check the change of use_cache * Revert the change * Add missing test_for_causal_lm for TFRobertaModelTest * Try cleaning past * fix _reorder_cache * Revert some files to original versions * Keep as many copies as possible * Apply suggested changes - Use raise ValueError instead of assert * Move import to top * Fix wrong require_torch * Replace more assert by raise ValueError * Add test_pt_tf_model_equivalence (the test won't pass for now) * add test for loading/saving * finish * finish * Remove test_pt_tf_model_equivalence * Update tf modeling template * Remove pooling, added in the prev. commit, from MainLayer * Update tf modeling test template * Move inputs["use_cache"] = False to modeling_tf_utils.py * Fix torch.Tensor in the comment * fix use_cache * Fix missing use_cache in ElectraConfig * Add a note to from_pretrained * Fix style * Change test_encoder_decoder_save_load_from_encoder_decoder_from_pt * Fix TFMLP (in TFGPT2) activation issue * Fix None past_key_values value in serving_output * Don't call get_encoderdecoder_model in TFEncoderDecoderModelTest.test_configuration_tie until we have a TF checkpoint on Hub * Apply review suggestions - style for cross_attns in serving_output * Apply review suggestions - change assert + docstrings * break the error message to respect the char limit * deprecate the argument past * fix docstring style * Update the encoder-decoder rst file * fix Unknown interpreted text role "method" * fix typo Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-10-13 00:10:34 +02:00
Patrick von Platen	58bf882579	[Wav2Vec2] Make sure tensors are always bool for mask_indices (#13977 ) * correct long to bool * up * correct code	2021-10-12 18:17:06 +02:00
Mishig Davaadorj	11c043d27d	Specify im-seg mask greyscole mode (#13974 )	2021-10-12 16:26:18 +02:00
Patrick von Platen	d45fc7da3d	[Speech Examples] Add pytorch speech pretraining (#13877 ) * adapt wav2vec2 * add example * add files * adapt * remove bogus file * Apply suggestions from code review * adapt files more * upload changes * del old files * up * up * up * up * up * correct gradient checkpoitning * add readme * finish * finish * up * more fixes * up * up * add demo run to readme * up	2021-10-12 00:46:32 +02:00
Luis F. Talavera R	e1bb2ebd92	Replace assert with unittest assertions (#13957 )	2021-10-11 10:21:46 -04:00
Patrick von Platen	dca6796876	[Gradient checkpoining] Correct disabling `find_unused_parameters` in Trainer when gradient checkpointing is enabled (#13961 ) * up * correct test	2021-10-11 15:34:01 +02:00
Sylvain Gugger	4a18337bae	Honor existing attention mask in tokenzier.pad (#13926 ) * Honor existing attention mask in tokenzier.pad * Fix initialization of attention mask * Roll the implem on all subclasses * Fix tests	2021-10-11 09:12:09 -04:00
Patrick von Platen	c8b07612a1	[Generation] Fix max_new_tokens (#13919 ) * up * Update src/transformers/generation_stopping_criteria.py * finish	2021-10-08 17:28:18 +02:00
Nicolas Patry	d70919e6d5	Adding support for tokens being suffixes or part of each other. (#13918 ) * Adding support for tokens being suffixes or part of each other. * Better test name.	2021-10-08 10:10:38 +02:00
Mishig Davaadorj	026866df92	Image Segmentation pipeline (#13828 ) * Implement img seg pipeline * Update src/transformers/pipelines/image_segmentation.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/pipelines/image_segmentation.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update output shape with individual masks * Rm dev change * Remove loops in test Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>	2021-10-08 09:59:53 +02:00
Matt	61cf2ea9c0	Fix incorrect output shapes for TF/PT LED (#13882 ) * Fix issues with LED model * Style pass * Bugfixes * correct attentions as well Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-10-07 17:30:15 +01:00
Patrick von Platen	0f5488f79f	[Wav2Vec2] Fix mask_feature_prob (#13921 ) * up * overwrite hubert	2021-10-07 19:07:32 +03:00
Nicolas Patry	013bdc6d65	Fixing Backward compatiblity for zero-shot (#13855 ) Fixes #13846	2021-10-05 23:06:47 -04:00
Nicolas Patry	e7b16f33ae	Fixing GPU for token-classification in a better way. (#13856 ) Co-authored-by: Pierre Snell <pierre.snell@botpress.com> Co-authored-by: Pierre Snell <pierre.snell@botpress.com>	2021-10-05 22:44:31 -04:00
Nicolas Patry	0ddadbf0a8	Fixing question-answering with long contexts (#13873 ) * Tmp. * Fixing BC for question answering with long context. * Capping model_max_length to avoid tf overflow. * Bad workaround bugged roberta. * Fixing name.	2021-10-05 16:08:58 +02:00
Zhaofeng Wu	1b74af76b7	Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler (#13820 ) * Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler * Fix	2021-10-05 09:04:39 -04:00
Michael Benayoun	d4e4efce68	Initial support for symbolic tracing with torch.fx allowing dynamic axes (#13579 ) * Symbolic trace dynamic axes support for BERT like models (albert, bert, distilbert, mobilebert, electra, megatron-bert) * Sanity checks before tracing that make sure the model to trace is supported * Adapted to PyTorch 1.9 Co-authored-by: Michael Benayoun <michael@huggingface.co>	2021-10-05 14:19:47 +02:00
Nicolas Patry	3a9c0f23b4	Fixing empty prompts for text-generation when BOS exists. (#13859 ) * Fixing empty prompts for text-generation when BOS exists. * Fixing odd case with Pegasus. * Fixing Bert is Assertion Error.	2021-10-05 13:46:10 +02:00
Nicolas Patry	7079a99e76	Fixing 1-length special tokens cut. (#13862 )	2021-10-05 12:26:54 +02:00
Bram Vanroy	12b4d66a80	Update no_* argument (HfArgumentParser) (#13865 ) * update no_* argument Changes the order so that the no_* argument is created after the original argument AND sets the default for this no_* argument to False * import copy * update test * make style * Use kwargs to set default=False * make style	2021-10-04 16:28:52 -04:00
Sidd Karamcheti	3a8de58c51	Add Mistral GPT-2 Stability Tweaks (#13573 ) * Add layer-wise scaling * Add reorder & upcasting argument * Add OpenAI GPT-2 weight initialization scheme * start `layer_idx` count at zero for consistency * disentangle attn and reordered and upscaled attn function * rename `scale_attn_by_layer` to `scale_attn_by_layer_id` * make autocast from amp compatible with pytorch<1.6 * fix docstring * style fixes * Add fixes from PR feedback, style tweaks * Fix doc whitespace * Reformat * First pass scale_attn_by_layer_idx and reorder_and_upcast_attn tests * Rename scale_attn_by_layer_idx, add tip * Remove extra newline * add test for weight initialization * update code format * add assert check weights are fp32 * remove assert * Fix incorrect merge * Fix shape mismatch in baddbmm * Add generation test for Mistral flags Co-authored-by: leandro <leandro.vonwerra@spoud.io> Co-authored-by: Keshav Santhanam <keshav2@stanford.edu> Co-authored-by: J38 <jebolton@stanford.edu>	2021-10-04 07:37:09 -04:00
Suraj Patil	8bbb53e20b	skip gptj slow generate tests for now (#13809 )	2021-09-30 15:44:33 -04:00
Patrick von Platen	41436d3dfb	[DPR] Correct init (#13796 ) * update * add to docs and init * make fix-copies	2021-09-30 18:55:20 +02:00
Sylvain Gugger	63cc5bda60	Fix length of IterableDatasetShard and add test (#13792 ) * Fix length of IterableDatasetShard and add test * Add comments	2021-09-29 11:48:48 -04:00
Li-Huai (Allan) Lin	7d84c3a488	Enable readme link synchronization (#13785 ) * Enable readme link synchronization * Style * Reuse regex pattern * Apply suggestions * Update	2021-09-29 11:18:59 -04:00
Anton Lozhkov	e0d31a8982	[Tests] Cast Hubert test models to fp16 (#13755 )	2021-09-26 22:58:23 +03:00
Patrick von Platen	067413fb73	finish (#13743 )	2021-09-25 21:20:21 +02:00
Patrick von Platen	e579f855fa	up (#13729 )	2021-09-24 08:57:49 -04:00
Nicolas Patry	0eabe49204	Fixing zero-shot backward compatiblity (#13725 ) Fixes #13697	2021-09-24 07:38:17 -04:00
kding1	6a3a197fcd	Add SigOpt HPO to transformers trainer api (#13572 ) * add sigopt hpo to transformers. Signed-off-by: Ding, Ke <ke.ding@intel.com> * extend sigopt changes to test code and others.. Signed-off-by: Ding, Ke <ke.ding@intel.com> * Style. * fix style for sigopt integration. Signed-off-by: Ding, Ke <ke.ding@intel.com> * Add necessary information to run unittests on SigOpt. Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2021-09-23 17:01:51 +02:00
Lysandre Debut	ca257a06cc	Fix torchscript tests (#13701 )	2021-09-22 19:02:54 -04:00
Anton Lozhkov	7c7d2ec952	[GPT-J] Use the `float16` checkpoints in integration tests (#13676 ) * Use fp16 checkpoints * Style * Fix outputs and disable OOM tests * Correct another output * Use a random smaller model for generation tests * repo quickfix * fix gradient checkpointing	2021-09-22 23:17:57 +03:00
Sylvain Gugger	27d4639779	Make gradient_checkpointing a training argument (#13657 ) * Make gradient_checkpointing a training argument * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/configuration_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Fix tests * Style * document Gradient Checkpointing as a performance feature * Small rename * PoC for not using the config * Adapt BC to new PoC * Forgot to save * Rollout changes to all other models * Fix typo Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Stas Bekman <stas@stason.org>	2021-09-22 07:51:38 -04:00
Anton Lozhkov	75f6641eaf	[Wav2Vec2FeatureExtractor] Fix `extractor.pad()` dtype backwards compatibility (#13693 ) * Force dtype, add tests * Local torch imports * Remove unused logic (always ndarray)	2021-09-22 11:02:54 +02:00
Patrick von Platen	8e908c8c74	[AutoTokenizer] Allow creation of tokenizers by tokenizer type (#13668 ) * up * up	2021-09-22 00:29:38 +02:00
Patrick von Platen	2608944dc2	up (#13688 )	2021-09-22 00:28:43 +02:00
Sylvain Gugger	d16bec9530	Skip FlaxWav2Vec2 test until fixed	2021-09-21 16:17:01 -04:00
Nishant Prabhu	ddd4d02f30	Layoutlm onnx support (Issue #13300 ) (#13562 ) * Add support for exporting PyTorch LayoutLM to ONNX * Added tests for converting LayoutLM to ONNX * Add support for exporting PyTorch LayoutLM to ONNX * Added tests for converting LayoutLM to ONNX * cleanup * Removed regression/ folder * Add support for exporting PyTorch LayoutLM to ONNX * Added tests for converting LayoutLM to ONNX * cleanup * Fixed import error * Remove unnecessary import statements * Changed max_2d_positions from class variable to instance variable of the config class * Add support for exporting PyTorch LayoutLM to ONNX * Added tests for converting LayoutLM to ONNX * cleanup * Add support for exporting PyTorch LayoutLM to ONNX * cleanup * Fixed import error * Changed max_2d_positions from class variable to instance variable of the config class * Use super class generate_dummy_inputs method Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Add support for Masked LM, sequence classification and token classification Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Removed uncessary import and method * Fixed code styling * Raise error if PyTorch is not installed * Remove unnecessary import statement Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>	2021-09-21 15:39:37 -04:00
Anton Lozhkov	1417978cd4	[SequenceFeatureExtractor] Rewrite padding logic from pure python to numpy (#13650 ) * Test np padding * Pass feature extraction tests * Update type hints * Fix flaky integration tests * Try a more stable waveform * Add to_numpy jax support * int32 attention masks * Refactor normalization tests	2021-09-21 17:10:13 +03:00
Kamal Raj	8d533e6ad6	Typo "UNKWOWN" -> "UNKNOWN" (#13675 )	2021-09-21 09:11:26 -04:00
Kamal Raj	a2dec768a2	beit-flax (#13515 ) * beit-flax * updated FLAX_BEIT_MLM_DOCSTRING * removed bool_masked_pos from classification * updated Copyright * code refactoring: x -> embeddings * updated test: rm from_pt * Update docs/source/model_doc/beit.rst * model code dtype updates and other changes according to review * relative_position_bias revert back to pytorch design	2021-09-21 13:34:19 +02:00
Patrick von Platen	48fa42e5d5	Add Speech AutoModels (#13655 ) * upload * correct * correct * correct * finish * up * up * up again	2021-09-21 08:50:33 +02:00
Sylvain Gugger	002a078aff	Dynamically load model code from the Hub (#13467 ) * Dynamic model * Use defensive flag * Style * Doc and arg rename * Arg rename * Add tests * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-09-20 13:59:21 -04:00
Gunjan Chhablani	d8049331dc	Add FNet (#13045 ) * Init FNet * Update config * Fix config * Update model classes * Update tokenizers to use sentencepiece * Fix errors in model * Fix defaults in config * Remove position embedding type completely * Fix typo and take only real numbers * Fix type vocab size in configuration * Add projection layer to embeddings * Fix position ids bug in embeddings * Add minor changes * Add conversion script and remove CausalLM vestiges * Fix conversion script * Fix conversion script * Remove CausalLM Test * Update checkpoint names to dummy checkpoints * Add tokenizer mapping * Fix modeling file and corresponding tests * Add tokenization test file * Add PreTraining model test * Make style and quality * Make tokenization base tests work * Update docs * Add FastTokenizer tests * Fix fast tokenizer special tokens * Fix style and quality * Remove load_tf_weights vestiges * Add FNet to main README * Fix configuration example indentation * Comment tokenization slow test * Fix style * Add changes from review * Fix style * Remove bos and eos tokens from tokenizers * Add tokenizer slow test, TPU transforms, NSP * Add scipy check * Add scipy availabilty check to test * Fix tokenizer and use correct inputs * Remove remaining TODOs * Fix tests * Fix tests * Comment Fourier Test * Uncomment Fourier Test * Change to google checkpoint * Add changes from review * Fix activation function * Fix model integration test * Add more integration tests * Add comparison steps to MLM integration test * Fix style * Add masked tokenization fix * Improve mask tokenization fix * Fix index docs * Add changes from review * Fix issue * Fix failing import in test * some more fixes * correct fast tokenizer * finalize * make style * Remove additional tokenization logic * Set do_lower_case to False * Allow keeping accents * Fix tokenization test * Fix FNet Tokenizer Fast * fix tests * make style * Add tips to FNet docs Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2021-09-20 13:24:30 +02:00
calpt	b518aaf193	Fix GPT2Config parameters in GPT2ModelTester (#13630 )	2021-09-17 15:36:23 -04:00

1308 Commits