HuggingFace_transformer

Author	SHA1	Message	Date
Lysandre	ef3cec0ca5	Release: v4.12.5 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.12.5	2021-11-17 11:36:14 -05:00
Matt	a5211fc59b	Revert "Experimenting with adding proper get_config() and from_config() methods (#14361 )" This reverts commit `e99a2314cd`.	2021-11-17 11:35:59 -05:00
Lysandre	527c763ff6	Release: v4.12.4 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.12.4	2021-11-16 17:25:47 -05:00
Sylvain Gugger	6f40723eb6	Fix gradient_checkpointing backward compatibility (#14408 ) * Fix gradient_checkpointing backward compatibility * Remove needless line * make sure mask prob is big enough and length small enough * Fix tests Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2021-11-16 17:14:10 -05:00
Patrick von Platen	db242aee15	[Wav2Vec2] Make sure that gradient checkpointing is only run if needed (#14407 ) * [Wav2Vec2] Make sure that gradient checkpointing is only run if needed * make fix-copies	2021-11-16 17:14:03 -05:00
Matt	e99a2314cd	Experimenting with adding proper get_config() and from_config() methods (#14361 ) * Experimenting with adding proper get_config() and from_config() methods * Adding a test for get/from config * Fix test for get/from config	2021-11-16 17:13:21 -05:00
Chang Wang	341a059792	enhance rewrite state_dict missing _metadata (#14348 )	2021-11-16 17:12:52 -05:00
Sylvain Gugger	6bf20275dd	Support for TF >= 2.7 (#14345 )	2021-11-16 17:12:19 -05:00
Chang Wang	c8206b4af5	improve rewrite state_dict missing _metadata (#14276 )	2021-11-16 17:10:51 -05:00
Dan Shirron	b6b97c319d	Fix of issue #13327 : Wrong weight initialization for TF t5 model (#14241 ) * Fix of issue #13327: Wrong weight initialization for TF t5 model * run black formatter * fix typo * remove my name tag from comments Co-authored-by: Shirron <dan.shirron@intel.com>	2021-11-16 17:07:40 -05:00
Sylvain Gugger	3ea15d2783	Style Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.12.3	2021-11-02 18:04:04 -04:00
Sylvain Gugger	294a920027	Add maximum check for hf hub	2021-11-02 13:26:02 -04:00
Sylvain Gugger	9ab10fcd52	Release v4.12.3	2021-11-02 13:09:25 -04:00
Sylvain Gugger	872c4f3d44	Bump huggingface_hub	2021-11-02 13:08:51 -04:00
Sylvain Gugger	ac77639a75	Add PushToHubCallback in main init (#14246 )	2021-11-02 13:06:18 -04:00
Lysandre	219137337f	Release: v4.12.2 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.12.2	2021-10-29 14:48:05 -04:00
Nicolas Patry	cde7d78b09	Fixing image segmentation with inference mode. (#14204 ) * Fixing image segmentation for inference mode. * Update src/transformers/pipelines/base.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-10-29 14:47:39 -04:00
Lysandre Debut	e0a5154075	Release v4.12.1 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.12.1	2021-10-29 13:45:16 -04:00
Lysandre	9f3f335924	Torch 1.10 (#14169 ) * Torch 1.10 * torch scatter for 1.10 * style * Skip tests ok	2021-10-29 13:44:46 -04:00
Lysandre	62bf536631	Release v4.12.0 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.12.0	2021-10-28 12:09:49 -04:00
NielsRogge	5f3bf65111	Fix EncoderDecoderModel docs (#14197 ) * Fix docs * Apply suggestions from review + fix bug	2021-10-28 18:01:00 +02:00
NielsRogge	ac12a5ae47	Fix EncoderDecoderModel classes to be more like BART and T5 (#14139 ) * First draft * Make tuple output more readable * Replace assertions by value errors * Make it possible to predict_with_generate for vision and speech models * Adapt Seq2SeqTrainer to work with VisionEncoderDecoder/SpeechEncoderDecoder * Add deprecation warning * Add copied from statements to vision and speech encoder decoders * Fix failing test * Apply @patrickvonplaten's suggestion * Use reshape instead of view for consistency	2021-10-28 15:29:04 +02:00
Anton Lozhkov	1251072f46	Fix SEW-D implementation differences (#14191 ) * Fix SEW-D * Update tests * isort	2021-10-28 16:22:18 +03:00
Anton Lozhkov	78b6a2ecbd	Add audio-classification benchmarking results (#14192 )	2021-10-28 15:59:18 +03:00
NielsRogge	1dc96a760d	Add SegFormer (#14019 ) * First draft * Make style & quality * Improve conversion script * Add print statement to see actual slice * Make absolute tolerance smaller * Fix image classification models * Add post_process_semantic method * Disable padding * Improve conversion script * Rename to ForSemanticSegmentation, add integration test, remove post_process methods * Improve docs * Fix code quality * Fix feature extractor tests * Fix tests for image classification model * Delete file * Add is_torch_available to feature extractor * Improve documentation of feature extractor methods * Apply suggestions from @sgugger's code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply some more suggestions of code review * Rebase with master * Fix rebase issues * Make sure model only outputs hidden states when the user wants to * Apply suggestions from code review * Add pad method * Support padding of 2d images * Add print statement * Add print statement * Move padding method to SegformerFeatureExtractor * Fix issue * Add casting of segmentation maps * Add test for padding * Add small note about padding Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-10-28 08:23:52 -04:00
Stas Bekman	123cce6ffc	[modeling_utils] respect original dtype in _get_resized_lm_head (#14181 ) * respect dtype in _get_resized_lm_head * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * consistency Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-10-27 19:01:50 -07:00
Patrick von Platen	88cd82e801	Update README.md	2021-10-28 02:35:01 +02:00
Patrick von Platen	e118db15d6	Update README.md	2021-10-28 01:59:27 +02:00
Patrick von Platen	01b1466983	[TPU tests] Enable first TPU examples pytorch (#14121 ) * up * up * fix * up * Update examples/pytorch/test_xla_examples.py * correct labels * up * up * up * up * up * up	2021-10-28 01:22:28 +02:00
Anton Lozhkov	232822f36d	Add DistilHuBERT (#14174 ) * Add conversion * Rename * Add an integration test and remove layer_norm * Remove layer_norm from the converter * wording * Fix imports	2021-10-27 20:17:31 +03:00
Lahfa Samy	e5b8ffb848	Replace assert of data/data_collator.py by ValueError (#14131 ) * Replace assert of data_collator.py by ValueError * Replace assert of data_collator.py by ValueError	2021-10-27 12:19:10 -04:00
Anton Lozhkov	25ceb81871	[Pipelines] Fix ASR model types check (#14178 )	2021-10-27 17:17:47 +03:00
Patrick von Platen	6200fd7bbc	[Gradient checkpointing] Enable for Deberta + DebertaV2 + SEW-D (#14175 ) * up * up * finish * up * final changes	2021-10-27 15:47:20 +02:00
Anton Lozhkov	e1dc5afd28	Add SEW CTC models (#14158 ) * Add SEW CTC models * Update paths * Update paths	2021-10-27 12:21:09 +03:00
Lysandre Debut	1e53faeb2e	Fix gelu test for torch 1.10 (#14167 )	2021-10-26 22:20:51 -04:00
Kamal Raj	8ddbfe9752	switch to inference_mode from no_gard (#13667 ) * switch to inference_mode from no_gard faster inference * added switch to support older version of pytorch	2021-10-26 18:02:58 -04:00
Emanuel Huber	ebd48c6de5	Replace assertions with ValueError exception (#14142 ) Updated masked-language modeling examples in pytorch with convention defined by #12789	2021-10-26 17:14:29 -04:00
Matthew Goldey	42bfb83d74	fix typos in error messages in speech recognition example and modelcard.py (#14166 ) * specify the text column name in the error message * pluralize the word fields	2021-10-26 16:36:26 -04:00
Jangwon Park	41dad89f70	chore: typo on ner accelerate example code (#14150 )	2021-10-26 16:23:41 -04:00
Lysandre	27c888db6c	Fix copies	2021-10-26 15:48:28 -04:00
Jay Zhang	3f23634a17	[ONNX] Add symbolic function for XSoftmax op for exporting to ONNX. (#14013 ) * Add symbolic function for XSoftmax op for exporting to ONNX. * Fix format issues. * Fix a CI issue relative to copies.	2021-10-26 15:25:02 -04:00
Patrick von Platen	9f3aa46f45	Add Unispeech & Unispeech-SAT (#13963 ) * unispeech * add copy from * remove hubert copy from * finish for today * add unispeech-sat * adapt more * up * up * up * up * add modeling * add tests * up * up * finish * up * Apply suggestions from code review * up * up * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * up * up Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-10-26 18:59:58 +02:00
Patrick von Platen	9799f4e150	Update README.md	2021-10-26 18:59:25 +02:00
Stas Bekman	bfd8176636	[megatron_gpt2] dynamic gelu, add tokenizer, save config (#13928 ) * [megatron_gpt2] dynamic gelu, add tokenizer, save config * cleanup * Update src/transformers/models/megatron_gpt2/convert_megatron_gpt2_checkpoint.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-10-26 09:09:54 -07:00
Sergio Valcarcel Macua	919a964b8f	Include Keras tensor in the allowed types (#14155 ) * Include KerasTensor in allowed types - This allows propagating symbolic tensors through TFBert models and layers' call(), which allows converting the subclass models to functional models. * Style pass Co-authored-by: Sergio Valcarcel Macua <sergiov@graphcore.ai> Co-authored-by: matt <rocketknight1@gmail.com>	2021-10-26 15:08:59 +01:00
Patrick von Platen	f5ed19f57d	[Speech Recognition] - Distributed training: Make sure vocab file removal and creation don't interfer (#14161 ) * up * better	2021-10-26 15:59:33 +02:00
Yih-Dar	840fc8dbca	Add vision_encoder_decoder to models/__init__.py (#14151 ) * Add vision_encoder_decoder * Update _ignore_modules in get_model_modules() Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2021-10-26 07:36:17 -04:00
Patrick von Platen	e248e9b042	up (#14154 )	2021-10-26 13:08:18 +02:00
Thomas Chaigneau	1f60df81b2	Add Camembert to models exportable with ONNX (#14059 ) Add Camembert to models exportable with ONNX Co-authored-by: Thomas.Chaigneau <thomas.chaigneau@arkea.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>	2021-10-26 11:22:22 +02:00
Patrick von Platen	0c3174c758	Add TF<>PT and Flax<>PT everywhere (#14047 ) * up * up * up * up * up * up * up * add clip * fix clip PyTorch * fix clip PyTorch * up * up * up * up * up * up * up	2021-10-25 23:55:08 +02:00

1 2 3 4 5 ...

8246 Commits