HuggingFace_transformer

Author	SHA1	Message	Date
Joao Gante	c2f8eaf6bc	TF: unpack inputs on Convbert, GPTJ, LED, and templates (#16491 ) * Add unpack_inputs to remaining models * remove stray use of inputs in the templates; fix tf.debugging of attn masks	2022-03-30 17:12:27 +01:00
tomerip	ae189ef991	Add support for exporting GPT-J to ONNX-TRT (#16492 ) Add support for exporting GPT-J to ONNX-TRT Co-authored-by: Tomer Stav <stavt@amazon.com>	2022-03-30 17:56:03 +02:00
dctelus	d04adc3521	Add length to PreTrainedTokenizer train_new_from_iterator (#16493 )	2022-03-30 11:41:04 -04:00
Aditya Kane	147c816685	Nit: MCSCOCO -> MS COCO (#16481 )	2022-03-30 10:06:32 -04:00
Dahlbomii	ffd19ee1de	TF GPT-J Type hints and TF decorator (#16488 ) * Type hints and TF decorator added * Type hints and TF decorator added * make style Co-authored-by: matt <rocketknight1@gmail.com>	2022-03-30 14:03:54 +01:00
Antoni Baum	277d49a590	Do not initialize `torch.distributed` process group if one is already initailized (#16487 ) * Do not initialize torch process group twice * Apply suggestions from code review	2022-03-29 19:07:31 -04:00
Yih-Dar	2b483230a1	Raise diff tolerance value for TFViTMAEModelTest (#16483 ) * Raise diff tolerance value Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 22:12:27 +02:00
Christopher Akiki	ee18d4d2a9	TF GPT2: clearer model variable naming with @unpack_inputs (#16311 ) * add unpack_inputs decorator to Main Layer * add unpack_inputs decorator to Model * add unpack_inputs decorator to LMHead Model * add unpack_inputs decorator to Double Head Model * add unpack_inputs decorator to Sequence Classification Model * run fixup recipe * make unpack_inputs the first decorator	2022-03-29 20:35:25 +01:00
Sander Land	d7c8ce57d4	Avoid accessing .dataset of a DataLoader in Trainer (#16451 ) * Avoid accessing .dataset of a dataloader * style * fix * cleaning up, reverting some misunderstandings * black * add train_dataset argument to get_train_dataloader, and fix other instances of length checks * flake8 * address comments * fix bug * cleanup * add test * Update tests/trainer/test_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * under torch * merge * stylistic suggestion Co-authored-by: Sander Land <sander@chatdesk.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-29 15:00:18 -04:00
akashe	781af7362b	added typehints for RAG pytorch models (#16416 )	2022-03-29 18:24:25 +01:00
Sayak Paul	5b40a37bc4	Add TF ViT MAE (#16255 ) * ported TFViTMAEIntermediate and TFViTMAEOutput. * added TFViTMAEModel and TFViTMAEDecoder. * feat: added a noise argument in the implementation for reproducibility. * feat: vit mae models with an additional noise argument for reproducibility. Co-authored-by: ariG23498 <aritra.born2fly@gmail.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 18:24:15 +01:00
Joao Gante	7a9ef8181c	TF: properly handle kwargs in encoder_decoder architectures (#16465 ) * properly handle kwargs in encoder_decoder architectures * make fixup	2022-03-29 18:17:47 +01:00
Dan Tegzes	0540d1b6c0	Add type hints for UniSpeech (#16399 ) * Add type hints for UniSpeech * Added type hints for UniSpeechSat * Added type hints for Wave2Vec2 (PT) * Added type hints for models dependent of wave2vec	2022-03-29 18:02:46 +01:00
Wesley A. Cheng	875e07a9e3	[doc] Fix missing trainer import (#16469 )	2022-03-29 18:57:43 +02:00
Yih-Dar	6358a4c8ec	Add TF vision model code samples (#16477 ) * add code samples Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 18:57:16 +02:00
Wesley A. Cheng	3015d12bfb	fix wrong variable name (#16467 )	2022-03-29 18:55:40 +02:00
Sylvain Gugger	b62ac4d240	Fix example test and test_fetcher for examples (#16478 )	2022-03-29 12:21:19 -04:00
Yih-Dar	86cff21cf6	Fix some TF GPT-J CI testings (#16454 ) * Fix for test_mixed_precision * Fix test_saved_model_creation by using shape_list instead of shape * skit test_model_from_pretrained on GPU for now to avoid GPU OOM * skip test_gptj_sample_max_time for now Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 18:04:20 +02:00
Yih-Dar	aebca696af	Fix missing output_attentions in PT/Flax equivalence test (#16271 ) * fix - set output_attentions to True * Update tests/test_modeling_flax_common.py * update for has_attentions * overwrite check_outputs in FlaxBigBirdModelTest Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-03-29 17:51:48 +02:00
Steven Liu	45abb37ac9	Remove duplicate mLuke (#16460 ) * Remove duplicate mLuke * 🖍 apply feedback	2022-03-29 10:34:30 -05:00
Eldar Kurtic	5216607f8a	[MNLI example] Prevent overwriting matched with mismatched metrics (#16475 ) * Prevent overwriting matched with mismatched metrics * Fix style	2022-03-29 10:38:14 -04:00
Arnaud Stiegler	ed31ab3f10	Adding DocTest to TrOCR (#16398 ) * docstring still WIP \| adding to documentation_tests * clean version \| passes tests * adding to documentation_test * adding forward for training pass * make fixup applied * address comments * fix doctest * apply make fixup * remove additional blank * fix file to have correct split for prepare_for_doc_test * Update src/transformers/models/trocr/modeling_trocr.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * address comments * changing text \| adding loss check \| make fixup * make fixup * Update src/transformers/models/trocr/modeling_trocr.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Update src/transformers/models/trocr/modeling_trocr.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Update src/transformers/models/trocr/modeling_trocr.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * make fixup Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2022-03-29 16:19:06 +02:00
Suraj Patil	85295621f1	Fix blenderbot conversion script (#16472 )	2022-03-29 11:32:13 +02:00
lewtun	c85547af2b	Remove kwargs argument from IBERT MLM forward pass (#16449 )	2022-03-28 16:37:56 +02:00
Fernando	da936942b0	Translation from english to spanish of file pipeline_tutorial.mdx (#16149 ) * Add the translation from English to Spanish of the pipeline_tutorial.mdx file * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/pipeline_tutorial.mdx Fix typo Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> Co-authored-by: fernando <fernando@gethitch.ai> Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>	2022-03-28 10:31:19 -04:00
NielsRogge	979b039c89	Add DPT (#15991 ) * First draft * More improvements * Add fusion blocks * Make conversion script work for dpt_large * Make conversion script work * Improve implementation * Improve conversion script * Add DPTForSemanticSegmentation * Make conversion work for semantic segmentation * Add tests * Remove print statements * First draft * Redesign neck * Improve tests * Improve implementation some more * Make neck output list of tensors * Improve neck and feature extractor * Fix integration tests * Make more tests pass * Make all tests pass * Add missing config archive map * Add in_index attribute to make heads accept list of tensors * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply some more suggestions * Add copied from statements * Remove assert * Apply suggestions from code review * Apply suggestions from code review * Remove DPTInterpolate in favor of nn.Upsample * Add comments * Apply suggestions from code review * Apply suggestions from code review * Add proposed design * Update design * Add DPTReassembleLayer * Add DPTFeatureFusionStage * Apply more suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * Fix rebase * Update in_index and out_indices * Fix conversion script * Fix code quality * Add model to toctree and use DepthEstimatorOutput * Fix rebase * Fix code examples * Improve code * Fix copied from statements * Apply suggestions from code review * Remove compute_loss method * Apply suggestions from code review * Fix documentation tests file * Remove test.py file * Improve doc example Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>	2022-03-28 16:28:10 +02:00
Sanchit Gandhi	7ca4633555	[FlaxSpeechEncoderDecoderModel] Ensure Input and Output Word Embeddings Are Not Tied (#16444 ) * [FlaxSpeechEncoderDecoderModel] Ensure Input and Output Word Embeddings Are Not Tied * rebase	2022-03-28 14:14:10 +02:00
Jaesun Park	e0ac72b7bd	Fix PerceiverMLP and test (#16405 ) Co-authored-by: Jaesun Park <jaesun.park1@navercorp.com>	2022-03-28 14:06:48 +02:00
Sylvain Gugger	473709fc76	Use doc builder styler (#16412 ) * Config update * Use doc-builder styler * Cleanup * Adapt import * We need it there too!	2022-03-28 07:45:18 -04:00
Yongrae Jo	8049dfa427	Update run_t5_mlm_flax.py (#16421 ) Fix typo in comment: proprocessed -> preprocessed	2022-03-28 06:00:53 -04:00
Sanchit Gandhi	925fc57b70	[Flax] Improve Robustness of Back-Prop Tests (#16418 ) * [Flax] Improve Robustness of Back-Prop Tests * check equality of logits/outputs * make fixup	2022-03-28 11:56:54 +02:00
Shang Zhang	7ecbb9c5e4	QDQBert example update (#16395 ) * update Dockerfile and utils_qa * Update README.md	2022-03-28 05:47:52 -04:00
Julien Chaumond	f6f6866e9e	`cached_download ∘ hf_hub_url` is `hf_hub_download` (#16375 )	2022-03-28 05:43:39 -04:00
Kurian Benoy	c88ff66cc8	Fix broken links (#16113 ) * Update marian.mdx * Update marian.mdx * Update docs/source/model_doc/marian.mdx Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update marian.mdx Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2022-03-28 05:38:17 -04:00
Jia	342ff6eb41	Update comments in class BatchEncoding (#15932 )	2022-03-28 05:19:12 -04:00
Nathan Glenn	e02f95b229	remove references to PDF reading via PIL (#15293 ) * fix confusing PIL instructions As stated in the documentation [here](https://pillow.readthedocs.io/en/stable/handbook/image-file-formats.html?highlight=pdf#write-only-formats), PIL can only write PDF's, not read them. Remove references to reading PDF's via PIL from this page to avoid confusion. * mention PDF in doc examples using PIL Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Be explicit: PDFs must be converted to images * fix formatting Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>	2022-03-28 05:00:29 -04:00
Shamima	3dc8242716	TF: removed inputs_processing and replaced with decorator in lxmert (#16414 )	2022-03-27 18:09:15 +01:00
Steven Liu	b320d87ece	Create concept guide section (#16369 ) * ✨ create concept guide section * 🖍 make fixup * 🖍 apply feedback Co-authored-by: Steven <stevhliu@gmail.com>	2022-03-25 14:51:43 -05:00
Daniel Stancl	ed2ee373d0	Add TF implementation of GPT-J (#15623 ) * Initial commit * Add TFGPTJModel * Fix a forward pass * Add TFGPTJCausalLM * Add TFGPTJForSequenceClassification * Add TFGPTJForQuestionAnswering * Fix docs * Deal with TF dynamic shapes * Add Loss parents to models * Adjust split and merge heads to handle 4 and 5-dim tensors * Update outputs for @tooslow tests	2022-03-25 19:27:19 +00:00
Sanchit Gandhi	aa4c0a86dc	Fix Typo in Argument of FlaxWav2Vec2ForPreTrainingModule (#16084 )	2022-03-25 17:49:37 +01:00
Sanchit Gandhi	e231c72906	[FlaxSpeechEncoderDecoder] Fix feature extractor gradient test (#16407 )	2022-03-25 17:46:53 +01:00
lewtun	a97f3150c4	Add ONNX support for Blenderbot and BlenderbotSmall (#15875 ) * Add ONNX support for Blenderbot * Add BlenderbotSmall ONNX configuration * Update serialization table	2022-03-25 17:04:43 +01:00
Sylvain Gugger	b473617d63	Checkpoint sharding (#16343 ) * Sharded checkpoint support * Handle distant sharded checkpoints * Add tests * TODO is done * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Fix docstring * Add example and format * Address review comments * More review comments * End of merge * Revert unintentional change * VsCode what did you do? * Style * Changes * Address final comments * Quality * Moar tests * Move import beneath is_pt_available Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2022-03-25 11:59:25 -04:00
Matt	7fa7408b26	Terminate previous pushes when we get to the final push (#16409 )	2022-03-25 15:47:05 +00:00
Sylvain Gugger	867f3950fa	Rename master to main for notebooks links and leftovers (#16397 )	2022-03-25 09:12:23 -04:00
Atharva Ingle	7e7490473e	fixed typo from enable to disable in disable_progress_bar function (#16406 )	2022-03-25 09:07:43 -04:00
Sylvain Gugger	088c1880b7	Big file_utils cleanup (#16396 ) * Big file_utils cleanup * This one still needs to be treated separately	2022-03-25 07:25:20 -04:00
Michael Benayoun	2b23e0801a	Make FeaturesManager.get_model_from_feature a static method (#16357 )	2022-03-25 11:35:48 +01:00
NielsRogge	aa6cfe9c4b	Rename to SemanticSegmenterOutput (#15849 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-24 20:44:15 +01:00
Yi Heng Lim	70a9bc69a8	Added type hints (#16389 ) * Added type hints for PyTorch T5 model * removed a type hint * ran make style * added type hints for ibert pytorch * added type hints for lxmert pytorch * removed kwargs type hint and fixed arguments order	2022-03-24 19:14:34 +00:00

1 2 3 4 5 ...

9425 Commits