Joao Gante
c2f8eaf6bc
TF: unpack inputs on Convbert, GPTJ, LED, and templates ( #16491 )
...
* Add unpack_inputs to remaining models
* remove stray use of inputs in the templates; fix tf.debugging of attn masks
2022-03-30 17:12:27 +01:00
tomerip
ae189ef991
Add support for exporting GPT-J to ONNX-TRT ( #16492 )
...
Add support for exporting GPT-J to ONNX-TRT
Co-authored-by: Tomer Stav <stavt@amazon.com >
2022-03-30 17:56:03 +02:00
dctelus
d04adc3521
Add length to PreTrainedTokenizer train_new_from_iterator ( #16493 )
2022-03-30 11:41:04 -04:00
Aditya Kane
147c816685
Nit: MCSCOCO -> MS COCO ( #16481 )
2022-03-30 10:06:32 -04:00
Dahlbomii
ffd19ee1de
TF GPT-J Type hints and TF decorator ( #16488 )
...
* Type hints and TF decorator added
* Type hints and TF decorator added
* make style
Co-authored-by: matt <rocketknight1@gmail.com >
2022-03-30 14:03:54 +01:00
Antoni Baum
277d49a590
Do not initialize torch.distributed process group if one is already initailized ( #16487 )
...
* Do not initialize torch process group twice
* Apply suggestions from code review
2022-03-29 19:07:31 -04:00
Yih-Dar
2b483230a1
Raise diff tolerance value for TFViTMAEModelTest ( #16483 )
...
* Raise diff tolerance value
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2022-03-29 22:12:27 +02:00
Christopher Akiki
ee18d4d2a9
TF GPT2: clearer model variable naming with @unpack_inputs ( #16311 )
...
* add unpack_inputs decorator to Main Layer
* add unpack_inputs decorator to Model
* add unpack_inputs decorator to LMHead Model
* add unpack_inputs decorator to Double Head Model
* add unpack_inputs decorator to Sequence Classification Model
* run fixup recipe
* make unpack_inputs the first decorator
2022-03-29 20:35:25 +01:00
Sander Land
d7c8ce57d4
Avoid accessing .dataset of a DataLoader in Trainer ( #16451 )
...
* Avoid accessing .dataset of a dataloader
* style
* fix
* cleaning up, reverting some misunderstandings
* black
* add train_dataset argument to get_train_dataloader, and fix other instances of length checks
* flake8
* address comments
* fix bug
* cleanup
* add test
* Update tests/trainer/test_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* under torch
* merge
* stylistic suggestion
Co-authored-by: Sander Land <sander@chatdesk.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2022-03-29 15:00:18 -04:00
akashe
781af7362b
added typehints for RAG pytorch models ( #16416 )
2022-03-29 18:24:25 +01:00
Sayak Paul
5b40a37bc4
Add TF ViT MAE ( #16255 )
...
* ported TFViTMAEIntermediate and TFViTMAEOutput.
* added TFViTMAEModel and TFViTMAEDecoder.
* feat: added a noise argument in the implementation for reproducibility.
* feat: vit mae models with an additional noise argument for reproducibility.
Co-authored-by: ariG23498 <aritra.born2fly@gmail.com >
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2022-03-29 18:24:15 +01:00
Joao Gante
7a9ef8181c
TF: properly handle kwargs in encoder_decoder architectures ( #16465 )
...
* properly handle kwargs in encoder_decoder architectures
* make fixup
2022-03-29 18:17:47 +01:00
Dan Tegzes
0540d1b6c0
Add type hints for UniSpeech ( #16399 )
...
* Add type hints for UniSpeech
* Added type hints for UniSpeechSat
* Added type hints for Wave2Vec2 (PT)
* Added type hints for models dependent of wave2vec
2022-03-29 18:02:46 +01:00
Wesley A. Cheng
875e07a9e3
[doc] Fix missing trainer import ( #16469 )
2022-03-29 18:57:43 +02:00
Yih-Dar
6358a4c8ec
Add TF vision model code samples ( #16477 )
...
* add code samples
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2022-03-29 18:57:16 +02:00
Wesley A. Cheng
3015d12bfb
fix wrong variable name ( #16467 )
2022-03-29 18:55:40 +02:00
Sylvain Gugger
b62ac4d240
Fix example test and test_fetcher for examples ( #16478 )
2022-03-29 12:21:19 -04:00
Yih-Dar
86cff21cf6
Fix some TF GPT-J CI testings ( #16454 )
...
* Fix for test_mixed_precision
* Fix test_saved_model_creation by using shape_list instead of shape
* skit test_model_from_pretrained on GPU for now to avoid GPU OOM
* skip test_gptj_sample_max_time for now
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2022-03-29 18:04:20 +02:00
Yih-Dar
aebca696af
Fix missing output_attentions in PT/Flax equivalence test ( #16271 )
...
* fix - set output_attentions to True
* Update tests/test_modeling_flax_common.py
* update for has_attentions
* overwrite check_outputs in FlaxBigBirdModelTest
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
Co-authored-by: Suraj Patil <surajp815@gmail.com >
2022-03-29 17:51:48 +02:00
Steven Liu
45abb37ac9
Remove duplicate mLuke ( #16460 )
...
* Remove duplicate mLuke
* 🖍 apply feedback
2022-03-29 10:34:30 -05:00
Eldar Kurtic
5216607f8a
[MNLI example] Prevent overwriting matched with mismatched metrics ( #16475 )
...
* Prevent overwriting matched with mismatched metrics
* Fix style
2022-03-29 10:38:14 -04:00
Arnaud Stiegler
ed31ab3f10
Adding DocTest to TrOCR ( #16398 )
...
* docstring still WIP | adding to documentation_tests
* clean version | passes tests
* adding to documentation_test
* adding forward for training pass
* make fixup applied
* address comments
* fix doctest
* apply make fixup
* remove additional blank
* fix file to have correct split for prepare_for_doc_test
* Update src/transformers/models/trocr/modeling_trocr.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com >
* address comments
* changing text | adding loss check | make fixup
* make fixup
* Update src/transformers/models/trocr/modeling_trocr.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com >
* Update src/transformers/models/trocr/modeling_trocr.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com >
* Update src/transformers/models/trocr/modeling_trocr.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com >
* make fixup
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com >
2022-03-29 16:19:06 +02:00
Suraj Patil
85295621f1
Fix blenderbot conversion script ( #16472 )
2022-03-29 11:32:13 +02:00
lewtun
c85547af2b
Remove kwargs argument from IBERT MLM forward pass ( #16449 )
2022-03-28 16:37:56 +02:00
Fernando
da936942b0
Translation from english to spanish of file pipeline_tutorial.mdx ( #16149 )
...
* Add the translation from English to Spanish of the pipeline_tutorial.mdx file
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
* Update docs/source_es/pipeline_tutorial.mdx
Fix typo
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
Co-authored-by: fernando <fernando@gethitch.ai >
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com >
2022-03-28 10:31:19 -04:00
NielsRogge
979b039c89
Add DPT ( #15991 )
...
* First draft
* More improvements
* Add fusion blocks
* Make conversion script work for dpt_large
* Make conversion script work
* Improve implementation
* Improve conversion script
* Add DPTForSemanticSegmentation
* Make conversion work for semantic segmentation
* Add tests
* Remove print statements
* First draft
* Redesign neck
* Improve tests
* Improve implementation some more
* Make neck output list of tensors
* Improve neck and feature extractor
* Fix integration tests
* Make more tests pass
* Make all tests pass
* Add missing config archive map
* Add in_index attribute to make heads accept list of tensors
* Apply suggestions from code review
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Apply some more suggestions
* Add copied from statements
* Remove assert
* Apply suggestions from code review
* Apply suggestions from code review
* Remove DPTInterpolate in favor of nn.Upsample
* Add comments
* Apply suggestions from code review
* Apply suggestions from code review
* Add proposed design
* Update design
* Add DPTReassembleLayer
* Add DPTFeatureFusionStage
* Apply more suggestions from code review
* Apply suggestions from code review
* Apply suggestions from code review
* Fix rebase
* Update in_index and out_indices
* Fix conversion script
* Fix code quality
* Add model to toctree and use DepthEstimatorOutput
* Fix rebase
* Fix code examples
* Improve code
* Fix copied from statements
* Apply suggestions from code review
* Remove compute_loss method
* Apply suggestions from code review
* Fix documentation tests file
* Remove test.py file
* Improve doc example
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home >
2022-03-28 16:28:10 +02:00
Sanchit Gandhi
7ca4633555
[FlaxSpeechEncoderDecoderModel] Ensure Input and Output Word Embeddings Are **Not** Tied ( #16444 )
...
* [FlaxSpeechEncoderDecoderModel] Ensure Input and Output Word Embeddings Are **Not** Tied
* rebase
2022-03-28 14:14:10 +02:00
Jaesun Park
e0ac72b7bd
Fix PerceiverMLP and test ( #16405 )
...
Co-authored-by: Jaesun Park <jaesun.park1@navercorp.com >
2022-03-28 14:06:48 +02:00
Sylvain Gugger
473709fc76
Use doc builder styler ( #16412 )
...
* Config update
* Use doc-builder styler
* Cleanup
* Adapt import
* We need it there too!
2022-03-28 07:45:18 -04:00
Yongrae Jo
8049dfa427
Update run_t5_mlm_flax.py ( #16421 )
...
Fix typo in comment: proprocessed -> preprocessed
2022-03-28 06:00:53 -04:00
Sanchit Gandhi
925fc57b70
[Flax] Improve Robustness of Back-Prop Tests ( #16418 )
...
* [Flax] Improve Robustness of Back-Prop Tests
* check equality of logits/outputs
* make fixup
2022-03-28 11:56:54 +02:00
Shang Zhang
7ecbb9c5e4
QDQBert example update ( #16395 )
...
* update Dockerfile and utils_qa
* Update README.md
2022-03-28 05:47:52 -04:00
Julien Chaumond
f6f6866e9e
cached_download ∘ hf_hub_url is hf_hub_download (#16375 )
2022-03-28 05:43:39 -04:00
Kurian Benoy
c88ff66cc8
Fix broken links ( #16113 )
...
* Update marian.mdx
* Update marian.mdx
* Update docs/source/model_doc/marian.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
* Update marian.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
2022-03-28 05:38:17 -04:00
Jia
342ff6eb41
Update comments in class BatchEncoding ( #15932 )
2022-03-28 05:19:12 -04:00
Nathan Glenn
e02f95b229
remove references to PDF reading via PIL ( #15293 )
...
* fix confusing PIL instructions
As stated in the documentation
[here](https://pillow.readthedocs.io/en/stable/handbook/image-file-formats.html?highlight=pdf#write-only-formats ),
PIL can only write PDF's, not read them. Remove references to reading
PDF's via PIL from this page to avoid confusion.
* mention PDF in doc examples using PIL
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com >
* Be explicit: PDFs must be converted to images
* fix formatting
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com >
2022-03-28 05:00:29 -04:00
Shamima
3dc8242716
TF: removed inputs_processing and replaced with decorator in lxmert ( #16414 )
2022-03-27 18:09:15 +01:00
Steven Liu
b320d87ece
Create concept guide section ( #16369 )
...
* ✨ create concept guide section
* 🖍 make fixup
* 🖍 apply feedback
Co-authored-by: Steven <stevhliu@gmail.com >
2022-03-25 14:51:43 -05:00
Daniel Stancl
ed2ee373d0
Add TF implementation of GPT-J ( #15623 )
...
* Initial commit
* Add TFGPTJModel
* Fix a forward pass
* Add TFGPTJCausalLM
* Add TFGPTJForSequenceClassification
* Add TFGPTJForQuestionAnswering
* Fix docs
* Deal with TF dynamic shapes
* Add Loss parents to models
* Adjust split and merge heads to handle 4 and 5-dim tensors
* Update outputs for @tooslow tests
2022-03-25 19:27:19 +00:00
Sanchit Gandhi
aa4c0a86dc
Fix Typo in Argument of FlaxWav2Vec2ForPreTrainingModule ( #16084 )
2022-03-25 17:49:37 +01:00
Sanchit Gandhi
e231c72906
[FlaxSpeechEncoderDecoder] Fix feature extractor gradient test ( #16407 )
2022-03-25 17:46:53 +01:00
lewtun
a97f3150c4
Add ONNX support for Blenderbot and BlenderbotSmall ( #15875 )
...
* Add ONNX support for Blenderbot
* Add BlenderbotSmall ONNX configuration
* Update serialization table
2022-03-25 17:04:43 +01:00
Sylvain Gugger
b473617d63
Checkpoint sharding ( #16343 )
...
* Sharded checkpoint support
* Handle distant sharded checkpoints
* Add tests
* TODO is done
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com >
* Fix docstring
* Add example and format
* Address review comments
* More review comments
* End of merge
* Revert unintentional change
* VsCode what did you do?
* Style
* Changes
* Address final comments
* Quality
* Moar tests
* Move import beneath is_pt_available
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com >
2022-03-25 11:59:25 -04:00
Matt
7fa7408b26
Terminate previous pushes when we get to the final push ( #16409 )
2022-03-25 15:47:05 +00:00
Sylvain Gugger
867f3950fa
Rename master to main for notebooks links and leftovers ( #16397 )
2022-03-25 09:12:23 -04:00
Atharva Ingle
7e7490473e
fixed typo from enable to disable in disable_progress_bar function ( #16406 )
2022-03-25 09:07:43 -04:00
Sylvain Gugger
088c1880b7
Big file_utils cleanup ( #16396 )
...
* Big file_utils cleanup
* This one still needs to be treated separately
2022-03-25 07:25:20 -04:00
Michael Benayoun
2b23e0801a
Make FeaturesManager.get_model_from_feature a static method ( #16357 )
2022-03-25 11:35:48 +01:00
NielsRogge
aa6cfe9c4b
Rename to SemanticSegmenterOutput ( #15849 )
...
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local >
2022-03-24 20:44:15 +01:00
Yi Heng Lim
70a9bc69a8
Added type hints ( #16389 )
...
* Added type hints for PyTorch T5 model
* removed a type hint
* ran make style
* added type hints for ibert pytorch
* added type hints for lxmert pytorch
* removed kwargs type hint and fixed arguments order
2022-03-24 19:14:34 +00:00