Lysandre Debut
ec47baeba2
2022 is the year of multi-modality ( #14610 )
...
* 2022 is the year of multi-modality
* Small fix
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com >
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com >
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com >
* Apply suggestions from code review
* Apply to documentation index
* Apply suggestions from code review
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
* Update README.md
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
* Apply suggestions from code review
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com >
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com >
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com >
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2021-12-03 11:35:44 -05:00
Daniel Stancl
50d909be28
[Flax] Add FlaxBlenderbotSmall ( #14576 )
...
* [WIP] Add FlaxBlenderbotSmall
* Revert some unintentionally changed files
Revert some unintentionally files changed by improperly filled cookiecutter instructions.
* Fix repo consistency
* Fix Flax-PT equivalence
* Apply suggestions from code review
* Update index.mdx
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com >
2021-12-02 14:21:48 +05:30
Mishig Davaadorj
275402bf2b
Update doc img links ( #14593 )
...
* Update doc img links
* Rename toctree.yml -> _toctree.yml (#14594 )
* Update doc img links
* Update performance.md img link
2021-12-02 09:01:35 +01:00
Mishig Davaadorj
4f68de625c
Rename toctree.yml -> _toctree.yml ( #14594 )
2021-12-02 08:58:39 +01:00
Stas Bekman
fbe278c76c
[doc] bf16/tf32 guide ( #14579 )
...
* [doc] bf16/tf32 guide
* expand
* expand
* Update docs/source/performance.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-12-01 14:18:58 -08:00
Sylvain Gugger
4df7d05a87
Doc new front ( #14590 )
...
* Convert PretrainedConfig doc to Markdown
* Use syntax
* Add necessary doc files (#14496 )
* Doc fixes (#14499 )
* Fixes for the new front
* Convert DETR file for table
* Title is needed
* Simplify a bit
* Even simpler
* Remove imports
* Fix typo in toctree (#14516 )
* Fix checkpoints badge
* Update versions.yml format (#14517 )
* Doc new front github actions (#14512 )
* Doc new front github actions
* Fix docstring
* Fix feature extraction utils import (#14515 )
* Address Julien's comments
* Push to doc-builder
* Ready for merge
* Remove old build and deploy
* Doc misc fixes (#14583 )
* Rm versions.yml from doc
* Fix converting.rst
* Rm pretrained_models from toctree
* Fix index links (#14567 )
* Fix links in README
* Localized READMEs
* Fix copy script
* Fix find doc script
* Update README_ko.md
Co-authored-by: Julien Chaumond <julien@huggingface.co >
Co-authored-by: Julien Chaumond <julien@huggingface.co >
* Adapt build command to new CLI tools (#14578 )
* Fix typo
* Fix doc interlinks (#14589 )
* Convert PretrainedConfig doc to Markdown
* Use syntax
* Rm pattern <[a-z]+(.html).*>
* Rm huggingface.co/transformers/master
* Rm .html
* Rm .html from index.mdx
* Rm .html from model_summary.rst
* Update index.mdx rm html
* Update remove .html
* Fix inner doc links
* Fix interlink in preprocssing.rst
* Update pr_checks
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Convert PretrainedConfig doc to Markdown
* Use syntax
* Add necessary doc files (#14496 )
* Doc fixes (#14499 )
* Fixes for the new front
* Convert DETR file for table
* Title is needed
* Simplify a bit
* Even simpler
* Remove imports
* Fix checkpoints badge
* Fix typo in toctree (#14516 )
* Update versions.yml format (#14517 )
* Doc new front github actions (#14512 )
* Doc new front github actions
* Fix docstring
* Fix feature extraction utils import (#14515 )
* Address Julien's comments
* Push to doc-builder
* Ready for merge
* Remove old build and deploy
* Doc misc fixes (#14583 )
* Rm versions.yml from doc
* Fix converting.rst
* Rm pretrained_models from toctree
* Fix index links (#14567 )
* Fix links in README
* Localized READMEs
* Fix copy script
* Fix find doc script
* Update README_ko.md
Co-authored-by: Julien Chaumond <julien@huggingface.co >
Co-authored-by: Julien Chaumond <julien@huggingface.co >
* Adapt build command to new CLI tools (#14578 )
* Fix typo
* Fix doc interlinks (#14589 )
* Convert PretrainedConfig doc to Markdown
* Use syntax
* Rm pattern <[a-z]+(.html).*>
* Rm huggingface.co/transformers/master
* Rm .html
* Rm .html from index.mdx
* Rm .html from model_summary.rst
* Update index.mdx rm html
* Update remove .html
* Fix inner doc links
* Fix interlink in preprocssing.rst
* Update pr_checks
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Styling
Co-authored-by: Mishig Davaadorj <mishig.davaadorj@coloradocollege.edu >
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
Co-authored-by: Julien Chaumond <julien@huggingface.co >
2021-12-01 14:13:02 -05:00
Suraj Patil
4c0dd199c8
FlaxGPTJ ( #14396 )
...
* add flax gptj
* no bias in attention dense
* no wpe
* fix rotary embeddings
* fix rotary embeds
* fix rotray embeds
* quality
* doc and quality
* fix equivalence tests
2021-12-01 10:57:39 +05:30
Suraj Patil
fc1d97f29d
VisionTextDualEncoder ( #13511 )
...
* init vision_text_dual_encoder
* fix merge
* remove extra heads
* fix tests
* remove VISION_TEXT_DUAL_ENCODER_PRETRAINED_CONFIG_ARCHIVE_MAP
* remove archive map
* fix imports
* fix more imports
* fix init
* delete tokenizers
* fix imports
* clean
* support clip's vision model
* handle None config
* begin tests
* more test and few fixes
* warn about newly init weights
* more tests
* add loss to model
* remove extra classes from doc
* add processor
* doc and small fixes
* add start docstr
* update flax model
* flax tests
* more flax tests
* doc
* quality
* doc and quality
* fix doc
* doc
* remove comments
* update warning
* quality
* fix docs
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* replace asserts, fix imports
* update imports
* fix import
* address some review comments
* fix check
* reduce tolerance
* fix test
* add flax integration test
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* address Sylvain's comments
* fix style
* add pt_flax_equivalence test in PT tests
* add pt integration test
* update test
* use pre-trained checkpoint in examples
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-11-30 22:21:48 +05:30
Daniel Stancl
faacd74729
[Flax] Add FlaxBlenderbot ( #13633 )
...
* Init Flax implementation for Blenderbot
* Add a majority of stuff except for tests
* make style quality
* Add tests and fix some bugs
* Add tests
* Clean source code and fix some bugs
* Fix copies and docs
* Fix jax device condition for tests
* Fix layer norm in the encoder
* Fix a few typos in the test file
* make fix-copies
* make fix-copies
* fix layer norm
* Fix Flax params dtype (#13090 )
* Fix PR reference (#13098 )
* make fix-copies
* Update tests/test_modeling_flax_blenderbot.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Suraj Patil <surajp815@gmail.com >
2021-11-30 17:36:54 +05:30
Kamal Raj
c468a87a69
Tapas tf ( #13393 )
...
* TF Tapas first commit
* updated docs
* updated logger message
* updated pytorch weight conversion
script to support scalar array
* added use_cache to tapas model config to
work properly with tf input_processing
* 1. rm embeddings_sum
2. added # Copied
3. + TFTapasMLMHead
4. and lot other small fixes
* updated docs
* + test for tapas
* updated testing_utils to check
is_tensorflow_probability_available
* converted model logits post processing using
numpy to work with both PT and TF models
* + TFAutoModelForTableQuestionAnswering
* added TF support
* added test for
TFAutoModelForTableQuestionAnswering
* added test for
TFAutoModelForTableQuestionAnswering pipeline
* updated auto model docs
* fixed typo in import
* added tensorflow_probability to run tests
* updated MLM head
* updated tapas.rst with TF model docs
* fixed optimizer import in docs
* updated convert to np
data from pt model is not
`transformers.tokenization_utils_base.BatchEncoding`
after pipeline upgrade
* updated pipeline:
1. with torch.no_gard removed, pipeline forward handles
2. token_type_ids converted to numpy
* updated docs.
* removed `use_cache` from config
* removed floats_tensor
* updated code comment
* updated Copyright Year and
logits_aggregation Optional
* updated docs and comments
* updated docstring
* fixed model weight loading
* make fixup
* fix indentation
* added tf slow pipeline test
* pip upgrade
* upgrade python to 3.7
* removed from_pt from tests
* revert commit f18cfa9
2021-11-30 11:07:55 +01:00
NielsRogge
25156eb296
Rename ImageGPT ( #14526 )
...
* Rename
* Add MODEL_FOR_CAUSAL_IMAGE_MODELING_MAPPING
2021-11-29 10:19:11 +01:00
Xing Han Lu
ebbe8cc3fe
Tokenizers docs: Specify which class contains __call__ method ( #14379 )
...
* Update tokenizer.rst
* Apply `make fixup`
2021-11-28 18:55:38 -05:00
Lysandre Debut
2318bf77eb
Fixes ( #14534 )
2021-11-26 04:35:08 -05:00
Lysandre Debut
c15f4f203f
Quicktour updates ( #14533 )
2021-11-26 04:09:31 -05:00
Chris Fregly
1bbd6fcdeb
added save_directories for _psave_pretrained_pt and _tf, changed model to tf_model and pt_model, enable the notebook to run cleanly from top to bottom without error ( #14529 )
...
* added save_directories for _psave_pretrained_pt and _tf, changed model to tf_model and pt_model, enable the notebook to run cleanly from top to bottom without error
* Update quicktour.rst
* added >>>
* dependencies
* added space
2021-11-26 03:46:07 -05:00
Stas Bekman
956a483173
[deepspeed] zero inference ( #14253 )
...
* [deepspeed] zero inference
* only z3 makes sense for inference
* fix and style
* docs
* rework
* fix test
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* responding to suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-11-23 14:09:15 -08:00
Sylvain Gugger
204d251310
Auto processor ( #14465 )
...
* Add AutoProcessor class
* Init and tests
* Add doc
* Fix init
* Update src/transformers/models/auto/processing_auto.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
* Reverts to tokenizer or feature extractor when available
* Adapt test
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
2021-11-22 12:17:38 -05:00
Daniel Stancl
e0e2da1194
Improve a add-new-pipeline docs a bit ( #14485 )
2021-11-22 10:35:49 -05:00
Shang Zhang
a59e7c1ed4
Add QDQBert model and quantization examples of SQUAD task ( #14066 )
...
* clean up branch for add-qdqbert-model
* README update for QAT example; update docstrings in modeling_qdqbert.py
* Update qdqbert.rst
* Update README.md
* Update README.md
* calibration data using traning set; QAT example runs in fp32
* re-use BERTtokenizer for qdqbert
* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* remove qdqbert tokenizer
* Update qdqbert.rst
* update evaluate-hf-trt-qa.py
* update configuration_qdqbert.py
* update modeling_qdqbert.py: add copied statement; replace assert with ValueError
* update copied from statement
* add is_quantization_available; run make fix-copies
* unittest add require_quantization
* add backend dependency to qdqbert model
* update README; update evaluate script; make style
* lint
* docs qdqbert update
* circleci build_doc add pytorch-quantization for qdqbert
* update README
* update example readme with instructions to upgrade TensorRT to 8.2
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
* change quantization to pytorch_quantization for backend requirement
* feed_forward_chunking not supported in QDQBert
* make style
* update model docstrings and comments in testing scripts
* rename example to quantization-qdqbert; rename example scripts from qat to quant
* Update src/transformers/models/qdqbert/modeling_qdqbert.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* rm experimental functions in quant_trainer
* qa cleanup
* make fix-copies for docs index.rst
* fix doctree; use post_init() for qdqbert
* fix early device assignment for qdqbert
* fix CI:Model templates runner
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
Co-authored-by: Lysandre Debut <lysandre@huggingface.co >
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2021-11-19 13:33:39 -05:00
NielsRogge
0490b98877
[ImageGPT] Small fixes ( #14460 )
...
* Add integration test
* Fix typo
2021-11-19 15:15:02 +01:00
NielsRogge
da36c557f7
Add ImageGPT ( #14240 )
...
* First draft
* More improvements
* Improve conversion script
* Fix init weights for layer norm
* Fix correct model for conversion script
* Don't tie input and output embeddings
* Add print statements for debugging
* Add print statements for debugging
* Fix vocab size of model
* Improve documentation, remove fast tokenizer
* Add ImageGPTForImageClassification, improve docs
* Fix docs issue
* Set verbosity level back to info
* Improve tests
* Fix tests and add figure
* Delete tokenizer file
* Remove ImageGPTTokenizer from init files
* Remove ImageGPTLayer from init files
* Remove ImageGPT tokenizer from docs
* First draft of ImageGPTFeatureExtractor
* Fix typo
* Fix bug
* More improvements
* Apply suggestions from code review, add tests for feature extractor
* Fix layernorm
* Update save_pretrained method
* Fix issue
* Make all tests of ImageGPTFeatureExtractor pass
* Update code examples
* Rename model inputs to pixel_values
* Improve code examples
* Update init_weights to post_init
* Fix post_init
2021-11-18 16:24:34 +01:00
Patrick von Platen
754202de4f
[Bart] Fix docs ( #14434 )
2021-11-17 19:02:33 +01:00
Lysandre
c6c075544d
Docs for version v4.12.5
2021-11-17 11:39:12 -05:00
NielsRogge
a2864a50e7
Improve semantic segmentation models ( #14355 )
...
* Improve tests
* Improve documentation
* Add ignore_index attribute
* Add semantic_ignore_index to BEiT model
* Add segmentation maps argument to BEiTFeatureExtractor
* Simplify SegformerFeatureExtractor and corresponding tests
* Improve tests
* Apply suggestions from code review
* Minor docs improvements
* Streamline segmentation map tests of SegFormer and BEiT
* Improve reduce_labels docs and test
* Fix code quality
* Fix code quality again
2021-11-17 15:29:58 +01:00
Lysandre
888fb21159
Docs for v4.12.4
2021-11-16 17:40:58 -05:00
Patrick von Platen
4ce74edf51
[Speech2Text2] Enable tokenizers ( #14390 )
...
* [Speech2Text2] Enable tokenizers
* minor fix
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-11-15 16:34:11 +01:00
Stas Bekman
29dfb2dbb1
[doc] performance and parallelism updates ( #14391 )
...
* [doc] performance and parallelism doc update
* improve
* improve
2021-11-14 17:19:15 -08:00
Nicolas Patry
5c153079e2
Adding some quality of life for pipeline function. ( #14322 )
...
* Adding some quality of life for `pipeline` function.
* Update docs/source/main_classes/pipelines.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Update src/transformers/pipelines/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Improve the tests.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-11-10 10:18:35 +01:00
Steven Liu
e4d8f517b9
Rewrite guides for fine-tuning with Datasets ( #13923 )
...
* rewrite guides for fine-tuning with datasets
* simple qa code example
* use anonymous rST links
* style
2021-11-09 14:12:50 -05:00
Yih-Dar
be4a6c64dc
Add TFViTModel ( #13778 )
...
* Start the work for TFViTModel
* Convert to TF code - need to check in the follow up commits
* Clean up model code
* Expose TFViTModel
* make style
* make quality
* Add test
* make style & quality
* Fix some imports
* fix wrong usage - *kwargs => ** kwargs
* Fix Conv2D weight loading (PT->TF) issue
* Add tests for images with different sizes + fix model
* Fix some common tests for TFViTModel
* Use inputs instead of input_ids in test_compile_tf_model
* Add a comment about transpose and Conv2D in convert_tf_weight_name_to_pt_weight_name
* Avoid transpose in TFViT call
* Fix Conv2D issue in load_tf2_weights_in_pytorch_model
* Use tf.keras.layers.Conv2D instead of tf.nn.conv2d
* Using simpler heuristic to detect Conv2D layer
* Change convert_tf_weight_name_to_pt_weight_name to return TransposeType
* Check tf_weight_shape is not None before using it
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* fix missing comma
* fix input dtype
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-11-09 07:54:37 -05:00
Yih-Dar
95b3ec3bc9
Add FlaxVisionEncoderDecoderModel ( #13359 )
...
* Start the work on FlaxVisionEncoderDecoderModel
* Add FlaxVisionEncoderDecoderModel
* Add VisionEncoderDecoderConfig
* Make FlaxVisionEncoderDecoderModel visible to transformers
* Add test
* Fix wrong getattr usage
* Fix tests
* Add FlaxAutoModelForVision2Seq
* Expose FLAX_MODEL_FOR_VISION_2_SEQ_MAPPING
* clean-up
* add integration test
* update expected logits
* update expected scores
* Add ViT2GPT2ModelIntegrationTest + some cleaning
* Add projection layer + PT/Flax equivalence tests
* Fix import
* minor changes
* make test slow again
* Apply suggestions
* Add modeling_flax_vision_encoder_decoder to _ignore_modules in get_model_modules()
* fix copies
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com >
* split long strings in multiple lines
* decoder_input_ids can't be None
* Add back test_configuration_tie
* Remove attention_mask parameter
* fix test - encoder_last_hidden_state should be encoder_outputs.last_hidden_state instead of the projected vector
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Remove more encoder_attention_mask
* remove encoder_attention_mask when calling self.decode (in FlaxVisionEncoderDecoderModule)
* Fix style + pass 1s instead of None as encoder_attention_mask
* fix init_weights
* pass None for encoder_attention_mask
* pass 1s instead of None as encoder_attention_mask
* Fix doc style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
Co-authored-by: Suraj Patil <surajp815@gmail.com >
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2021-11-09 15:14:28 +05:30
Xing Han Lu
843c326ee1
Update dpr.rst ( #14300 )
2021-11-06 09:41:02 -04:00
Sylvain Gugger
f0d6e952c0
Quality explain ( #14264 )
...
* Start PR doc
* Cleanup the quality checks and document them
* Add reference in the contributing guide
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com >
* Rename file as per review suggestion
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com >
2021-11-03 17:43:19 -04:00
NielsRogge
5f789a687a
Add LayoutXLMProcessor (and LayoutXLMTokenizer, LayoutXLMTokenizerFast) ( #14115 )
...
* Add LayoutXLMTokenizer and LayoutXLMTokenizerFast
* Fix styling issues
* Fix more styling issues
* Fix more styling issues
* Fix docstring
* Fix unit tests
* Fix docs
* Fix unit tests
* Fix typos and styling issues
* Fix styling issues
* Fix docstring
* Make all tests of test_tokenization_layoutxlm pass
* Add LayoutXLMProcessor
* Make fixup
* Make all LayoutXLMProcessor tests pass
* Minor fixes
* Leave LayoutLMv2Processor tests unchanged
* Fix code quality
* Move LayoutXLM tokenizers and processor to separate folder
* Fix code quality
* Apply suggestions from code review
* Replace assertions by value errors
* Remove methods from fast tokenizer
Co-authored-by: King Yiu Suen <kingyiusuen@gmail.com >
2021-11-03 08:59:44 +01:00
Sylvain Gugger
558f8543ba
Update Transformers to huggingface_hub >= 0.1.0 ( #14251 )
...
* Update Transformers to huggingface_hub >= 0.1.0
* Forgot to save...
* Style
* Fix test
2021-11-02 18:58:42 -04:00
lumliolum
519a677e87
Added Beit model output class ( #14133 )
...
* add Beit model ouput class
* inherting from BaseModelOuputWithPooling
* updated docs if use_mean_pooling is False
* added beit specific outputs in model docs
* changed the import path
* Fix docs
Co-authored-by: Niels Rogge <niels.rogge1@gmail.com >
2021-11-02 18:29:14 +01:00
NielsRogge
e20faa6f03
Add BeitForSemanticSegmentation ( #14096 )
...
* Add first draft
* Make forward pass work
* Improve conversion script
* Add notebook that checks if it works
* Add BeitForSemanticSegmentation to the tests
* More improvements
* Make BeitForSemanticSegmentation consistent with Segformer
* Small bug fix
* Add BeitForSemanticSegmentation to docs
* Make sure model doesn't output hidden states when the user doesn't want to
* Make it possible to convert the large model
* Fix issue
* Fix conversion script for large model
* Add auxiliary_head option to semantic segmentation model
* Apply suggestions from @sgugger's review
* Apply suggestions from code review
* Fix failing test
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr >
2021-11-01 19:55:45 +01:00
Lysandre
9fc1951711
Docs for v4.12.2
2021-10-29 14:51:05 -04:00
Lysandre
513fa30a63
Docs for v4.12.1
2021-10-29 13:49:50 -04:00
Daniel Stancl
d37f1fb8ba
Add BlenderbotTokenizerFast ( #13720 )
...
* Add the support for the fast (rust) implementation of BlenbderbotTokenizer
* Fix a converter and a typo in a doc
* Apply the patil-suraj's suggestion
* (Nitpick) Fast tokenization -> Fast Tokenization in doc
* Apply the SaulLu's suggestion
* Apply Narsil's suggestion to fix test pipelines
* Add encoder_no_repeat_ngram_size according to the Narsil's suggestion
* Revert the last (unnecessary) commit
* Override pipeline config for Blenderbot to allow for larger pos. emb.
* make fix-copies
2021-10-29 09:19:01 -04:00
Nicolas Patry
be236361f1
Adding batch_size support for (almost) all pipelines ( #13724 )
...
* Tentative enabling of `batch_size` for pipelines.
* Add systematic test for pipeline batching.
* Enabling batch_size on almost all pipelines
- Not `zero-shot` (it's already passing stuff as batched so trickier)
- Not `QA` (preprocess uses squad features, we need to switch to real
tensors at this boundary.
* Adding `min_length_for_response` for conversational.
* Making CTC, speech mappings avaiable regardless of framework.
* Attempt at fixing automatic tests (ffmpeg not enabled for fast tests)
* Removing ffmpeg dependency in tests.
* Small fixes.
* Slight cleanup.
* Adding docs
and adressing comments.
* Quality.
* Update docs/source/main_classes/pipelines.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Update src/transformers/pipelines/question_answering.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Update src/transformers/pipelines/zero_shot_classification.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Improving docs.
* Update docs/source/main_classes/pipelines.rst
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com >
* N -> oberved_batch_size
softmax trick.
* Follow `padding_side`.
* Supporting image pipeline batching (and padding).
* Rename `unbatch` -> `loader_batch`.
* unbatch_size forgot.
* Custom padding for offset mappings.
* Attempt to remove librosa.
* Adding require_audio.
* torchaudio.
* Back to using datasets librosa.
* Adding help to set a pad_token on the tokenizer.
* Update src/transformers/pipelines/base.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Update src/transformers/pipelines/base.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Update src/transformers/pipelines/base.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Quality.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com >
2021-10-29 11:34:18 +02:00
Lysandre
b8fad022a0
v4.13.0.dev0
2021-10-28 12:56:46 -04:00
Lysandre
62bf536631
Release v4.12.0
Release - Conda / build_and_package (push) Has been cancelled
2021-10-28 12:09:49 -04:00
NielsRogge
1dc96a760d
Add SegFormer ( #14019 )
...
* First draft
* Make style & quality
* Improve conversion script
* Add print statement to see actual slice
* Make absolute tolerance smaller
* Fix image classification models
* Add post_process_semantic method
* Disable padding
* Improve conversion script
* Rename to ForSemanticSegmentation, add integration test, remove post_process methods
* Improve docs
* Fix code quality
* Fix feature extractor tests
* Fix tests for image classification model
* Delete file
* Add is_torch_available to feature extractor
* Improve documentation of feature extractor methods
* Apply suggestions from @sgugger's code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Apply some more suggestions of code review
* Rebase with master
* Fix rebase issues
* Make sure model only outputs hidden states when the user wants to
* Apply suggestions from code review
* Add pad method
* Support padding of 2d images
* Add print statement
* Add print statement
* Move padding method to SegformerFeatureExtractor
* Fix issue
* Add casting of segmentation maps
* Add test for padding
* Add small note about padding
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-10-28 08:23:52 -04:00
Patrick von Platen
9f3aa46f45
Add Unispeech & Unispeech-SAT ( #13963 )
...
* unispeech
* add copy from
* remove hubert copy from
* finish for today
* add unispeech-sat
* adapt more
* up
* up
* up
* up
* add modeling
* add tests
* up
* up
* finish
* up
* Apply suggestions from code review
* up
* up
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* up
* up
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2021-10-26 18:59:58 +02:00
Thomas Chaigneau
1f60df81b2
Add Camembert to models exportable with ONNX ( #14059 )
...
Add Camembert to models exportable with ONNX
Co-authored-by: Thomas.Chaigneau <thomas.chaigneau@arkea.com >
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com >
2021-10-26 11:22:22 +02:00
Matt
84b9579da7
Remove unneeded to_tensor() in TF inline example ( #14140 )
2021-10-25 15:04:36 +01:00
Reza Gharibi
3e04a41a9b
Fix some writing issues in the docs ( #14136 )
...
* Fix some writing issues in the docs
* Run code quality check
2021-10-25 07:48:02 -04:00
Reza Gharibi
6b83090e80
Fix some typos in the docs ( #14126 )
...
* Fix some typos in the docs
* Fix a styling issue
* Fix code quality check error
2021-10-25 07:40:44 -04:00
Kevin Ko
95bab53868
Update TP parallel GEMM image ( #14112 )
...
* Update TP parallel GEMM image
* Delete parallelism-tp-parallel_gemm.png
* Update parallelism-tp-parallel_gemm.png
2021-10-22 12:57:48 -07:00