HuggingFace_transformer/docs/source at 95b3ec3bc9e8fa135bd9adde5bbdd6cc7ee01618 - HuggingFace_transformer - Gitea: Git with SSUM

SUMIN/HuggingFace_transformer

Files

History

Yih-Dar 95b3ec3bc9 Add FlaxVisionEncoderDecoderModel (#13359 )

* Start the work on FlaxVisionEncoderDecoderModel

* Add FlaxVisionEncoderDecoderModel

* Add VisionEncoderDecoderConfig

* Make FlaxVisionEncoderDecoderModel visible to transformers

* Add test

* Fix wrong getattr usage

* Fix tests

* Add FlaxAutoModelForVision2Seq

* Expose FLAX_MODEL_FOR_VISION_2_SEQ_MAPPING

* clean-up

* add integration test

* update expected logits

* update expected scores

* Add ViT2GPT2ModelIntegrationTest + some cleaning

* Add projection layer + PT/Flax equivalence tests

* Fix import

* minor changes

* make test slow again

* Apply suggestions

* Add modeling_flax_vision_encoder_decoder to _ignore_modules in get_model_modules()

* fix copies

* Apply suggestions from code review

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* split long strings in multiple lines

* decoder_input_ids can't be None

* Add back test_configuration_tie

* Remove attention_mask parameter

* fix test - encoder_last_hidden_state should be encoder_outputs.last_hidden_state instead of the projected vector

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Remove more encoder_attention_mask

* remove encoder_attention_mask when calling self.decode (in FlaxVisionEncoderDecoderModule)

* Fix style + pass 1s instead of None as encoder_attention_mask

* fix init_weights

* pass None for encoder_attention_mask

* pass 1s instead of None as encoder_attention_mask

* Fix doc style

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2021-11-09 15:14:28 +05:30

..

Docs for v4.12.2

2021-10-29 14:51:05 -04:00

Update TP parallel GEMM image (#14112 )

2021-10-22 12:57:48 -07:00

Fix doc building error

2021-08-12 05:49:02 -04:00

Adding batch_size support for (almost) all pipelines (#13724 )

2021-10-29 11:34:18 +02:00

Add FlaxVisionEncoderDecoderModel (#13359 )

2021-11-09 15:14:28 +05:30

add_new_model.rst

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

add_new_pipeline.rst

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

benchmarks.rst

[Docs] fixed broken link (#12205 )

2021-06-16 15:14:53 -04:00

bertology.rst

Fix documentation links always pointing to master. (#9217 )

2021-01-05 06:18:48 -05:00

community.md

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

conf.py

v4.13.0.dev0

2021-10-28 12:56:46 -04:00

contributing.md

Update installation page and add contributing to the doc (#5084 )

2020-06-17 14:01:10 -04:00

converting_tensorflow_models.rst

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

custom_datasets.rst

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

debugging.rst

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

examples.md

per_device instead of per_gpu/error thrown when argument unknown (#4618 )

2020-05-27 11:36:55 -04:00

fast_tokenizers.rst

Documentation about loading a fast tokenizer within Transformers (#11029 )

2021-04-05 10:51:16 -04:00

favicon.ico

Adding usage examples for common tasks (#2850 )

2020-02-25 13:48:24 -05:00

glossary.rst

Add video links to the documentation (#12162 )

2021-06-15 06:37:37 -04:00

index.rst

Add FlaxVisionEncoderDecoderModel (#13359 )

2021-11-09 15:14:28 +05:30

installation.md

Fix some typos in the docs (#14126 )

2021-10-25 07:40:44 -04:00

migration.md

consistent nn. and nn.functional: part 5 docs (#12161 )

2021-06-14 13:34:32 -07:00

model_sharing.rst

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

model_summary.rst

Add video links to the documentation (#12162 )

2021-06-15 06:37:37 -04:00

multilingual.rst

Examples reorg (#11350 )

2021-04-21 11:11:20 -04:00

notebooks.md

Update notebooks (#3620 )

2020-04-06 14:32:39 -04:00

parallelism.md

Fix some writing issues in the docs (#14136 )

2021-10-25 07:48:02 -04:00

performance.md

Make gradient_checkpointing a training argument (#13657 )

2021-09-22 07:51:38 -04:00

perplexity.rst

Small changes in perplexity.rstto make the notebook executable on google collaboratory (#13541 )

2021-09-13 13:32:32 +02:00

philosophy.rst

Minor documentation revisions from copyediting (#9266 )

2020-12-23 10:15:49 -05:00

pr_checks.md

Quality explain (#14264 )

2021-11-03 17:43:19 -04:00

preprocessing.rst

Fix a typo in preprocessing docs (#14108 )

2021-10-21 17:00:26 -04:00

pretrained_models.rst

Fix broken link to distill models in docs (#13848 )

2021-10-04 11:57:54 -04:00

quicktour.rst

[Large PR] Entire rework of pipelines. (#13308 )

2021-09-10 14:47:48 +02:00

sagemaker.md

remove documentation (#12657 )

2021-07-12 18:02:51 +02:00

serialization.rst

Add Camembert to models exportable with ONNX (#14059 )

2021-10-26 11:22:22 +02:00

task_summary.rst

Fix broken link in translation section (#14087 )

2021-10-20 15:10:57 -04:00

testing.rst

[testing] auto-replay captured streams (#13803 )

2021-09-30 09:26:49 -07:00

tokenizer_summary.rst

Fix some typos in the docs (#14126 )

2021-10-25 07:40:44 -04:00

training.rst

Remove unneeded to_tensor() in TF inline example (#14140 )

2021-10-25 15:04:36 +01:00

troubleshooting.md

[troubleshooting] add 2 points of reference to the offline mode (#11236 )

2021-04-14 08:39:23 -07:00