HuggingFace_transformer/docs/source at bc0d26d1dea73b23f6e388c18709287d5423a2d8 - HuggingFace_transformer - Gitea: Git with SSUM

SUMIN/HuggingFace_transformer

Files

History

Yossi Synett bc0d26d1de [All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071 )

* Output cross-attention with decoder attention output

* Update src/transformers/modeling_bert.py

* add cross-attention for t5 and bart as well

* fix tests

* correct typo in docs

* add sylvains and sams comments

* correct typo

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2020-11-06 19:34:48 +01:00

..

Docs for v3.4.0

2020-10-20 16:29:00 +02:00

Guide to fixed-length model perplexity evaluation (#5449 )

2020-07-07 16:04:15 -06:00

Refactoring the generate() function (#6949 )

2020-11-03 16:04:22 +01:00

[All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071 )

2020-11-06 19:34:48 +01:00

Docs bart training ref (#8330 )

2020-11-05 17:20:57 -05:00

benchmarks.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

bertology.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

conf.py

Release: v3.4.0

2020-10-20 16:22:26 +02:00

contributing.md

Update installation page and add contributing to the doc (#5084 )

2020-06-17 14:01:10 -04:00

converting_tensorflow_models.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

custom_datasets.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

examples.md

per_device instead of per_gpu/error thrown when argument unknown (#4618 )

2020-05-27 11:36:55 -04:00

favicon.ico

Adding usage examples for common tasks (#2850 )

2020-02-25 13:48:24 -05:00

glossary.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

index.rst

Refactoring the generate() function (#6949 )

2020-11-03 16:04:22 +01:00

installation.md

Fix doc errors and typos across the board (#8139 )

2020-10-29 10:33:33 -04:00

migration.md

Fix doc errors and typos across the board (#8139 )

2020-10-29 10:33:33 -04:00

model_sharing.rst

Fix doc errors and typos across the board (#8139 )

2020-10-29 10:33:33 -04:00

model_summary.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

multilingual.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

notebooks.md

Update notebooks (#3620 )

2020-04-06 14:32:39 -04:00

perplexity.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

philosophy.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

preprocessing.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

pretrained_models.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

quicktour.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

serialization.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

task_summary.rst

Fix doc errors and typos across the board (#8139 )

2020-10-29 10:33:33 -04:00

testing.rst

[s2s] test_distributed_eval (#8315 )

2020-11-05 16:01:15 -05:00

tokenizer_summary.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00

training.rst

Doc styling (#8067 )

2020-10-26 18:26:02 -04:00