Files
HuggingFace_transformer/docs/source/en/model_doc
Andreas Madsen b4b613b102 Implement Roberta PreLayerNorm (#20305)
* Copy RoBERTa

* formatting

* implement RoBERTa with prelayer normalization

* update test expectations

* add documentation

* add convertion script for DinkyTrain weights

* update checkpoint repo

Unfortunately the original checkpoints assumes a hacked roberta model

* add to RoBERTa-PreLayerNorm docs to toc

* run utils/check_copies.py

* lint files

* remove unused import

* fix check_repo reporting wrongly a test is missing

* fix import error, caused by rebase

* run make fix-copies

* add RobertaPreLayerNormConfig to ROBERTA_EMBEDDING_ADJUSMENT_CONFIGS

* Fix documentation <Facebook> -> Facebook

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup: Fix documentation <Facebook> -> Facebook

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Add missing Flax header

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* expected_slice -> EXPECTED_SLICE

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* update copies after rebase

* add missing copied from statements

* make fix-copies

* make prelayernorm explicit in code

* fix checkpoint path for the original implementation

* add flax integration tests

* improve docs

* update utils/documentation_tests.txt

* lint files

* Remove Copyright notice

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* make fix-copies

* Remove EXPECTED_SLICE calculation comments

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-12-19 09:30:17 +01:00
..
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-11-01 11:09:53 -07:00
2022-04-04 10:25:46 -04:00
2022-12-05 10:12:03 -05:00
2022-12-07 11:03:39 +01:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-11-09 18:31:22 +01:00
2022-06-24 17:10:38 +02:00
2022-04-04 10:25:46 -04:00
2022-11-08 19:54:41 +00:00
2022-04-04 10:25:46 -04:00
2022-11-15 13:26:07 -05:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-11-29 10:38:01 +00:00
2022-04-04 10:25:46 -04:00
2022-11-08 19:54:41 +00:00
2022-10-31 21:32:58 -04:00
2022-04-04 10:25:46 -04:00
2022-11-08 19:54:41 +00:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-11-29 10:02:40 -05:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-11-08 19:54:41 +00:00
2022-11-08 19:54:41 +00:00
2022-04-04 10:25:46 -04:00
2022-10-18 17:42:46 +02:00
2022-08-16 10:20:46 -05:00
2022-04-04 10:25:46 -04:00
2022-10-24 17:54:23 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-06-29 09:30:55 -04:00
2022-10-10 09:30:59 -04:00
2022-09-02 19:54:02 +02:00
2022-04-04 10:25:46 -04:00
2022-11-08 19:54:41 +00:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-10-27 11:33:15 -07:00
2022-11-08 10:03:43 -05:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-12-16 16:24:01 +01:00
2022-11-14 01:04:26 -05:00
2022-04-08 10:57:51 +02:00
2022-04-04 10:25:46 -04:00
2022-06-21 10:24:50 +02:00
2022-05-17 00:43:16 +02:00
2022-04-04 10:25:46 -04:00
2022-11-08 19:54:41 +00:00
2022-12-07 11:03:39 +01:00
2022-10-28 13:28:18 -07:00
2022-05-17 00:43:16 +02:00
2022-09-19 19:22:34 +02:00
2022-12-13 09:25:55 -08:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00