Arthur
799df10aef
[Umt5] Add google's umt5 to transformers (#24477)
* add tokenization template
* update conversion script
* update modeling code
* update
* update convert checkpoint
* update modeling
* revert changes on convert script
* new conversion script for new format
* correct position bias
* cleaning a bit
* Credit co authors
Co-authored-by: agemagician
<ahmed.elnaggar@tum.de>
Co-authored-by: stefan-it
<>
* styling
* Add docq
* fix copies
* add co author
* Other Author
* Merge branch 'main' of https://github.com/huggingface/transformers into add-umt5
* add testing
* nit
* Update docs/source/en/model_doc/umt5.mdx
Co-authored-by: Stefan Schweter <stefan@schweter.it>
* fix t5
* actual fix?
* revert wrong changes
* remove
* update test
* more fixes
* revert some changes
* add SPIECE_UNDERLINE
* add a commone xample
* upfate
* fix copies
* revert changes on t5 conversion script
* revert bytefallback changes since there was no addition yet
* fixup
* fixup
* ingore umt5 cutom testing folder
* fix readmes
* revertT5 changes
* same outputs
* fixup
* update example
* Apply suggestions from code review
* style
* draft addition of all new files
* current update
* fix attention and stuff
* finish refactoring
* auto config
* fixup
* more nits
* add umt5 to init
* use md format
* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* revert changes on mt5
* revert mt4 changes
* update test
* more fixes
* add to mapping
* fix-copies
* fix copies
* foix retain grad
* fix some tests
* nits
* done
* Update src/transformers/models/umt5/modeling_umt5.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update docs/source/en/model_doc/umt5.md
* Update src/transformers/models/umt5/__init__.py
* Update docs/source/en/model_doc/umt5.md
Co-authored-by: Stefan Schweter <stefan@schweter.it>
* Update src/transformers/models/umt5/modeling_umt5.py
* update conversion script + use google checkpoints
* nits
* update test and modelling
* stash slow convert
* update fixupd
* don't change slow
---------
Co-authored-by: stefan-it <>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-07-03 07:38:21 +02:00
..
2022-11-08 19:54:41 +00:00
2021-02-15 07:55:10 -05:00
2023-04-06 18:08:14 +02:00
2023-05-18 14:14:43 -04:00
2023-06-15 11:39:20 +02:00
2023-06-29 14:48:59 +01:00
2023-06-27 12:15:49 +01:00
2023-02-22 09:14:54 +01:00
2023-02-06 18:10:56 -05:00
2023-02-07 12:27:31 -05:00
2023-04-25 09:17:56 -04:00
2023-03-13 19:11:19 +01:00
2023-07-03 07:38:21 +02:00
2023-06-06 18:17:41 +02:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2021-02-15 07:55:10 -05:00
2023-07-03 07:38:21 +02:00
2023-02-06 18:10:56 -05:00
2023-06-29 14:48:59 +01:00
2021-10-07 12:44:23 +05:30
2023-02-28 17:12:44 +01:00
2023-04-19 19:27:37 +02:00
2023-02-28 17:12:44 +01:00
2023-02-03 12:57:02 -05:00
2023-04-21 20:36:35 +02:00
2023-03-01 17:53:29 +01:00
2023-06-20 18:07:47 -04:00
2023-06-12 21:27:10 +02:00
2023-03-30 21:06:35 +02:00
2022-06-02 10:24:16 +02:00
2022-06-02 07:44:03 -04:00
2023-04-25 09:17:56 -04:00
2023-06-27 12:15:49 +01:00
2023-05-31 14:10:14 -04:00
2023-04-06 22:52:59 +02:00