Arthur
799df10aef
[Umt5] Add google's umt5 to transformers (#24477)
* add tokenization template
* update conversion script
* update modeling code
* update
* update convert checkpoint
* update modeling
* revert changes on convert script
* new conversion script for new format
* correct position bias
* cleaning a bit
* Credit co authors
Co-authored-by: agemagician <ahmed.elnaggar@tum.de>
Co-authored-by: stefan-it <>
* styling
* Add doc
* fix copies
* add co author
* Other Author
* Merge branch 'main' of https://github.com/huggingface/transformers into add-umt5
* add testing
* nit
* Update docs/source/en/model_doc/umt5.mdx
Co-authored-by: Stefan Schweter <stefan@schweter.it>
* fix t5
* actual fix?
* revert wrong changes
* remove
* update test
* more fixes
* revert some changes
* add SPIECE_UNDERLINE
* add a common example
* update
* fix copies
* revert changes on t5 conversion script
* revert bytefallback changes since there was no addition yet
* fixup
* fixup
* ignore umt5 custom testing folder
* fix readmes
* revert T5 changes
* same outputs
* fixup
* update example
* Apply suggestions from code review
* style
* draft addition of all new files
* current update
* fix attention and stuff
* finish refactoring
* auto config
* fixup
* more nits
* add umt5 to init
* use md format
* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* revert changes on mt5
* revert mt4 changes
* update test
* more fixes
* add to mapping
* fix-copies
* fix copies
* fix retain grad
* fix some tests
* nits
* done
* Update src/transformers/models/umt5/modeling_umt5.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update docs/source/en/model_doc/umt5.md
* Update src/transformers/models/umt5/__init__.py
* Update docs/source/en/model_doc/umt5.md
Co-authored-by: Stefan Schweter <stefan@schweter.it>
* Update src/transformers/models/umt5/modeling_umt5.py
* update conversion script + use google checkpoints
* nits
* update test and modelling
* stash slow convert
* update fixup
* don't change slow
---------
Co-authored-by: stefan-it <>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-07-03 07:38:21 +02:00