NielsRogge
f3d2f7a6e0
Add MarkupLM (#19198)
* First draft
* Make basic test work
* Fix most tokenizer tests
* More improvements
* Make more tests pass
* Fix more tests
* Fix some code quality
* Improve truncation
* Implement feature extractor
* Improve feature extractor and add tests
* Improve feature extractor tests
* Fix pair_input test partly
* Add fast tokenizer
* Improve implementation
* Fix rebase
* Fix rebase
* Fix most of the tokenizer tests.
* propose solution for fast
* add: integration test for fasttokenizer, warning for decode, fix template in slow tokenizer
* add: modify markuplmconverter
* add: some modify on converter and tokenizerfast
* Fix style, copies
* Make fixup
* Update tokenization_markuplm.py
* Update test_tokenization_markuplm.py
* Update markuplm related
* Improve processor, add integration test
* Add processor test file
* Improve processor
* Improve processor tests
* Fix more processor tests
* Fix processor tests
* Update docstrings
* Add Copied from statements
* Add more Copied from statements
* Add code examples
* Improve code examples
* Add model to doc tests
* Adding dependency check
* Add dummy file
* Add requires_backends
* Add model to toctree
* Fix more things, disable dependency check for now
* Apply more suggestions
* Add soft dependency
* Add annotators to tests
* Fix style
* Remove from_slow=True
* Remove print statements
* Add sanity check
* Fix processor test
* Fix processor tests, add more docs
* Add doc tests for mdx file
* Add more tips
* Apply suggestions
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: lockon-n <45759388+lockon-n@users.noreply.github.com>
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: lockon-n <dd098309@126.com>
2022-09-30 08:25:43 +02:00
..
2022-06-03 11:08:35 +02:00
2022-09-14 16:21:06 -07:00
2022-09-30 08:25:43 +02:00
2022-09-26 09:50:48 -04:00
2022-04-04 10:25:46 -04:00
2022-09-30 08:25:43 +02:00
2022-08-08 10:53:52 -05:00
2022-08-01 12:07:30 -04:00
2022-07-21 07:55:30 +02:00
2022-09-05 11:09:39 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-07-04 04:13:05 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-07-11 05:00:13 -04:00
2022-09-14 16:29:58 -07:00
2022-04-04 10:25:46 -04:00
2022-08-12 10:49:59 -07:00
2022-09-27 07:51:51 -04:00
2022-09-30 08:25:43 +02:00
2022-08-05 13:14:00 -04:00
2022-04-04 10:25:46 -04:00
2022-08-06 09:38:55 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-05-16 13:36:41 +02:00
2022-06-14 07:56:47 -04:00
2022-08-24 10:37:52 +02:00
2022-08-18 17:34:48 +02:00
2022-06-23 14:51:54 +02:00
2022-09-22 09:20:15 -04:00
2022-06-08 09:41:57 -04:00
2022-06-23 14:51:54 +02:00
2022-09-05 14:06:36 +02:00
2022-06-23 14:51:54 +02:00
2022-06-23 14:51:54 +02:00
2022-06-23 14:51:54 +02:00
2022-09-06 15:20:12 -04:00
2022-08-10 13:20:39 -05:00
2022-08-24 05:38:03 -04:00
2022-08-09 18:22:55 -05:00
2022-09-28 17:09:44 -07:00
2022-09-07 14:45:22 -07:00
2022-08-06 09:38:55 +02:00
2022-04-04 10:25:46 -04:00
2022-09-22 09:45:04 +02:00
2022-08-06 09:38:55 +02:00
2022-07-11 05:00:13 -04:00
2022-04-04 10:25:46 -04:00
2022-09-07 13:30:07 +01:00
2022-04-04 10:25:46 -04:00