NielsRogge
f3d2f7a6e0
Add MarkupLM (#19198)
* First draft
* Make basic test work
* Fix most tokenizer tests
* More improvements
* Make more tests pass
* Fix more tests
* Fix some code quality
* Improve truncation
* Implement feature extractor
* Improve feature extractor and add tests
* Improve feature extractor tests
* Fix pair_input test partly
* Add fast tokenizer
* Improve implementation
* Fix rebase
* Fix rebase
* Fix most of the tokenizer tests.
* propose solution for fast
* add: integration test for fasttokenizer, warning for decode, fix template in slow tokenizer
* add: modify markuplmconverter
* add: some modify on converter and tokenizerfast
* Fix style, copies
* Make fixup
* Update tokenization_markuplm.py
* Update test_tokenization_markuplm.py
* Update markuplm related
* Improve processor, add integration test
* Add processor test file
* Improve processor
* Improve processor tests
* Fix more processor tests
* Fix processor tests
* Update docstrings
* Add Copied from statements
* Add more Copied from statements
* Add code examples
* Improve code examples
* Add model to doc tests
* Adding dependency check
* Add dummy file
* Add requires_backends
* Add model to toctree
* Fix more things, disable dependency check for now
* Apply more suggestions
* Add soft dependency
* Add annotators to tests
* Fix style
* Remove from_slow=True
* Remove print statements
* Add sanity check
* Fix processor test
* Fix processor tests, add more docs
* Add doc tests for mdx file
* Add more tips
* Apply suggestions
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: lockon-n <45759388+lockon-n@users.noreply.github.com>
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: lockon-n <dd098309@126.com>
2022-09-30 08:25:43 +02:00
..
2022-07-19 12:02:35 +02:00
2021-02-15 07:55:10 -05:00
2022-09-08 14:50:30 +02:00
2022-08-12 16:40:58 +02:00
2022-08-01 11:10:20 -05:00
2022-09-21 14:54:09 -04:00
2022-05-25 07:23:56 -04:00
2022-09-29 10:48:04 +01:00
2022-09-23 19:23:05 +02:00
2022-07-11 05:00:13 -04:00
2021-02-15 07:55:10 -05:00
2022-06-13 08:05:45 -04:00
2022-09-30 08:25:43 +02:00
2021-10-07 12:44:23 +05:30
2022-09-29 19:22:23 +02:00
2022-09-01 12:02:52 +02:00
2022-03-23 03:46:59 -04:00
2022-08-29 09:10:13 +02:00
2022-09-23 19:23:05 +02:00
2022-07-26 15:47:23 +02:00
2022-08-01 14:23:02 +02:00
2022-06-02 10:24:16 +02:00
2022-06-02 07:44:03 -04:00
2022-05-16 13:24:20 -04:00
2022-09-26 18:01:00 -04:00
2022-09-16 15:40:38 +02:00