NielsRogge
f3d2f7a6e0
Add MarkupLM (#19198)
* First draft
* Make basic test work
* Fix most tokenizer tests
* More improvements
* Make more tests pass
* Fix more tests
* Fix some code quality
* Improve truncation
* Implement feature extractor
* Improve feature extractor and add tests
* Improve feature extractor tests
* Fix pair_input test partly
* Add fast tokenizer
* Improve implementation
* Fix rebase
* Fix rebase
* Fix most of the tokenizer tests.
* propose solution for fast
* add: integration test for fasttokenizer, warning for decode, fix template in slow tokenizer
* add: modify markuplmconverter
* add: some modify on converter and tokenizerfast
* Fix style, copies
* Make fixup
* Update tokenization_markuplm.py
* Update test_tokenization_markuplm.py
* Update markuplm related
* Improve processor, add integration test
* Add processor test file
* Improve processor
* Improve processor tests
* Fix more processor tests
* Fix processor tests
* Update docstrings
* Add Copied from statements
* Add more Copied from statements
* Add code examples
* Improve code examples
* Add model to doc tests
* Adding dependency check
* Add dummy file
* Add requires_backends
* Add model to toctree
* Fix more things, disable dependency check for now
* Apply more suggestions
* Add soft dependency
* Add annotators to tests
* Fix style
* Remove from_slow=True
* Remove print statements
* Add sanity check
* Fix processor test
* Fix processor tests, add more docs
* Add doc tests for mdx file
* Add more tips
* Apply suggestions
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: lockon-n <45759388+lockon-n@users.noreply.github.com>
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: lockon-n <dd098309@126.com>
2022-09-30 08:25:43 +02:00
..
2022-04-04 10:25:46 -04:00
2022-09-07 13:38:49 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-09-20 10:41:56 +03:00
2022-04-29 17:42:15 -04:00
2022-04-04 10:25:46 -04:00
2022-06-27 12:06:21 +01:00
2022-04-04 10:25:46 -04:00
2022-05-03 11:26:19 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-08-12 19:36:18 +02:00
2022-04-04 10:25:46 -04:00
2022-04-13 11:36:54 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-06-24 17:10:38 +02:00
2022-09-22 13:21:05 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-05-02 12:47:39 -04:00
2022-04-04 10:25:46 -04:00
2022-05-18 17:47:18 +02:00
2022-06-08 14:03:18 +01:00
2022-05-10 16:21:44 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-09-22 09:45:24 +02:00
2022-07-13 18:04:08 +01:00
2022-06-21 16:45:35 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-08-12 18:52:49 +02:00
2022-04-04 10:25:46 -04:00
2022-09-21 15:15:26 +03:00
2022-05-03 11:26:19 +02:00
2022-07-27 10:08:59 +02:00
2022-09-09 07:36:46 -04:00
2022-04-04 10:25:46 -04:00
2022-05-11 14:56:48 -07:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-09-14 10:17:40 -04:00
2022-05-24 09:31:10 -04:00
2022-04-04 10:25:46 -04:00
2022-09-29 10:48:04 +01:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-09-02 14:48:19 +02:00
2022-05-23 08:02:31 -04:00
2022-08-30 11:48:11 +01:00
2022-04-04 10:25:46 -04:00
2022-05-17 23:44:37 +02:00
2022-06-01 17:06:20 +02:00
2022-04-04 10:25:46 -04:00
2022-08-16 10:20:46 -05:00
2022-08-01 11:09:47 -04:00
2022-04-04 10:25:46 -04:00
2022-09-29 13:27:58 +02:00
2022-07-29 08:09:09 -04:00
2022-09-30 08:25:43 +02:00
2022-08-02 18:43:58 +03:00
2022-07-29 08:09:09 -04:00
2022-07-29 08:09:09 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-09-23 16:24:28 +03:00
2022-04-04 10:25:46 -04:00
2022-06-30 00:49:02 +02:00
2022-06-29 09:30:55 -04:00
2022-06-23 12:36:22 -04:00
2022-07-29 08:09:09 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-07-20 10:14:21 +02:00
2022-08-11 19:10:25 +03:00
2022-09-02 19:54:02 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-07-29 08:09:09 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-06-29 13:45:14 +01:00
2022-04-04 10:25:46 -04:00
2022-07-04 10:59:15 +01:00
2022-04-04 10:25:46 -04:00
2022-05-03 11:26:19 +02:00
2022-04-04 10:25:46 -04:00
2022-09-21 11:40:35 +03:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-07-29 08:09:09 -04:00
2022-07-29 08:09:09 -04:00
2022-07-29 08:09:09 -04:00
2022-05-17 23:42:14 +02:00
2022-04-04 10:25:46 -04:00
2022-05-16 22:19:53 +01:00
2022-07-27 11:14:47 -04:00
2022-09-06 14:12:26 +02:00
2022-04-13 11:36:54 +02:00
2022-04-04 10:25:46 -04:00
2022-04-08 10:57:51 +02:00
2022-05-17 19:07:43 -04:00
2022-04-04 10:25:46 -04:00
2022-07-29 08:09:09 -04:00
2022-06-21 10:24:50 +02:00
2022-05-17 00:43:16 +02:00
2022-05-17 00:43:16 +02:00
2022-04-04 10:25:46 -04:00
2022-08-04 18:02:55 +02:00
2022-07-26 10:11:32 +02:00
2022-07-27 10:08:59 +02:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-09-22 07:15:03 -04:00
2022-09-19 19:22:34 +02:00
2022-04-04 10:25:46 -04:00
2022-06-15 18:34:15 +02:00
2022-07-29 08:09:09 -04:00
2022-05-17 00:43:16 +02:00
2022-09-19 19:22:34 +02:00
2022-08-24 10:51:05 +01:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2022-05-02 18:30:55 +02:00
2022-04-04 10:25:46 -04:00