Files
HuggingFace_transformer/tests
Nicolas Patry 3dd538c4d3 [Tentative] Moving slow tokenizer to the Trie world. (#13220)
* Moving slow tokenizer to the Trie world.

* Adding more docstrings to the Trie.

* Fixing doctest (incompatible wiht our format? )

* Update src/transformers/tokenization_utils.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Adding a lot more comment into the internals of this algorithm.

* Cleaner doc.

* Fixing the namings.

* Update src/transformers/tokenization_utils.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* quality.

* Fixing longest first match.

* Small improvements to cuts + more test + canine resistant test.

* Fixing fast test.

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-09-09 17:26:16 +02:00
..
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-05-12 13:48:15 +05:30
2021-01-27 21:25:11 +03:00
2020-12-07 18:36:34 -05:00
2021-08-31 07:06:49 -04:00
2021-09-06 16:09:24 +02:00
2021-08-31 17:53:02 +02:00
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-08-30 06:02:08 -04:00
2021-01-27 21:25:11 +03:00
2021-05-05 12:38:01 +02:00
2021-06-01 19:07:37 +01:00
2020-12-07 18:36:34 -05:00
2021-05-12 13:48:15 +05:30
2020-12-07 18:36:34 -05:00
2020-12-07 18:36:34 -05:00
2021-04-26 13:50:34 +02:00