Files
HuggingFace_transformer/tests
Thomas Wolf 827d6d6ef0 Cleanup fast tokenizers integration (#3706)
* First pass on utility classes and python tokenizers

* finishing cleanup pass

* style and quality

* Fix tests

* Updating following @mfuntowicz comment

* style and quality

* Fix Roberta

* fix batch_size/seq_length inBatchEncoding

* add alignement methods + tests

* Fix OpenAI and Transfo-XL tokenizers

* adding trim_offsets=True default for GPT2 et RoBERTa

* style and quality

* fix tests

* add_prefix_space in roberta

* bump up tokenizers to rc7

* style

* unfortunately tensorfow does like these - removing shape/seq_len for now

* Update src/transformers/tokenization_utils.py

Co-Authored-By: Stefan Schweter <stefan@schweter.it>

* Adding doc and docstrings

* making flake8 happy

Co-authored-by: Stefan Schweter <stefan@schweter.it>
2020-04-18 13:43:57 +02:00
..
2020-03-04 20:18:07 -05:00
2020-03-09 13:58:01 +00:00
2020-04-03 14:10:54 -04:00
2020-02-04 18:05:35 -05:00
2020-03-08 15:29:10 +01:00
2020-03-09 13:58:01 +00:00
2020-04-03 14:10:54 -04:00
2020-04-16 10:21:34 -04:00
💄 super
2020-01-15 18:33:50 -05:00
2020-04-09 09:09:00 -04:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
2020-03-02 15:45:25 -05:00