Files
HuggingFace_transformer/tests
Thomas Wolf 9aeacb58ba Adding Fast tokenizers for SentencePiece based tokenizers - Breaking: remove Transfo-XL fast tokenizer (#7141)
* [WIP] SP tokenizers

* fixing tests for T5

* WIP tokenizers

* serialization

* update T5

* WIP T5 tokenization

* slow to fast conversion script

* Refactoring to move tokenzier implementations inside transformers

* Adding gpt - refactoring - quality

* WIP adding several tokenizers to the fast world

* WIP Roberta - moving implementations

* update to dev4 switch file loading to in-memory loading

* Updating and fixing

* advancing on the tokenizers - updating do_lower_case

* style and quality

* moving forward with tokenizers conversion and tests

* MBart, T5

* dumping the fast version of transformer XL

* Adding to autotokenizers + style/quality

* update init and space_between_special_tokens

* style and quality

* bump up tokenizers version

* add protobuf

* fix pickle Bert JP with Mecab

* fix newly added tokenizers

* style and quality

* fix bert japanese

* fix funnel

* limite tokenizer warning to one occurence

* clean up file

* fix new tokenizers

* fast tokenizers deep tests

* WIP adding all the special fast tests on the new fast tokenizers

* quick fix

* adding more fast tokenizers in the fast tests

* all tokenizers in fast version tested

* Adding BertGenerationFast

* bump up setup.py for CI

* remove BertGenerationFast (too early)

* bump up tokenizers version

* Clean old docstrings

* Typo

* Update following Lysandre comments

Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
2020-10-08 11:32:16 +02:00
..
2020-08-24 11:03:01 -04:00
2020-08-26 17:20:22 +02:00
2020-10-07 19:09:23 -04:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-10-07 19:09:23 -04:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-09-04 16:08:54 +02:00
2020-10-07 19:09:23 -04:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-26 17:20:22 +02:00
2020-08-17 09:04:35 -04:00
2020-08-27 18:31:51 +02:00
2020-09-22 18:29:58 +02:00