Files
HuggingFace_transformer/tests
Joe Davison f1e8a51f08 Preserve spaces in GPT-2 tokenizers (#2778)
* Preserve spaces in GPT-2 tokenizers

Preserves spaces after special tokens in GPT-2 and inhereted (RoBERTa)
tokenizers, enabling correct BPE encoding. Automatically inserts a space
in front of first token in encode function when adding special tokens.

* Add tokenization preprocessing method

* Add framework argument to pipeline factory

Also fixes pipeline test issue. Each test input now treated as a
distinct sequence.
2020-02-13 13:29:43 -05:00
..
2020-02-04 16:38:52 -05:00
2020-02-04 18:05:35 -05:00
2020-02-04 18:05:35 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
💄 super
2020-01-15 18:33:50 -05:00
2020-01-11 03:43:57 +00:00