Ita Zaporozhets
e48e5f1f13
Support reading tiktoken tokenizer.model file (#31656)
* use existing TikTokenConverter to read tiktoken tokenizer.model file
* del test file
* create titktoken integration file
* adding tiktoken llama test
* ALTNATIVE IMPLEMENTATION: supports llama 405B
* fix one char
* remove redundant line
* small fix
* rm unused import
* flag for converting from tiktokeng
* remove unneeded file
* ruff
* remove llamatiktokenconverter, stick to general converter
* tiktoken support v2
* update test
* remove stale changes
* udpate doc
* protect import
* use is_protobuf_available
* add templateprocessor in tiktokenconverter
* reverting templateprocessor from tiktoken support
* update test
* add require_tiktoken
* dev-ci
* trigger build
* trigger build again
* dev-ci
* [build-ci-image] tiktoken
* dev-ci
* dev-ci
* dev-ci
* dev-ci
* change tiktoken file name
* feedback review
* feedback rev
* applying feedback, removing tiktoken converters
* conform test
* adding docs for review
* add doc file for review
* add doc file for review
* add doc file for review
* support loading model without config.json file
* Revert "support loading model without config.json file"
This reverts commit 2753602e51c34cef2f184eb11f36d2ad1b02babb.
* remove dev var
* updating docs
* safely import protobuf
* fix protobuf import error
* fix protobuf import error
* trying isort to fix ruff error
* fix ruff error
* try to fix ruff again
* try to fix ruff again
* try to fix ruff again
* doc table of contents
* add fix for consistency.dockerfile torchaudio
* ruff
* applying feedback
* minor typo
* merging with push-ci-image
* clean up imports
* revert dockerfile consistency
2024-09-06 14:24:02 +02:00
..
2024-08-29 11:51:09 +02:00
2024-08-30 18:22:27 +02:00
2024-09-05 15:49:28 +02:00
2024-08-14 16:14:24 +02:00
2024-09-01 12:06:31 +03:00
2024-04-08 14:21:16 +01:00
2024-05-28 18:29:22 +02:00
2024-09-06 14:24:02 +02:00
2023-09-04 11:15:12 +01:00
2024-04-24 09:38:18 +02:00
2024-04-16 15:34:04 +01:00
2024-09-04 18:19:54 +02:00
2024-09-04 18:19:54 +02:00
2024-02-08 14:13:35 -08:00
2024-02-16 08:16:58 +01:00
2024-02-16 08:16:58 +01:00
2023-06-20 18:07:47 -04:00
2024-04-01 18:47:32 -07:00
2024-09-04 17:13:34 +01:00
2024-09-06 09:35:01 +02:00
2022-04-04 10:25:46 -04:00
2024-07-23 17:47:51 +01:00
2024-06-06 22:02:38 +01:00
2024-08-26 13:15:43 +02:00
2024-02-02 08:45:00 +01:00
2024-07-08 11:52:47 +01:00
2023-06-20 18:07:47 -04:00
2023-12-20 10:37:23 -08:00
2024-08-07 16:34:46 +01:00
2024-09-05 17:42:03 +02:00
2024-07-08 11:52:47 +01:00
2023-11-13 14:20:54 +01:00
2024-09-05 14:43:20 +05:00
2024-05-29 11:55:43 +01:00
2024-09-06 10:22:00 +01:00
2024-09-06 10:22:00 +01:00
2024-09-06 10:22:00 +01:00
2024-08-22 15:30:22 +02:00
2024-08-27 09:29:05 -07:00
2024-07-30 15:49:14 +01:00
2024-03-23 18:29:39 -07:00
2024-02-16 08:16:58 +01:00
2022-04-04 10:25:46 -04:00
2023-12-08 10:32:18 -08:00
2024-05-30 16:47:35 +02:00
2024-07-08 11:52:47 +01:00
2024-02-02 08:45:00 +01:00
2024-09-05 14:43:20 +05:00
2024-07-29 10:50:43 +01:00
2024-08-14 09:36:43 -07:00
2024-02-16 08:16:58 +01:00
2024-06-18 11:00:26 -07:00
2024-07-04 13:20:49 -04:00
2024-02-16 08:16:58 +01:00
2023-06-20 18:07:47 -04:00
2023-10-31 09:44:51 -07:00
2024-02-16 08:16:58 +01:00
2023-11-06 19:45:03 +00:00
2024-08-19 09:50:35 -07:00
2024-02-16 08:16:58 +01:00
2024-02-02 08:45:00 +01:00
2024-07-09 10:38:29 +01:00
2024-06-12 11:33:00 +01:00
2024-04-29 10:57:51 +01:00
2023-11-06 19:45:03 +00:00
2024-02-16 08:16:58 +01:00
2024-04-16 11:58:55 +02:00
2024-02-26 08:18:15 -08:00
2024-08-07 11:01:33 -07:00
2024-07-29 10:50:43 +01:00
2024-02-16 08:16:58 +01:00
2024-09-06 14:24:02 +02:00
2024-06-03 16:52:23 -07:00
2024-02-16 08:16:58 +01:00
2024-08-23 13:20:49 +02:00
2024-05-14 18:45:06 +01:00
2024-02-16 08:16:58 +01:00