Matthijs Hollemans
cd927a4736
add word-level timestamps to Whisper ( #23205 )
...
* let's go!
* initial implementation of token-level timestamps
* only return a single timestamp per token
* remove token probabilities
* fix return type
* fix doc comment
* strip special tokens
* rename
* revert to not stripping special tokens
* only support models that have alignment_heads
* add integration test
* consistently name it token-level timestamps
* small DTW tweak
* initial support for ASR pipeline
* fix pipeline doc comments
* resolve token timestamps in pipeline with chunking
* change warning when no final timestamp is found
* return word-level timestamps
* fixup
* fix bug that skipped final word in each chunk
* fix failing unit tests
* merge punctuations into the words
* also return word tokens
* also return token indices
* add (failing) unit test for combine_tokens_into_words
* make combine_tokens_into_words private
* restore OpenAI's punctuation rules
* add pipeline tests
* make requested changes
* PR review changes
* fix failing pipeline test
* small stuff from PR
* only return words and their timestamps, not segments
* move alignment_heads into generation config
* forgot to set alignment_heads in pipeline tests
* tiny comment fix
* grr
2023-06-21 17:48:21 +02:00
..
2023-06-13 17:51:37 +01:00
2023-06-21 17:04:59 +02:00
2023-06-21 17:04:59 +02:00
2023-05-23 14:37:35 -04:00
2023-06-06 14:31:14 -04:00
2023-06-21 17:04:59 +02:00
2023-06-16 15:40:49 +01:00
2023-02-06 18:10:56 -05:00
2023-02-06 18:10:56 -05:00
2023-06-21 17:04:59 +02:00
2023-06-13 17:51:37 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-02-06 18:10:56 -05:00
2023-06-21 17:04:59 +02:00
2023-05-18 11:04:51 +01:00
2023-05-18 17:29:04 +02:00
2023-04-06 13:50:15 +01:00
2023-06-16 15:40:49 +01:00
2023-06-16 15:40:49 +01:00
2023-06-21 17:04:59 +02:00
2023-03-13 15:03:21 +01:00
2023-06-16 16:38:23 +02:00
2023-05-24 13:52:52 +01:00
2023-06-16 16:38:23 +02:00
2023-02-06 18:10:56 -05:00
2023-05-24 13:52:52 +01:00
2023-06-21 17:04:59 +02:00
2023-06-21 17:04:59 +02:00
2023-05-23 14:37:35 -04:00
2023-06-21 17:04:59 +02:00
2023-06-21 17:04:59 +02:00
2023-04-24 15:27:31 +02:00
2023-03-07 14:23:36 +01:00
2023-05-24 13:52:52 +01:00
2023-06-16 15:40:49 +01:00
2023-05-16 23:35:11 +02:00
2023-03-22 20:02:24 +01:00
2023-05-18 17:29:04 +02:00
2023-06-14 14:39:02 +01:00
2023-06-16 15:40:49 +01:00
2023-06-21 17:04:59 +02:00
2023-05-24 13:52:52 +01:00
2023-05-24 13:52:52 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-05-24 13:52:52 +01:00
2023-04-13 21:32:29 +02:00
2023-03-07 15:19:39 -05:00
2023-04-06 13:50:15 +01:00
2023-05-24 13:52:52 +01:00
2022-05-03 14:42:02 +02:00
2023-02-28 19:40:57 +01:00
2023-05-24 13:52:52 +01:00
2023-06-21 17:04:59 +02:00
2023-06-16 14:43:43 +01:00
2023-04-06 22:52:59 +02:00
2023-05-24 13:52:52 +01:00
2023-06-20 14:43:10 +02:00
2023-06-06 18:30:51 +01:00
2023-02-28 19:40:57 +01:00
2023-03-23 19:14:17 +01:00
2023-05-24 13:52:52 +01:00
2023-05-24 13:52:52 +01:00
2023-06-21 17:04:59 +02:00
2023-06-21 17:04:59 +02:00
2023-05-18 17:29:04 +02:00
2023-04-06 17:56:06 +02:00
2023-06-16 15:40:49 +01:00
2023-06-20 18:07:34 +02:00
2023-02-28 19:40:57 +01:00
2023-06-21 17:04:59 +02:00
2023-06-13 19:19:40 +02:00
2023-05-18 17:29:04 +02:00
2023-06-20 19:19:19 +02:00
2023-02-28 19:40:57 +01:00
2022-12-12 13:12:13 -05:00
2023-06-13 20:33:26 +02:00
2023-04-06 22:52:59 +02:00
2023-06-21 17:04:59 +02:00
2023-06-16 15:40:49 +01:00
2023-02-06 18:10:56 -05:00
2023-05-24 13:52:52 +01:00
2023-02-28 19:40:57 +01:00
2023-06-21 17:04:59 +02:00
2023-06-21 17:04:59 +02:00
2023-03-07 04:20:14 +01:00
2023-06-21 17:04:59 +02:00
2023-06-20 14:43:10 +02:00
2023-06-20 14:43:10 +02:00
2023-02-22 09:14:54 +01:00
2023-06-16 15:40:49 +01:00
2023-06-16 16:38:23 +02:00
2023-06-21 17:04:59 +02:00
2023-05-30 16:50:41 +02:00
2023-06-20 12:59:21 +01:00
2023-04-06 17:56:06 +02:00
2023-06-21 17:04:59 +02:00
2023-06-16 15:40:49 +01:00
2023-04-06 17:56:06 +02:00
2023-06-21 17:04:59 +02:00
2023-04-11 10:00:34 +02:00
2023-03-22 20:45:08 -04:00
2023-04-06 13:50:15 +01:00
2023-06-16 15:40:49 +01:00
2022-07-29 08:09:09 -04:00
2023-05-23 14:37:35 -04:00
2023-04-07 17:13:04 +02:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00
2023-04-06 17:56:06 +02:00
2022-05-12 16:25:55 -04:00
2023-06-16 15:40:49 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-06-16 14:43:43 +01:00
2023-06-05 18:12:45 +02:00
2023-05-24 13:52:52 +01:00
2023-05-24 13:52:52 +01:00
2023-05-18 11:04:51 +01:00
2023-04-06 13:50:15 +01:00
2023-02-28 19:40:57 +01:00
2023-04-04 14:53:06 +02:00
2023-04-21 17:09:40 +01:00
2023-02-28 19:40:57 +01:00
2023-03-23 19:14:17 +01:00
2023-04-28 11:01:32 -04:00
2023-06-13 17:51:37 +01:00
2023-06-16 15:40:49 +01:00
2023-06-21 17:04:59 +02:00
2023-06-21 17:04:59 +02:00
2023-04-06 17:56:06 +02:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-06-21 17:04:59 +02:00
2023-05-18 11:04:51 +01:00
2023-02-28 19:40:57 +01:00
2023-05-18 11:04:51 +01:00
2023-02-28 19:40:57 +01:00
2023-06-16 15:40:49 +01:00
2023-05-31 14:59:30 +01:00
2023-05-18 11:04:51 +01:00
2023-06-21 17:04:59 +02:00
2023-05-24 13:52:52 +01:00
2023-05-24 13:52:52 +01:00
2023-02-06 18:10:56 -05:00
2023-05-24 13:52:52 +01:00
2023-05-24 13:52:52 +01:00
2023-03-14 10:03:02 +01:00
2023-06-21 17:04:59 +02:00
2023-06-12 18:14:15 +02:00
2023-06-21 17:04:59 +02:00
2023-06-16 15:40:49 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-06 18:10:56 -05:00
2023-06-21 17:04:59 +02:00
2023-02-28 19:40:57 +01:00
2023-05-23 14:37:35 -04:00
2023-04-06 17:45:55 +02:00
2023-02-28 19:40:57 +01:00
2023-05-18 17:29:04 +02:00
2023-06-16 15:40:49 +01:00
2023-03-07 14:23:36 +01:00
2023-03-07 14:23:36 +01:00
2023-06-21 17:04:59 +02:00
2023-06-16 15:40:49 +01:00
2023-02-28 19:40:57 +01:00
2023-06-20 18:35:45 +02:00
2023-02-06 18:10:56 -05:00
2023-06-21 17:04:59 +02:00
2023-02-28 19:40:57 +01:00
2023-06-20 14:43:10 +02:00
2023-03-21 19:22:01 +01:00
2023-05-24 13:52:52 +01:00
2023-05-18 11:04:51 +01:00
2023-06-16 16:38:23 +02:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-02-28 19:40:57 +01:00
2023-06-21 17:04:59 +02:00
2023-02-28 19:40:57 +01:00
2023-06-21 17:04:59 +02:00
2023-06-06 18:30:51 +01:00
2023-05-24 13:52:52 +01:00
2023-06-21 17:04:59 +02:00
2023-05-24 13:52:52 +01:00
2023-02-28 19:40:57 +01:00
2023-06-21 17:04:59 +02:00
2023-02-28 19:40:57 +01:00
2023-06-20 19:39:52 +02:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00
2023-04-17 12:41:55 +02:00
2023-02-28 19:40:57 +01:00
2023-06-21 17:48:21 +02:00
2023-06-21 17:04:59 +02:00
2023-06-13 19:04:22 +01:00
2023-05-24 13:52:52 +01:00
2023-03-06 09:15:44 +01:00
2023-05-24 13:52:52 +01:00
2023-03-29 16:16:23 +02:00
2023-06-20 12:59:21 +01:00
2023-03-29 16:16:23 +02:00
2023-03-07 07:34:04 -05:00
2023-02-28 19:40:57 +01:00
2022-05-03 14:42:02 +02:00