Matthijs Hollemans
4ece3b9433
add VITS model (#24085)
* add VITS model
* let's vits
* finish TextEncoder (mostly)
* rename VITS to Vits
* add StochasticDurationPredictor
* ads flow model
* add generator
* correctly set vocab size
* add tokenizer
* remove processor & feature extractor
* add PosteriorEncoder
* add missing weights to SDP
* also convert LJSpeech and VCTK checkpoints
* add training stuff in forward
* add placeholder tests for tokenizer
* add placeholder tests for model
* starting cleanup
* let the great renaming begin!
* use config
* global_conditioning
* more cleaning
* renaming variables
* more renaming
* more renaming
* it never ends
* reticulating the splines
* more renaming
* HiFi-GAN
* doc strings for main model
* fixup
* fix-copies
* don't make it a PreTrainedModel
* fixup
* rename config options
* remove training logic from forward pass
* simplify relative position
* use actual checkpoint
* style
* PR review fixes
* more review changes
* fixup
* more unit tests
* fixup
* fix doc test
* add integration test
* improve tokenizer tests
* add tokenizer integration test
* fix tests on GPU (gave OOM)
* conversion script can handle repos from hub
* add conversion script for all MMS-TTS checkpoints
* automatically create a README for the converted checkpoint
* small changes to config
* push README to hub
* only show uroman note for checkpoints that need it
* remove conversion script because code formatting breaks the readme
* make WaveNet layers configurable
* rename variables
* simplifying the math
* output attentions and hidden states
* remove VitsFlip in flow model
* also got rid of the other flip
* fix tests
* rename more variables
* rename tokenizer, add phonemization
* raise error when phonemizer missing
* re-order config docstrings to match method
* change config naming
* remove redundant str -> list
* fix copyright: vits authors -> kakao enterprise
* (mean, log_variances) -> (prior_mean, prior_log_variances)
* if return dict -> if not return dict
* speed -> speaking rate
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* update fused tanh sigmoid
* reduce dims in tester
* audio -> output_values
* audio -> output_values in tuple out
* fix return type
* fix return type
* make _unconstrained_rational_quadratic_spline a function
* all nn's to accept a config
* add spectro to output
* move {speaking rate, noise scale, noise scale duration} to config
* path -> attn_path
* idxs -> valid idxs -> padded idxs
* output values -> waveform
* use config for attention
* make generation work
* harden integration test
* add spectrogram to dict output
* tokenizer refactor
* make style
* remove 'fake' padding token
* harden tokenizer tests
* ron norm test
* fprop / save tests deterministic
* move uroman to tokenizer as much as possible
* better logger message
* fix vivit imports
* add uroman integration test
* make style
* up
* matthijs -> sanchit-gandhi
* fix tokenizer test
* make fix-copies
* fix dict comprehension
* fix config tests
* fix model tests
* make outputs consistent with reverse/not reverse
* fix key concat
* more model details
* add author
* return dict
* speaker error
* labels error
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/models/vits/convert_original_checkpoint.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* remove uromanize
* add docstrings
* add docstrings for tokenizer
* upper-case skip messages
* fix return dict
* style
* finish tests
* update checkpoints
* make style
* remove doctest file
* revert
* fix docstring
* fix tokenizer
* remove uroman integration test
* add sampling rate
* fix docs / docstrings
* style
* add sr to model output
* fix outputs
* style / copies
* fix docstring
* fix copies
* remove sr from model outputs
* Update utils/documentation_tests.txt
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add sr as allowed attr
---------
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-09-01 10:50:06 +01:00
..
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-19 11:15:38 +02:00
2023-08-02 20:22:36 +02:00
2023-06-06 14:31:14 -04:00
2023-06-26 18:36:47 +02:00
2023-08-09 18:28:02 +02:00
2023-08-02 20:22:36 +02:00
2023-02-06 18:10:56 -05:00
2023-02-06 18:10:56 -05:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-02-06 18:10:56 -05:00
2023-02-06 18:10:56 -05:00
2023-06-22 16:11:27 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-06-29 10:17:36 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-24 11:15:29 +02:00
2023-08-11 11:30:18 +01:00
2023-02-06 18:10:56 -05:00
2023-05-24 13:52:52 +01:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-25 10:58:14 +02:00
2023-08-08 10:48:45 +02:00
2023-08-25 18:57:40 +02:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-05-16 23:35:11 +02:00
2023-03-22 20:02:24 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-07-24 10:08:47 -04:00
2023-08-08 10:48:45 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-02-28 19:40:57 +01:00
2023-08-11 11:30:18 +01:00
2023-08-11 11:30:18 +01:00
2023-08-11 11:30:18 +01:00
2023-08-29 09:09:45 +01:00
2023-06-29 10:17:36 +01:00
2023-08-29 11:05:27 +01:00
2023-08-02 20:22:36 +02:00
2023-06-29 10:17:36 +01:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-08-11 11:30:18 +01:00
2023-08-14 17:08:47 +01:00
2023-08-02 20:22:36 +02:00
2023-07-24 10:08:47 -04:00
2023-06-06 18:30:51 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-08 10:48:45 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-05-18 17:29:04 +02:00
2023-04-06 17:56:06 +02:00
2023-06-16 15:40:49 +01:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-25 17:36:37 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2022-12-12 13:12:13 -05:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-07-24 10:08:47 -04:00
2023-08-02 20:22:36 +02:00
2023-02-06 18:10:56 -05:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-24 08:33:14 -07:00
2023-08-16 17:45:02 +01:00
2023-06-22 16:11:27 +02:00
2023-08-31 17:01:27 +02:00
2023-08-18 12:40:40 +02:00
2023-08-02 20:22:36 +02:00
2023-08-18 13:26:27 +02:00
2023-08-18 13:26:27 +02:00
2023-08-18 13:26:27 +02:00
2023-06-30 16:30:33 +01:00
2023-08-11 11:30:18 +01:00
2023-06-22 16:11:27 +02:00
2023-08-29 15:08:14 +02:00
2023-08-07 17:45:41 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-06-16 15:40:49 +01:00
2023-04-06 17:56:06 +02:00
2023-08-16 11:49:29 +01:00
2023-08-18 13:26:27 +02:00
2023-08-11 11:30:18 +01:00
2023-08-16 18:11:24 +02:00
2023-08-02 20:22:36 +02:00
2022-07-29 08:09:09 -04:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2022-05-03 14:42:02 +02:00
2023-08-11 11:30:18 +01:00
2022-05-12 16:25:55 -04:00
2023-08-19 11:15:38 +02:00
2023-08-11 11:30:18 +01:00
2023-08-11 11:30:18 +01:00
2023-08-11 11:30:18 +01:00
2023-07-24 10:08:47 -04:00
2023-08-02 20:22:36 +02:00
2023-08-21 11:11:21 +02:00
2023-08-02 20:22:36 +02:00
2023-05-24 13:52:52 +01:00
2023-07-28 18:50:15 +01:00
2023-08-02 20:22:36 +02:00
2023-06-29 10:17:36 +01:00
2023-08-02 20:22:36 +02:00
2023-04-04 14:53:06 +02:00
2023-08-17 17:21:56 +02:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2023-08-18 12:40:40 +02:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-07-25 07:56:04 -04:00
2023-02-06 18:10:56 -05:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-08-29 18:00:40 +01:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2023-06-16 15:40:49 +01:00
2023-08-02 20:22:36 +02:00
2023-05-18 11:04:51 +01:00
2023-06-29 10:17:36 +01:00
2023-08-02 20:22:36 +02:00
2023-06-29 10:17:36 +01:00
2023-08-08 10:48:45 +02:00
2023-08-08 10:48:45 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-06-22 16:11:27 +02:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-02-06 18:10:56 -05:00
2023-06-29 16:05:24 +02:00
2023-08-02 20:22:36 +02:00
2023-08-22 08:12:57 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-07-25 07:56:04 -04:00
2023-06-29 10:17:36 +01:00
2023-08-16 17:45:02 +01:00
2023-06-29 10:17:36 +01:00
2023-08-11 13:16:01 +01:00
2023-08-29 15:08:14 +02:00
2023-07-25 07:56:04 -04:00
2023-08-02 20:22:36 +02:00
2023-06-22 16:11:27 +02:00
2023-08-02 20:22:36 +02:00
2023-07-25 07:56:04 -04:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-16 17:45:02 +01:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-07-25 07:56:04 -04:00
2023-08-16 17:45:02 +01:00
2023-08-11 11:30:18 +01:00
2023-06-29 10:17:36 +01:00
2023-07-05 13:44:30 +02:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-08-16 17:02:57 +02:00
2023-08-02 20:22:36 +02:00
2023-08-02 20:22:36 +02:00
2023-08-29 10:03:52 +01:00
2023-09-01 10:50:06 +01:00
2023-08-16 17:45:02 +01:00
2023-08-11 15:26:45 +01:00
2023-08-02 20:22:36 +02:00
2022-05-03 14:42:02 +02:00
2023-04-17 12:41:55 +02:00
2023-08-02 20:22:36 +02:00
2023-08-17 15:30:46 +02:00
2023-08-02 20:22:36 +02:00
2023-08-18 12:40:40 +02:00
2023-08-02 20:22:36 +02:00
2023-03-06 09:15:44 +01:00
2023-05-24 13:52:52 +01:00
2023-08-02 20:22:36 +02:00
2023-08-17 18:56:34 +02:00
2023-08-02 20:22:36 +02:00
2023-08-11 11:30:18 +01:00
2023-08-02 20:22:36 +02:00
2022-05-03 14:42:02 +02:00