Matthijs Hollemans
7f91950901
audio_utils improvements (#21998)
* silly change to allow making a PR
* clean up doc comments
* simplify hertz_to_mel and mel_to_hertz
* fixup
* clean up power_to_db
* also add amplitude_to_db
* move functions
* clean up mel_filter_bank
* fixup
* credit librosa & torchaudio authors
* add unit tests
* tests for power_to_db and amplitude_to_db
* add mel_filter_bank tests
* rewrite STFT
* add convenience spectrogram function
* missing transpose
* fewer transposes
* add integration test to M-CTC-T
* frame length can be either window or FFT length
* rewrite stft API
* add preemphasis coefficient
* move argument
* add log option to spectrogram
* replace M-CTC-T feature extractor
* fix api thing
* replace whisper STFT
* replace whisper mel filters
* replace tvlt's stft
* allow alternate window names
* replace speecht5 stft
* fixup
* fix integration tests
* fix doc comments
* remove manual FFT length calculation
* fix docs
* go away, deprecation warnings
* combine everything into spectrogram function
* add deprecated functions back
* fixup
2023-05-09 09:10:17 -04:00