Matthijs Hollemans
0c3fdccf2f
[WIP] add EnCodec model (#23655)
* boilerplate stuff
* messing around with the feature extractor
* fix feature extractor
* unit tests for feature extractor
* rename speech to audio
* quick-and-dirty import of Meta's code
* import weights (sort of)
* cleaning up
* more cleaning up
* move encoder/decoder args into config
* cleanup model
* rename EnCodec -> Encodec
* RVQ parameters in config
* add slow test
* add lstm init and test_init
* Add save & load
* finish EncodecModel
* remove decoder_input_values as they are ont used anywhere (not removed from doc yet)
* fix test feature extraction model name
* Add better slow test
* Fix tests
* some fixup and cleaning
* Improve further
* cleaning up quantizer
* fix up conversion script
* test don't pass, _encode_fram does not work
* update tests with output per encode and decode
* more cleanup
* rename _codebook
* remove old config cruft
* ratios & hop_length
* use ModuleList instead of Sequential
* clean up resnet block
* update types
* update tests
* fixup
* quick cleanup
* fix padding
* more styl,ing
* add patrick feedback
* fix copies
* fixup
* fix lstm
* fix shape issues
* fixup
* rename conv layers
* fixup
* fix decoding
* small conv refactoring
* remove norm_params
* simplify conv layers
* rename conv layers
* stuff
* Clean up
* Add padding logic
use padding mask
small conv refactoring
remove norm_params
simplify conv layers
rename conv layers
stuff
add batched test
update
Clean up
merge and update for padding
fix padding
fixup
* clean up more
* clean up more
* More clean ups
* cleanup convolutions
* typo
* fix typos
* fixup
* build PR doc?
* start refactoring docstring
* fix don't pad when no strid and chunk
* update docstring
* update docstring
* nits
* update going to lunch
* update config and model
* fix broken testse (becaue of the config changes)
* fix scale computation
* fixu[
* only return dict if speciefied or if config returns it
* remove todos
* update defaults in config
* update conversion script
* fix doctest
* more docstring + fixup
* nits on batched_tests
* more nits
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* update basxed on review
* fix update
* updaet tests
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fixup
* add overlap and chunl_length_s
* cleanup feature extraction
* teste edge cases truncation and padding
* correct processor values
* update config encodec, nits
* fix tests
* fixup
* fix 24Hz test
* elle tests are green
* fix fixup
* Apply suggestions from code review
* revert readme changes
* fixup
* add example
* use facebook checkpoints
* fix typo
* no pipeline tests
* use slef.pad everywhere we can
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* update based on review
* update
* update mdx
* fix bug and tests
* fixup
* fix doctest
* remove comment
* more nits
* add more coverage for `test_truncation_and_padding`
* fixup
* add last test
* fix text
* nits
* Update tests/models/encodec/test_modeling_encodec.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* take care of the last comments
* typo
* fix test
* nits
* fixup
* Update src/transformers/models/encodec/feature_extraction_encodec.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: arthur.zucker@gmail.com <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-06-14 18:57:23 +02:00
..
2023-05-18 14:46:40 +01:00
2023-06-14 00:31:06 +05:30
2023-06-14 18:57:23 +02:00
2023-06-14 11:23:55 +01:00
2022-04-04 10:25:46 -04:00
2023-06-14 18:57:23 +02:00
2022-08-08 10:53:52 -05:00
2023-05-08 14:56:42 -04:00
2022-12-12 09:23:23 -05:00
2023-04-11 08:41:53 -04:00
2023-02-15 10:35:14 -08:00
2022-11-30 14:50:55 +00:00
2022-04-04 10:25:46 -04:00
2023-03-08 07:54:30 -05:00
2022-11-07 09:19:04 -05:00
2023-02-27 08:36:36 +01:00
2022-04-04 10:25:46 -04:00
2023-01-25 11:33:39 -05:00
2022-11-07 13:57:39 -05:00
2023-05-24 16:09:13 -04:00
2022-12-12 10:38:56 -05:00
2022-04-04 10:25:46 -04:00
2023-05-16 18:54:34 +01:00
2023-03-13 19:09:37 -04:00
2022-10-11 22:54:19 -04:00
2023-06-14 18:57:23 +02:00
2023-03-09 07:44:58 -05:00
2023-02-06 09:26:22 -05:00
2023-02-15 10:35:14 -08:00
2022-04-04 10:25:46 -04:00
2022-04-04 10:25:46 -04:00
2023-03-20 14:18:55 -04:00
2022-05-16 13:36:41 +02:00
2022-12-01 16:52:13 +01:00
2022-12-01 16:52:13 +01:00
2023-06-01 13:21:22 -04:00
2022-06-23 14:51:54 +02:00
2023-05-01 09:44:47 -04:00
2023-02-06 09:20:43 -05:00
2023-05-04 09:56:45 -04:00
2023-04-27 11:03:42 +02:00
2022-11-30 10:41:18 -05:00
2023-02-07 15:19:19 +00:00
2022-11-30 10:41:18 -05:00
2022-11-30 10:41:18 -05:00
2023-03-28 09:09:17 -04:00
2022-11-30 14:50:55 +00:00
2023-04-12 15:20:25 +01:00
2023-02-06 18:10:56 -05:00
2023-04-11 08:41:53 -04:00
2023-05-04 09:56:28 -04:00
2023-06-02 15:28:52 +01:00
2022-08-06 09:38:55 +02:00
2022-04-04 10:25:46 -04:00
2023-05-24 08:13:23 -04:00
2023-02-13 10:09:00 -05:00
2023-02-02 11:41:27 -08:00
2023-05-10 22:00:48 +02:00
2023-01-18 17:53:05 +00:00
2023-05-24 08:13:23 -04:00
2022-11-15 01:18:13 +01:00
2023-05-02 09:27:59 -04:00
2023-06-02 15:28:52 +01:00
2023-05-11 14:26:19 -04:00
2023-03-01 08:26:25 -05:00