HuggingFace_transformer/tests/models
Sylvain Gugger b4d4d6fe87 Add RWKV-4 (#22797)
* First draft of RWKV-4

* Add support for generate

* Style post-rebase

* Properly use state

* Write doc

* Fix doc

* More math

* Add model to README, dummies and clean config

* Fix init

* multiple fixes:

- fix common tests
- fix configuration default values
- add CI test for checking state computation
- fix some CI tests

* correct tokenizer

* some tweaks

- fix config docstring
- fix failing tests

* fix CI tests

- add output_attention / output_hidden_states
- override test_initialization
- fix failing CIs

* fix conversion script

- fix sharded case
- add new arguments

* add slow tests + more fixes on conversion script

* add another test

* final fixes

* change single name variable

* add mock attention mask for pipeline to work

* correct eos token id

* fix nits

* add checkpoints

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add `tie_word_embeddings` in docstring

* change tensor name

* fix final nits

* Trigger CI

---------

Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-05-09 13:04:10 -04:00