Arthur
9cccb3a838
[Persimmon] Add support for persimmon (#26042)
* initial commit
* updates
* nits
* update conversion script
* update conversion script
* use path to load
* add tips etc
* some modeling logic
* modeling update
* more nits
* nits
* normal layer norm
* update config and doc
* nits
* update doc remove unused
* update
* fix inits and stuff
* fixup
* revert wrong changes
* updates
* more nits
* add default config values to the configuration file
* fixup happy
* update
* 2 tests left
* update readmes
* more nits
* slow test and more documentation
* update readme
* fix licences
* styling
* use fast if possible when saving tokenizer
* remove todo
* remove tokenization tests
* small last nits
* Apply suggestions from code review
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* nits to skip the timeout doctest
* fix integration test
* fix test
* update eos token
* update to allow fast tokenization
* styling
* fix codeLlama as well for the update post processor
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add more copied from statements
* update
* doc passes doctest
* remove `# final layer norm?`
* change docstring prompt
* update
* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* don't doctest the conversion script as it requires more packages
* don't init a model in the config
* oops
* fix doctest
---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-09-12 11:33:27 +02:00