peter-sk
d65b14ed67
added GPTNeoForTokenClassification (#22908)
* added GPTNeoForTokenClassification
* add to top-level init
* fixup
* test
* more fixup
* add to gpt_neo.mdx
* repo consistency
* dummy copy
* fix copies
* optax >= 0.1.5 assumes jax.Array exists - which it doesn't for jax <= 0.3.6
* merge with main made this superfluous
* added classifier_dropout
* remove legacy code
* removed fmt:on/off
* removed expected_outputs
* doc style fix
* classifier_dropout is always in config
---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
2023-04-27 12:10:03 -04:00