Files
HuggingFace_transformer/tests/models
Orion Weller 0e4b7938d0
Some checks failed
Release - Conda / build_and_package (push) Has been cancelled
Secret Leaks / trufflehog (push) Has been cancelled
Add ModernBERT Decoder Models - ModernBERT, but trained with CLM! (#38967)
* working locally; need to style and test

* added docs and initial tests; need to debug and flesh out

* fixed tests

* working long context; batches

* working fa2 and eager

* update tests

* add missing confnigs

* remove default autoset

* fix spacing

* fix most tests

* fixed tests

* fix to init

* refactor to match new transformers updates

* remove static cache option

* fa2 fix

* fix docs

* in progress

* working on tests

* fixed issue with attn outputs

* remove debug

* fix local config attr

* update doc string

* fix docstring

* add docs to toc

* correct typo in toc

* add new updates from main w.r.t. ModernBERT RoPE

* fix local param

---------

Co-authored-by: oweller2 <oweller2@dsailogin.mgmt.ai.cluster>
Co-authored-by: oweller2 <oweller2@l07.mgmt.ai.cluster>
Co-authored-by: oweller2 <oweller2@n02.mgmt.ai.cluster>
Co-authored-by: oweller2 <oweller2@l08.mgmt.ai.cluster>
Co-authored-by: oweller2 <oweller2@l01.mgmt.ai.cluster>
Co-authored-by: oweller2 <oweller2@l02.mgmt.ai.cluster>
2025-07-15 10:40:41 +02:00
..
2025-06-11 17:28:06 +01:00
2025-06-24 15:05:29 +02:00
2025-06-26 11:04:23 +00:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-05-16 13:26:54 +02:00
2025-04-28 15:08:46 +02:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-07-08 11:44:29 +02:00
2025-06-27 16:54:11 +02:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-07-02 12:25:26 +01:00
2025-06-11 17:28:06 +01:00
2025-05-23 18:29:31 +01:00
2025-06-11 17:28:06 +01:00
2025-06-19 10:56:34 +02:00
2025-06-11 17:28:06 +01:00
2025-04-08 17:15:37 +01:00
2025-06-11 17:28:06 +01:00
2025-05-23 18:29:31 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-05-23 18:29:31 +01:00
2025-06-26 20:07:17 +02:00
2025-06-11 17:28:06 +01:00
2025-07-14 12:02:59 +02:00
2025-07-01 23:27:22 +02:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-24 18:01:15 +02:00
2025-04-15 11:33:09 +01:00
2025-06-12 16:15:35 +02:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-04-08 17:15:37 +01:00
2025-06-13 11:13:32 +02:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-07-10 05:18:44 +00:00
2025-07-10 05:18:44 +00:00
2025-06-11 17:28:06 +01:00
2025-07-11 11:07:32 +02:00
2025-05-23 18:29:31 +01:00
2025-05-23 18:29:31 +01:00
2025-07-10 11:51:55 +02:00
2025-05-28 16:44:20 +01:00
2025-05-28 16:44:20 +01:00
2025-06-18 14:36:03 +02:00
2025-06-23 17:42:46 +02:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-25 15:12:15 +00:00
2025-05-23 18:29:31 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-04-08 17:15:37 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00
2025-06-11 17:28:06 +01:00