Files
HuggingFace_transformer/tests/models
Sambhav Dixit 950cfb0b4f Fix PaliGemma Pad Token Masking During Training #35855 (#35859)
* change order of unmasking of tokens

* library import

* class setup

* test function

* refactor

* add commit message

* test modified

* explict initiliasation of weights + made model smaller

* removed sepete testing file

* fixup

* fixup core

* test attention mask with token types

* tests fixup

* removed PaliGemmaAttentionMaskTest class

---------

Co-authored-by: sambhavnoobcoder <indosambahv@gmail.com>
2025-02-13 10:11:44 +01:00
..
2025-02-12 12:55:46 +01:00
2025-02-11 18:17:01 +01:00
2025-01-30 10:00:11 +00:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2024-12-18 16:53:39 +01:00
2024-06-26 21:59:08 +01:00
2025-01-17 12:10:43 +00:00
2025-01-13 18:41:15 +01:00
2024-06-26 21:59:08 +01:00
2024-06-26 21:59:08 +01:00
2025-02-12 12:55:46 +01:00
2025-02-12 12:55:46 +01:00
2024-10-07 10:56:24 +02:00
2024-06-26 21:59:08 +01:00
2024-11-29 11:58:11 +00:00
2022-05-03 14:42:02 +02:00