Pavel Iakubovskii
9167fadab9
Introduce GradientCheckpointingLayer (#37223)
* GradientCheckpointingLayer
* trigger
* Move GC layer to a separate file
* Update import
* Expose and document GC layer
* Fix dummy
* Apply to llama-based models
* Update modulars
* Update a few more models for consistency
* Update glm4
* Update Janus
2025-04-22 11:33:31 +01:00