Pavel Iakubovskii
9167fadab9
Introduce GradientCheckpointingLayer (#37223)
* GradientCheckpointingLayer
* trigger
* Move GC layer to a separate file
* Update import
* Expose and document GC layer
* Fix dummy
* Apply to llama-based models
* Update modulars
* Update a few more models for consistency
* Update glm4
* Update Janus
2025-04-22 11:33:31 +01:00
..
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2024-12-20 12:08:12 +01:00
2023-06-20 18:07:47 -04:00
2025-04-11 11:08:36 +02:00
2025-04-18 16:45:54 +02:00
2025-04-22 11:33:31 +01:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-06-20 18:07:47 -04:00
2023-12-04 10:04:28 -08:00