Ke Wen
20142ab542
Simplify Tensor Parallel implementation with PyTorch TP (#34184)
* Simplify Tensor Parallel implementation with PyTorch TP
* Move tp_plan to config
* Lint
* Format and warning
* Disable copy-from check
* Conditionally get attr from config
* make fix-copies
* Move base_model_tp_plan to PretrainedConfig
* Move TP into from_pretrained
* Add device context for load
* Do not serialize
* Move _tp_plan setting to post_init
* Add has_tp_plan
* Add test_tp
* Add 'Multi-gpu inference' doc
* Add backward support for device type identification
* Auto-detect accelerator
* supports_tp_plan
* copyright year
* Fix copy
2024-11-18 19:51:49 +01:00
..
2024-11-11 20:52:09 +01:00
2022-02-23 15:46:28 -05:00
2023-10-09 11:04:57 +02:00
2024-10-02 14:08:46 +01:00
2024-09-19 19:28:04 +01:00
2024-03-19 14:43:02 +00:00
2024-11-15 22:28:06 +01:00
2024-11-05 16:10:42 +01:00
2024-11-18 13:21:07 +01:00
2024-07-11 12:11:50 +01:00
2024-10-24 17:56:40 +02:00
2024-10-31 15:48:11 -04:00
2024-11-05 18:26:20 +01:00
2024-08-30 18:17:25 +02:00
2024-10-02 14:08:46 +01:00
2024-11-04 16:37:51 +01:00
2024-11-18 19:51:49 +01:00
2024-11-15 23:00:03 +01:00
2024-11-15 23:07:24 +01:00
2023-12-20 18:33:17 +00:00
2024-11-05 11:34:01 +01:00
2023-06-15 07:30:24 -04:00
2024-10-21 09:05:05 -04:00
2024-05-21 13:56:52 +01:00
2024-11-18 15:45:40 +01:00
2024-05-16 10:56:11 +01:00
2024-10-05 16:20:50 +02:00
2024-10-31 15:48:11 -04:00
2024-10-02 10:41:08 +02:00
2023-09-05 10:12:25 +02:00
2024-11-15 14:27:04 +00:00