Files
HuggingFace_transformer/docs/source/en
Ke Wen 20142ab542 Simplify Tensor Parallel implementation with PyTorch TP (#34184)
* Simplify Tensor Parallel implementation with PyTorch TP

* Move tp_plan to config

* Lint

* Format and warning

* Disable copy-from check

* Conditionally get attr from config

* make fix-copies

* Move base_model_tp_plan to PretrainedConfig

* Move TP into from_pretrained

* Add device context for load

* Do not serialize

* Move _tp_plan setting to post_init

* Add has_tp_plan

* Add test_tp

* Add 'Multi-gpu inference' doc

* Add backward support for device type identification

* Auto-detect accelerator

* supports_tp_plan

* copyright year

* Fix copy
2024-11-18 19:51:49 +01:00
..
2024-10-25 11:46:46 +01:00
2024-11-18 14:13:26 +01:00
2022-04-04 10:25:46 -04:00
2024-07-08 11:52:47 +01:00
2023-12-20 10:37:23 -08:00
2024-10-30 16:52:17 +01:00
2024-07-08 11:52:47 +01:00
2024-11-18 10:43:10 +01:00
2024-10-02 09:12:21 +01:00
2022-04-04 10:25:46 -04:00
2024-10-31 15:59:23 -07:00
2024-10-31 15:59:23 -07:00
2024-10-22 09:46:07 -07:00
2024-11-18 18:41:12 +00:00