[docs] LLM inference (#29791)

* first draft

* feedback

* static cache snippet

* feedback

* feedback
Steven Liu
2024-04-22 12:41:51 -07:00
committed by GitHub
parent b4c18a830a
commit e74d793a3c
2 changed files with 328 additions and 0 deletions

@@ -141,6 +141,8 @@
 - sections:
   - local: performance
     title: Overview
+  - local: llm_optims
+    title: LLM inference optimization
   - local: quantization
     title: Quantization
   - sections: