[docs] LLM inference (#29791)
* first draft * feedback * static cache snippet * feedback * feedback
This commit is contained in:
@@ -141,6 +141,8 @@
|
||||
- sections:
|
||||
- local: performance
|
||||
title: Overview
|
||||
- local: llm_optims
|
||||
title: LLM inference optimization
|
||||
- local: quantization
|
||||
title: Quantization
|
||||
- sections:
|
||||
|
||||
Reference in New Issue
Block a user