Lysandre Debut
|
de5ca373ac
|
Responses API in transformers serve (#39155)
* Scaffolding
* Explicit content
* Naïve Responses API streaming implementation
* Cleanup
* Responses API (to be merged into #39155) (#39338)
* Scaffolding
* Explicit content
* Naïve Responses API streaming implementation
* Cleanup
* use openai
* validate request, including detecting unused fields
* dict indexing
* dict var access
* tmp commit (tests failing)
* add slow
* use oai output type in completions
* (little rebase errors)
* working spec?
* guard type hint
* type hints. fix state (CB can now load different models)
* type hints; fn names; error type
* add docstrings
* responses + kv cache
* metadata support; fix kv cache; error event
* add output_index and content_index
* docstrings
* add test_build_response_event
* docs/comments
* gate test requirements; terminate cb manager on model switch
* nasty type hints
* more type hints
* disable validation by default; enable force models
* todo
---------
Co-authored-by: Lysandre <hi@lysand.re>
* Slight bugfixes
* PR comments from #39338
* make fixup
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
|
2025-07-16 14:16:16 +02:00 |
|
Lucain
|
bf203aa9da
|
Update tiny-agents example (#39245)
|
2025-07-07 15:58:36 +02:00 |
|
Joao Gante
|
85d93cc6e3
|
[serve] Cursor support, move docs into separate page, add more examples (#39133)
* jan docs
* rm
* [cursor] tmp commit
* Cursor working :D
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update src/transformers/commands/serving.py
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* cursor docs
* try to fix agents/tools docs?
* try to fix agents/tools docs?
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* add transformers chat example with transformers serve
---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
|
2025-07-03 17:04:16 +01:00 |
|
湛露先生
|
cc68070d41
|
fix docs serving typos. (#37936)
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
|
2025-05-06 14:32:44 +01:00 |
|
Steven Liu
|
e9756cdbc7
|
[docs] Serving LLMs (#36522)
* initial
* fix
* model-impl
|
2025-03-10 13:14:19 -07:00 |
|