llama-cpp-turboquant/tools/server/tests/unit
Radoslav Gerganov 2b6b55a59f
server : include usage statistics only when user request them (#16052)
* server : include usage statistics only when user request them

When serving the OpenAI compatible API, we should check if
{"stream_options": {"include_usage": true} is set in the request when
deciding whether we should send usage statistics

closes: #16048

* add unit test
2025-09-18 10:36:57 +00:00
..
test_basic.py
test_chat_completion.py server : include usage statistics only when user request them (#16052) 2025-09-18 10:36:57 +00:00
test_completion.py
test_ctx_shift.py
test_embedding.py
test_infill.py
test_lora.py
test_rerank.py
test_security.py
test_slot_save.py
test_speculative.py
test_template.py
test_tokenize.py
test_tool_call.py
test_vision_api.py