- Introduce --endpoint-exit flag and LLAMA_ARG_ENDPOINT_EXIT env var
- Add endpoint_exit to common_params (disabled by default)
- Implement POST /exit with explicit confirmation token to prevent misuse
- Support graceful shutdown via injected on_shutdown callback
- Handle both router and non-router server shutdown paths
* git mv
* add server-context.h
* add server-context.h
* clean up headers
* cont : cleanup
* also expose server_response_reader (to be used by CLI)
* fix windows build
* decouple server_routes and server_http
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>