
Break down main function in llama-server #13425


Open · wants to merge 1 commit into base: master
Conversation

ericcurtin (Collaborator)

The llama-server main function is getting meaty; this just breaks it down into smaller functions.
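As an illustration of the kind of decomposition being proposed, here is a minimal, hypothetical sketch of a monolithic server main() split into small single-purpose functions. None of these names (server_params, parse_cli, validate) are taken from server.cpp; they are assumptions for the example.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical sketch: names are illustrative, not llama-server's actual code.
struct server_params {
    std::string host = "127.0.0.1";
    int port = 8080;
};

// Each startup stage gets its own small function instead of living inline
// in main(). Here: CLI parsing...
static server_params parse_cli(const std::vector<std::string> &args) {
    server_params p;
    for (size_t i = 0; i + 1 < args.size(); ++i) {
        if (args[i] == "--port") p.port = std::stoi(args[i + 1]);
        if (args[i] == "--host") p.host = args[i + 1];
    }
    return p;
}

// ...and parameter validation, each testable in isolation.
static bool validate(const server_params &p) {
    return p.port > 0 && p.port < 65536 && !p.host.empty();
}
```

With that split, main() reduces to a short sequence of calls (parse, validate, start), and each piece can be unit-tested without spinning up the whole server.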

@ericcurtin ericcurtin requested a review from ngxson as a code owner May 10, 2025 12:08
@ericcurtin ericcurtin marked this pull request as draft May 10, 2025 12:08
@ericcurtin (Collaborator, Author)

Incomplete

llama-server main function is getting meaty, just breaking it
down into smaller functions.

Signed-off-by: Eric Curtin <[email protected]>
@ericcurtin ericcurtin marked this pull request as ready for review May 10, 2025 12:32
ngxson (Collaborator) commented May 10, 2025

Before going further, I think it's better to discuss a plan rather than diving into the code.

While working on #13400 (comment), I also thought about refactoring server.cpp into small components; it should be done in a way that makes it easy to route requests to multiple models on the same server instance.

For now, the simplest task is of course to abstract out the creation of the HTTP server. A second task could be to move all the HTTP handlers to a completely separate file. The main component, server_context, may also need to be moved to a dedicated file.
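The separation described above can be sketched roughly as follows. This is a hypothetical illustration, not llama.cpp's actual API: the HTTP layer only knows about paths and callbacks, while the handlers, which could live in their own file, capture whatever state they need from a server_context.

```cpp
#include <functional>
#include <map>
#include <string>

// Hypothetical sketch of the proposed split; none of these names are
// taken from server.cpp.
using handler_fn = std::function<std::string(const std::string & /*body*/)>;

// The HTTP layer: only routes and callbacks, no inference logic.
class http_server {
    std::map<std::string, handler_fn> routes;
public:
    void get(const std::string &path, handler_fn h) {
        routes[path] = std::move(h);
    }
    std::string dispatch(const std::string &path, const std::string &body) const {
        auto it = routes.find(path);
        return it == routes.end() ? "404" : it->second(body);
    }
};

// The main component's state, which could move to a dedicated file.
struct server_context {
    std::string model_name = "demo";
};

// Handler registration, separated from both the HTTP layer and main().
static void register_routes(http_server &srv, server_context &ctx) {
    srv.get("/health", [](const std::string &) { return std::string("ok"); });
    srv.get("/model", [&ctx](const std::string &) { return ctx.model_name; });
}
```

Because the router maps paths to callbacks rather than hard-coding handlers, extending it to key routes on a model name as well (for multi-model routing on one server instance) would be a localized change to dispatch rather than a rewrite of the handlers.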
