[Feature Request]: Autogen online? #3546

summersonnn · 2024-09-19T09:39:09Z

Hello,

I’ve been using vLLM to successfully run my local language model. Currently, I’m using AutoGen to connect to my vLLM server, define tools, run them, and everything works smoothly in offline mode, including calling the appropriate functions.

With plain vLLM, serving was straightforward: I just ran the vllm serve command. With AutoGen, however, my setup is just a Python script, so I can’t serve it the same way. I now have to build an additional layer on top of the vLLM server, using FastAPI or something similar, to expose the agents over HTTP. That is a lot of extra work, and handling load balancing across that layer will also be a pain.
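For reference, the kind of extra layer described above might look roughly like the sketch below. It uses only the Python standard library rather than FastAPI to stay self-contained, and it is not an AutoGen feature. The `run_agent` function, the `/chat` path, the `message` and `reply` field names, and port 8001 are all hypothetical placeholders; `run_agent` stands in for whatever AutoGen chat call actually talks to the vLLM server.

```python
import json
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


def run_agent(message):
    """Hypothetical placeholder: swap in the AutoGen chat call that uses the vLLM server."""
    return f"echo: {message}"


def handle_chat(raw_body, agent=run_agent):
    """Validate a JSON request body and return (HTTP status, JSON-serializable reply)."""
    try:
        payload = json.loads(raw_body or b"{}")
    except ValueError:  # covers malformed JSON and undecodable bytes
        return 400, {"error": "body must be valid JSON"}
    message = payload.get("message") if isinstance(payload, dict) else None
    if not isinstance(message, str) or not message:
        return 400, {"error": "'message' must be a non-empty string"}
    return 200, {"reply": agent(message)}


class ChatHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Only the (assumed) /chat route is served.
        if self.path != "/chat":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        status, body = handle_chat(self.rfile.read(length))
        data = json.dumps(body).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)


def serve(host="127.0.0.1", port=8001):
    """Start a threaded HTTP server; blocks until interrupted."""
    ThreadingHTTPServer((host, port), ChatHandler).serve_forever()
```

Calling `serve()` would accept POST requests at `/chat` with a body such as `{"message": "hello"}`. Load balancing, authentication, and streaming are left out, which is exactly the part that makes a hand-rolled layer tedious.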

In short, is there a way to use AutoGen in an online setting? Perhaps a smart method to serve it, or a way to connect it back to the vLLM endpoint and continue interacting as before?

I’m curious to know what the standard approach is.

Many thanks.

Describe the solution you'd like

Maybe an AutoGen server?

summersonnn added the enhancement New feature or request label Sep 19, 2024
