
batch processing/parallel processing #585

Open
oldcpple opened this issue Jun 17, 2024 · 1 comment
Hi there, does Petals currently support batch processing/parallel processing? For example, to increase resource usage or system throughput, we would like servers to process multiple prompts at the same time, i.e., batch processing. Is this possible?
Thanks a lot.

justheuristic (Collaborator) commented Jul 11, 2024

Hi! Both forward/backward and autoregressive inference can run with any batch size, provided that you have enough memory for that.

In our training examples, we use batched training; e.g., this one https://github.com/bigscience-workshop/petals/blob/main/examples/prompt-tuning-sst2.ipynb uses a batch size of 32.
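To illustrate the mechanics behind batched inference, here is a minimal, dependency-free sketch. The `pad_batch` and `batched_forward` names and the toy "model" are stand-ins invented for this example, not part of the Petals API; the point is only the general pattern of padding variable-length prompts into one rectangular batch and making a single forward call over all of them, with memory as the only limit on batch size.

```python
PAD_ID = 0

def pad_batch(token_id_lists):
    """Left-pad variable-length prompts into a rectangular (batch, seq) matrix."""
    max_len = max(len(ids) for ids in token_id_lists)
    return [[PAD_ID] * (max_len - len(ids)) + ids for ids in token_id_lists]

def batched_forward(batch):
    """Toy stand-in 'model': one pass over the whole batch instead of a
    Python loop over individual prompts; returns one value per sequence."""
    return [sum(row) for row in batch]

prompts = [[5, 7], [1, 2, 3, 4], [9]]   # three prompts of different lengths
batch = pad_batch(prompts)              # one 3 x 4 batch
outputs = batched_forward(batch)        # one call serves all three prompts
```

In a real client the same pattern would apply: stack padded `input_ids` for all prompts into one tensor and pass it through the model in a single call, rather than looping over prompts one at a time.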
