Replies: 1 comment
It depends on your model and use case, but generally SGLang is a bit faster.
I see many benchmark scripts, and I was wondering whether there are aggregated results versus vLLM for different models, input lengths, and output lengths, so that I don't have to rerun them all.
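If no aggregated table exists, one fallback is to script the sweep yourself and collect every configuration into one result set. Below is a minimal sketch; `run_benchmark` is a hypothetical stand-in for whichever benchmark script you actually invoke (it is not an SGLang or vLLM API), and the model names and lengths are placeholders:

```python
# Hypothetical sketch: sweep (engine, model, input_len, output_len) combinations
# and aggregate the results into one table, instead of rerunning scripts ad hoc.
from itertools import product

def run_benchmark(engine, model, input_len, output_len):
    # Stub: in practice this would launch the engine's serving-benchmark
    # script with these parameters and parse its reported throughput.
    return {"engine": engine, "model": model,
            "input_len": input_len, "output_len": output_len,
            "tokens_per_s": 0.0}  # placeholder value

def aggregate(engines, models, input_lens, output_lens):
    # One row per combination, in a shape that is easy to dump to CSV.
    return [run_benchmark(e, m, i, o)
            for e, m, i, o in product(engines, models, input_lens, output_lens)]

rows = aggregate(["sglang", "vllm"], ["llama-3-8b"], [128, 1024], [128, 1024])
print(len(rows))  # one row per (engine, model, input_len, output_len) combination
```

Swapping the stub for a real benchmark invocation yields the aggregated comparison the question asks for.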