vLLM Semantic Router: Next Phase in LLM inferenceSeptember 6, 2025 · 6 min readHuamin ChenDistinguished Engineer @ Red HatChen WangSenior Staff Research Scientist @ IBMYue ZhuStaff Research Scientist @ IBMXunzhuo LiuSoftware Engineer @ Tencent