VAPI•6mo ago

Groq Llama4 Latency Issue

When I am using Groq Llama4 Maverick on test call it works great but when I use it on real call the latency goes to 5000ms whereas it shows 640 ms on Web and 1140 ms on Twilio ( I am using VAPI Number)

few-sapphire•7/11/25, 6:41 PM

That may be part of the plan you are using for groq? are you using the free api access? that has good latency, but a very long time to first token (TTFT), from what I have seen

KyleAPP•7/11/25, 9:13 PM

Enable streaming, switch to livekit endpointing, reduce waitSeconds, and ensure Groq Llama4 Maverick is used with real-time token streaming..

Groq Llama4 Latency Issue

Similar Threads

Similar Threads

Similar Threads