metropolitan-bronze•5d ago
Very slow speaking response times
Hey VAPI folks - i am trying to figure out settings that produce natural feeling conversation times. Right now I talk and it can take 2-3 seconds before the AI responds. It's really slow. am using
claude-sonnet-4
for my production completions model but even when i use Groq gpt-oss-120B
it's not much better. Is it the SST provider I'm using (Speechmatics)? Is it cartesia? Is there some observability where i can see for each turn what is taking the most time?
Here's my current settings. Any help would be appreciated, may end up spending a lot with you guys so happy to hop on a call..... :
5 Replies
Hi there,
Thank you for your message. Our team is currently out of the office. We operate Monday through Friday, from 9:00 AM to 8:00 PM Pacific Standard Time (PST).
We’ll get back to you as soon as possible during our normal business hours.
If your message is urgent, please mark it accordingly or include “URGENT” in the subject line, and we’ll do our best to respond promptly.
Warm regards,
Vapi
Customer Support Team
Vapi
Customer Support Team
metropolitan-bronzeOP•5d ago
Okay i see call logs does hav some of this. It seems like endpointing is the big one. Also claude sonnet 4 not great. Does VAPI have some like common presets of good combinations with different models? Do i just nuke smart endopinting altogether? How is anyone possibly using it when it apepars to add 1+ second to each response
does vapi support
gemini-2.5-flash-preview-native-audio-dialog
?
and whe i'm using gpt-4o-realtime am i able to customize the voice ?
man even with gpt-4o-realtime. it's so bad. it talks over me all the time or like double streams.
do you guys jut have a "good" preset to use with different models? Trying to twist every little knob just to get it to output something decent because so far pretty every variation i've tried is pretty much unusable
Is there someone i can hop on a call with? Building a consumer app here that has potential to scale up quite large, I would like some help getting the realtime voice settings usable.foreign-sapphire•4d ago
Man, I feel your frustrations.
Could be crazy a little, I suggest you could reduce the voice response time in the web Dashboard interface of the assistant, under Advanced settings.
metropolitan-bronzeOP•3d ago
bueller
Hey Franny, we understand your frustration with configuring a natural sounding assistant. Our default settings are usually as follows:
- Voice: VAPI
- LLM: gpt-4o mini cluster
- transcriber: deepgram nova-2
This setup, despite being simple, is quite effective and natural sounding. Let us know if you have any questions.
Also for available LLM choices that are selectable in our platform, please visit the documentation API page for enum values for each model/provider. https://docs.vapi.ai/api-reference/assistants/create#request.body.model