Transcripts with custom transcriber
Hi!
I am using a custom transcriber/STT with Vapi. Obviously my websocket will return the transcription for the caller, but it seems like Vapi also requires transcription for the agent? Is my understanding correct?
It seems odd to me that Vapi relies on STT for the transcript when it could get the transcript from the LLM output that's later used for TTS. Just want to make sure i'm not missing something?
If I should be providing transcription for both the caller and the agent, what are the best practices for setting this up? Asking b/c this introduces more room for error, since the STT may make mistakes for the agent transcription, and also additional complexity on whether/how to "listen" for the caller or agent.
I am using a custom transcriber/STT with Vapi. Obviously my websocket will return the transcription for the caller, but it seems like Vapi also requires transcription for the agent? Is my understanding correct?
It seems odd to me that Vapi relies on STT for the transcript when it could get the transcript from the LLM output that's later used for TTS. Just want to make sure i'm not missing something?
If I should be providing transcription for both the caller and the agent, what are the best practices for setting this up? Asking b/c this introduces more room for error, since the STT may make mistakes for the agent transcription, and also additional complexity on whether/how to "listen" for the caller or agent.