Issue with custom transcriber
Hello, I’m trying to build a custom transcriber. I have a server that receives audio data from Vapi, sends that data to Google Speech to text API and returns the transcription in the following format:
{
"type": "transcriber-response",
"transcription": “transcription here“,
"channel": "customer"
}
I tested having a conversation with multiple turns. I spoke a few times and the assistant replied back. It works, but it has one issue. I checked the Call Log Details and on the Transcripts tab there’s a single entry that contains all my transcriptions merged in one sentence.
Example:
User: hello how are you what’s the weather in San Francisco
If I use a transcriber available off-the-shelf (like Deepgram), Call Log Details show up correctly.
Example:
User: hello
Assistant: hello
User: how are you
Assistant: I’m just a robot
User: what’s the weather in San Francisco
I believe whenever I speak the assistant gets the merged transcriptions ("hello how are you what’s the weather in San Francisco"), so the issue is not simply affecting Call Log Details, I think it's also affecting the conversation itself
{
"type": "transcriber-response",
"transcription": “transcription here“,
"channel": "customer"
}
I tested having a conversation with multiple turns. I spoke a few times and the assistant replied back. It works, but it has one issue. I checked the Call Log Details and on the Transcripts tab there’s a single entry that contains all my transcriptions merged in one sentence.
Example:
User: hello how are you what’s the weather in San Francisco
If I use a transcriber available off-the-shelf (like Deepgram), Call Log Details show up correctly.
Example:
User: hello
Assistant: hello
User: how are you
Assistant: I’m just a robot
User: what’s the weather in San Francisco
I believe whenever I speak the assistant gets the merged transcriptions ("hello how are you what’s the weather in San Francisco"), so the issue is not simply affecting Call Log Details, I think it's also affecting the conversation itself