Has anyone successfully added streaming (i.e. near realtime) speech to text using an API interface to any of the recognised AI speech models?
I know you can use the Win+H key but I need much higher accuracy plus the additional features/parameters that the speech AI models provide.
Example providers are AssemblyAI, Speechmatics, DeepGram...
Ideally I want to do it all in VBA, via websockets etc. I've asked the OpenAI and Claude to code it but they (and I) can't quite get it working.
I know you can use the Win+H key but I need much higher accuracy plus the additional features/parameters that the speech AI models provide.
Example providers are AssemblyAI, Speechmatics, DeepGram...
Ideally I want to do it all in VBA, via websockets etc. I've asked the OpenAI and Claude to code it but they (and I) can't quite get it working.