OpenAI launches gpt-realtime, a new speech-to-speech AI model designed for enterprise applications with more natural, expressive voices and improved instruction-following capabilities. The model operates through the newly available Realtime API, featuring enhanced function calling, image recognition, and SIP support for contact

4m read time
From venturebeat.com
Table of contents
Speech-to-speech models
Better instruction following
Realtime API updates
