Best of Speech Recognition — December 2024
- 1
- 2
Laravel News·1y
Automatic speech recognition and transcription
Whisper.php is a PHP wrapper for whisper.cpp, a C/C++ port of OpenAI's Whisper model, created by Kyrian Obikwelu. Recently released in version 1.0.0, it facilitates fully local and API-free transcription with features like high and low-level APIs, model auto-downloading, support for various audio formats, and multiple output formats. It requires the FFI extension in PHP and relies on platform-specific shared libraries, downloaded automatically during the first initialization. Whisper.php currently supports Linux and macOS, with Windows support in development.
- 3
Daily Dose of Data Science | Avi Chawla | Substack·1y
[Hands-on] Building a Real-Time AI Voice Bot
This post guides readers through the process of building a real-time AI voice bot using AssemblyAI for transcription, OpenAI for generating responses, and ElevenLabs for speech generation. It includes prerequisites, implementation steps, and methods for completing the application. Additionally, it highlights the performance improvements in AssemblyAI's latest speech-to-text model, Universal-2, compared to its predecessor.