Best of Speech Recognition · December 2024

  1.
    Article
    Hacker News · 1y

    The Accent Oracle

    A tool that claims to guess your native language by analyzing your English accent in less than 30 seconds.

  2.
    Article
    Laravel News · 1y

    Automatic speech recognition and transcription

    Whisper.php, created by Kyrian Obikwelu, is a PHP wrapper for whisper.cpp, a C/C++ port of OpenAI's Whisper model. Recently released as version 1.0.0, it enables fully local, API-free transcription and offers both high-level and low-level APIs, automatic model downloading, support for various audio formats, and multiple output formats. It requires PHP's FFI extension and relies on platform-specific shared libraries, which are downloaded automatically on first initialization. Whisper.php currently supports Linux and macOS, with Windows support in development.

  3.
    Article
    Daily Dose of Data Science | Avi Chawla | Substack · 1y

    [Hands-on] Building a Real-Time AI Voice Bot

    This post walks through building a real-time AI voice bot that uses AssemblyAI for transcription, OpenAI for generating responses, and ElevenLabs for speech generation. It covers the prerequisites, the implementation steps, and the methods needed to complete the application. It also highlights the performance improvements of AssemblyAI's latest speech-to-text model, Universal-2, over its predecessor.
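
    The three-stage flow described in that summary (transcribe, respond, synthesize) could be sketched roughly as below. This is not the article's code and it calls no vendor SDK: `VoiceBot`, `handle_turn`, and the `fake_*` functions are hypothetical names used only for illustration, with stand-in stages where a real bot would presumably wrap its AssemblyAI, OpenAI, and ElevenLabs clients.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical stage signatures (assumptions, not any vendor's API).
Transcriber = Callable[[bytes], str]                # audio -> text
Responder = Callable[[List[Tuple[str, str]]], str]  # chat history -> reply text
Synthesizer = Callable[[str], bytes]                # reply text -> audio


@dataclass
class VoiceBot:
    """Chains the three stages and keeps a running (role, text) history."""
    transcribe: Transcriber
    respond: Responder
    synthesize: Synthesizer
    history: List[Tuple[str, str]] = field(default_factory=list)

    def handle_turn(self, audio_chunk: bytes) -> bytes:
        # 1. Speech-to-text on the incoming audio.
        user_text = self.transcribe(audio_chunk)
        self.history.append(("user", user_text))
        # 2. Generate a reply from the whole conversation so far.
        reply_text = self.respond(self.history)
        self.history.append(("assistant", reply_text))
        # 3. Text-to-speech on the reply.
        return self.synthesize(reply_text)


# Stand-in stages so the sketch runs offline; they only echo text.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_respond(history: List[Tuple[str, str]]) -> str:
    _role, last_text = history[-1]
    return f"You said: {last_text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


bot = VoiceBot(fake_transcribe, fake_respond, fake_synthesize)
```

    Passing the stages in as plain callables keeps the turn loop independent of any one provider, so a stage can be swapped without touching `handle_turn`. A real-time version would also need streaming audio input and interruption handling, which this sketch leaves out.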