The Web Speech API is a browser API that lets web applications work with audio as data. With it, web apps can transcribe speech from audio input and also synthesise speech from text.

freeCodeCamp

A step-by-step guide to building a full-stack voice-powered AI application using the browser's Web Speech API. Covers setting up a SpeechRecognition instance in vanilla JavaScript to capture and transcribe speech, building a Node.js backend that forwards transcripts to the Gemini AI API, and connecting both layers. Also includes optional deployment instructions using Google Cloud Run for the backend and Firebase Hosting for the frontend.
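The speech-capture step mentioned above can be sketched roughly as follows. This is a minimal illustration rather than the guide's own code: `createTranscriber` and `onTranscript` are hypothetical names, and the recognition constructor is passed in as an argument so the same function can run in a browser (where it would be `window.SpeechRecognition` or the prefixed `window.webkitSpeechRecognition`) or against a stand-in outside one. The `"en-US"` default is also an assumption.

```javascript
// Sketch only: wraps a SpeechRecognition-style constructor so that each
// final transcript is handed to a callback. `createTranscriber` and
// `onTranscript` are illustrative names, not part of the Web Speech API.
function createTranscriber(RecognitionCtor, onTranscript, lang = "en-US") {
  if (typeof RecognitionCtor !== "function") {
    throw new Error("SpeechRecognition is not available in this environment");
  }

  const recognition = new RecognitionCtor();
  recognition.lang = lang;             // language to recognise (assumed default)
  recognition.continuous = false;      // stop after a single utterance
  recognition.interimResults = false;  // deliver final results only

  recognition.onresult = (event) => {
    // event.results is a list of results; each result holds ranked
    // alternatives, and index 0 is the most likely transcription.
    const latest = event.results[event.results.length - 1];
    onTranscript(latest[0].transcript.trim());
  };

  recognition.onerror = (event) => {
    console.error("Speech recognition error:", event.error);
  };

  return recognition;
}

// Browser usage:
// const Ctor = window.SpeechRecognition || window.webkitSpeechRecognition;
// const recognition = createTranscriber(Ctor, (text) => console.log(text));
// recognition.start(); // the browser prompts for microphone access
```

Passing the constructor in keeps the feature check in one place and makes the handler logic testable without a microphone. In a full application the callback would be the point where the transcript is sent on to the backend.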

How to Build a Voice-Powered AI Application with the Web Speech API

Deploy the Backend Application with Google Cloud Run

Deploy the Frontend Application with Firebase