This project lets users edit images with spoken commands. It combines several AI models: Detectron2 for object detection, Faster Whisper for audio transcription, and Stable Diffusion for text-guided inpainting. Users upload an image, give an audio command, and see the image modified based on that command.
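To make the flow concrete, here is a minimal sketch of how the three stages might be chained. It is not the project's actual code: `EditPipeline`, `Detection`, and the three callables are hypothetical names, and the real models (Faster Whisper, Detectron2, Stable Diffusion) would be wrapped behind those callables. Matching a detected label against the transcript by substring is an assumed, deliberately naive heuristic.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels


@dataclass
class Detection:
    """One detected object: a class label and its bounding box."""
    label: str
    box: Box


@dataclass
class EditPipeline:
    """Hypothetical glue code; each stage is injected as a plain callable."""
    transcribe: Callable[[str], str]          # audio path -> command text
    detect: Callable[[str], List[Detection]]  # image path -> detected objects
    inpaint: Callable[[str, Box, str], str]   # (image, region, prompt) -> output path

    def run(self, image_path: str, audio_path: str) -> Optional[str]:
        # 1. Turn the spoken command into normalized text.
        command = self.transcribe(audio_path).strip().lower()
        # 2. Pick the first detected object whose label appears in the command.
        #    (Naive substring match, assumed here for illustration only.)
        target = next(
            (d for d in self.detect(image_path) if d.label.lower() in command),
            None,
        )
        if target is None:
            return None  # nothing in the image matches the command
        # 3. Repaint the matched region, using the command as the prompt.
        return self.inpaint(image_path, target.box, command)
```

Keeping the stages as injected callables means each model can be swapped or stubbed out independently, for example to try the control flow without loading any weights.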

From github.com
Table of contents
AAIELA: AI Assisted Image Editing with Language and Audio
Project Structure
Installation
Project Workflow
Research
Todo