SAM Audio transforms audio processing by making it easy to isolate any sound from complex audio mixtures using natural, multimodal prompts — whether...

AI at Meta

Meta introduces SAM Audio, a unified multimodal model that isolates sounds from complex audio mixtures using text, visual, or temporal prompts. Built on the Perception Encoder Audiovisual (PE-AV), it achieves state-of-the-art performance across speech, music, and general sound separation. The release includes SAM Audio-Bench (the first in-the-wild audio separation benchmark), SAM Audio Judge (an automatic evaluation model), and integration into the Segment Anything Playground. The model runs faster than real time and accepts combinations of prompt types. It has two stated limitations: it cannot separate highly similar audio events, and it cannot perform complete separation without a prompt.
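
For readers unfamiliar with the term, "faster than real time" is usually expressed through the real-time factor (RTF): processing time divided by the duration of the audio processed, with RTF below 1 meaning the model finishes before the clip would have played through. The following is a minimal sketch of that arithmetic only. The function names are hypothetical, and the numbers are illustrative rather than measurements of SAM Audio.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return processing time divided by audio duration (the RTF)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def is_faster_than_real_time(processing_seconds: float, audio_seconds: float) -> bool:
    """True when the audio is processed in less time than it takes to play."""
    return real_time_factor(processing_seconds, audio_seconds) < 1.0


# Illustrative values only (assumed, not reported for SAM Audio):
# a 10-second clip processed in 5 seconds.
rtf = real_time_factor(processing_seconds=5.0, audio_seconds=10.0)
print(rtf)                                  # 0.5
print(is_faster_than_real_time(5.0, 10.0))  # True
```

An RTF of 0.5 in this made-up case would mean the clip is processed in half its playback time.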