Introducing Gemini Omni, which allows you to create anything from any input and edit naturally using conversational language.

DM provides a diverse range of content spanning technology, business, and culture, offering articles, interviews, and analysis for readers interested in staying updated with the latest trends and developments across various industries. Readers can learn about emerging technologies, industry insights, and  perspectives from experts in different fields.

DeepMind

Google is launching Gemini Omni, a new multimodal AI model focused on video generation and editing. The first release, Gemini Omni Flash, accepts any combination of images, audio, video, and text as input to generate and edit high-quality videos using natural language instructions. Key capabilities include conversational multi-turn video editing, physics-aware scene generation, knowledge-grounded storytelling, and a personal avatar feature. It is rolling out to Google AI Plus/Pro/Ultra subscribers via the Gemini app and Google Flow, and free to YouTube Shorts users. All generated videos include SynthID watermarking for AI content transparency. API access for developers and enterprise customers is coming in the following weeks.