Explore Gemini's new conversational image segmentation, which lets you query visual data with complex phrases and abstract concepts.

Google Developers

Gemini 2.5 introduces conversational image segmentation, allowing users to identify and segment objects using complex natural language descriptions instead of simple labels. The model can understand relational queries like 'the person holding the umbrella', comparative attributes like 'the most wilted flower', and abstract concepts like 'damage' or 'safety violations'. This advancement enables new applications in creative workflows, workplace safety monitoring, and insurance assessment by combining visual understanding with world knowledge and OCR capabilities.
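To make the query styles above concrete, here is a minimal Python sketch of the client-side plumbing around such a request. It makes no API call. The helper names (`build_prompt`, `strip_fence`, `to_pixel_box`, `parse_segments`) are hypothetical, and the response shape is an assumption not stated in this summary: a JSON list whose entries carry a `box_2d` in `[y0, x0, y1, x1]` order normalized to 0-1000, a `mask`, and a `label`. Check the current Gemini documentation before relying on that format.

```python
import json

FENCE = "`" * 3  # a Markdown code fence, which models often wrap JSON in


def build_prompt(query: str) -> str:
    """Wrap a free-form query (relational, comparative, or abstract) in a
    segmentation request. The wording is illustrative, not an official prompt."""
    return (
        f"Give the segmentation masks for {query}. "
        "Output a JSON list where each entry contains the 2D bounding box "
        'in the key "box_2d", the segmentation mask in the key "mask", '
        'and the text label in the key "label".'
    )


def strip_fence(text: str) -> str:
    """Remove a surrounding Markdown code fence, if the text has one."""
    text = text.strip()
    if text.startswith(FENCE):
        # Drop the opening fence line (for example a fence tagged "json").
        text = text.split("\n", 1)[1] if "\n" in text else ""
        # Drop the closing fence and anything after it.
        text = text.rsplit(FENCE, 1)[0]
    return text.strip()


def to_pixel_box(box_2d, width: int, height: int):
    """Convert an ASSUMED [y0, x0, y1, x1] box normalized to 0-1000 into
    (left, top, right, bottom) pixel coordinates."""
    y0, x0, y1, x1 = box_2d
    return (
        x0 * width // 1000,
        y0 * height // 1000,
        x1 * width // 1000,
        y1 * height // 1000,
    )


def parse_segments(response_text: str, width: int, height: int):
    """Parse the model's text response into labels and pixel-space boxes.
    Mask decoding is omitted from this sketch."""
    items = json.loads(strip_fence(response_text))
    return [
        {"label": item["label"],
         "box": to_pixel_box(item["box_2d"], width, height)}
        for item in items
    ]


if __name__ == "__main__":
    # Hand-written stand-in for a model response; not real model output.
    sample = json.dumps([
        {"box_2d": [100, 200, 500, 600], "mask": "",
         "label": "the person holding the umbrella"}
    ])
    print(build_prompt("the person holding the umbrella"))
    print(parse_segments(sample, width=1000, height=500))
```

Keeping prompt construction and response parsing separate from the network call lets the same helpers serve relational queries ('the person holding the umbrella'), comparative ones ('the most wilted flower'), and abstract ones ('damage') without modification.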

Conversational image segmentation with Gemini 2.5

Leveraging conversational image segmentation queries

Conversational image segmentation in action