Qwen Team recently open-sourced Qwen-Image, an image foundation model. Qwen-Image supports text-to-image (T2I) generation and text-image-to-image (TI2I) editing tasks, and outperforms other models on

InfoQ

Qwen Team released Qwen-Image, an open-source image foundation model that excels at text-to-image generation and image editing tasks. The model combines Qwen2.5-VL for text processing, a VAE for images, and a Multimodal Diffusion Transformer for generation. It outperforms other models on various benchmarks and ranks third on AI Arena against closed models like GPT Image 1. The model was trained on billions of annotated image-text pairs using progressive scaling strategies and reinforcement learning from human feedback.

Qwen Team Open Sources State-of-the-Art Image Model Qwen-Image