Unified-IO 2 is an autoregressive multimodal AI model capable of understanding and generating image, text, audio, and action. It is trained on diverse multimodal data and excels in tasks like keypoint estimation, surface normal estimation, and image generation.
Sort: