Unified-IO 2 is an autoregressive multimodal AI model capable of understanding and generating images, text, audio, and actions. It is trained on diverse multimodal data and excels at tasks such as keypoint estimation, surface normal estimation, and image generation.

3m read time. From marktechpost.com