Seeing it All: LLaVA-UHD Perceives High-Resolution Images at Any Aspect Ratio

We are a community of AI/ ML/Generative AI enthusiasts/researchers/journalists/writers who share interesting news and articles about the applications of AI. 

Machine Learning News

LLaVA-UHD is a method for building encoder-decoder models that can handle high-resolution images at any aspect ratio. It splits up large images into smaller slices, preserves fine visual details, and outperforms standard models and specialized high-res systems.