ScreenAI is a visual language model for UI and visually-situated language understanding. It improves upon previous models with a flexible patching strategy, achieving state-of-the-art results on various tasks. The model is trained using a unique mixture of datasets and tasks, and performs well on benchmarks compared to models

6m read timeFrom blog.research.google
Post cover image
Table of contents
ScreenAIData generationExperiments and resultsConclusionAcknowledgements
2 Comments

Sort: