Apple researchers have introduced Ferret-UI Lite, a compact 3B-parameter multimodal model designed to run on-device and interact with graphical user interfaces across mobile, web, and desktop platforms. Unlike existing GUI agents that rely on large foundation models like GPT or Gemini, Ferret-UI Lite prioritizes low latency,
Sort: