ScreenPipe is a tool that streams data from your screen and leverages Large Language Models (LLMs) to process text and images. Inspired by adept.ai, rewind.ai, and Apple Shortcut, the project uses Rust and WASM technologies. The current prototype can capture your screen and extract text using OCR, which can then be processed

5m read timeFrom github.com
Post cover image
Table of contents
Screen to action using LLMsStatusUsageWhy open source?PrinciplesContributingLicensingRelated projects

Sort: