Try the MediaPipe LLM Inference API in our web demo. The Web SDK will be released in the next few weeks, with the iOS SDK coming soon.


Google Developers

TensorFlow Lite and MediaPipe have released the experimental MediaPipe LLM Inference API, which lets Large Language Models (LLMs) run fully on-device. The API targets Web, Android, and iOS and supports four openly available LLMs: Gemma, Phi 2, Falcon, and Stable LM. Developers can integrate these models into their applications in a few steps using the provided SDKs. The release also improves on-device performance, particularly latency, through optimizations across several libraries and runtimes.