WebLLM is a javascript package that brings language model chats directly to web browsers with hardware acceleration and no server support. It is compatible with OpenAI API and offers streaming, json-mode, function-calling, and seed-to-reproduce functionalities.

6m read timeFrom github.com
Post cover image
Table of contents
Get StartedFull OpenAI CompatibilityModel SupportBuild WebLLM Package From SourceLinksAcknowledgement

Sort: