WebLLM is a javascript package that brings language model chats directly to web browsers with hardware acceleration and no server support. It is compatible with OpenAI API and offers streaming, json-mode, function-calling, and seed-to-reproduce functionalities.
Table of contents
Get StartedFull OpenAI CompatibilityModel SupportBuild WebLLM Package From SourceLinksAcknowledgementSort: