Mozilla's Firefox iOS team built 'Shake to Summarize', a feature that generates a webpage summary when the user shakes their phone. This post details the LLM model selection process, evaluating candidates across four dimensions: summary quality (using an LLM judge with GPT-4o), inference speed (time to first token and

6m read timeFrom blog.mozilla.org
Post cover image
Table of contents
Which model?Release and future directions

Sort: