Step-by-step guide to setting up Ollama with Gemma 4 26B on an Apple Silicon Mac mini. Covers installation via Homebrew, pulling and running the model, configuring auto-start at login, creating a launchd agent to preload the model into memory, and setting OLLAMA_KEEP_ALIVE to prevent model unloading. Also covers GPU acceleration via MLX, memory considerations (20GB usage on 24GB machines), and using the OpenAI-compatible local API at localhost:11434.
Table of contents
PrerequisitesStep 1: Install OllamaStep 2: Start OllamaStep 3: Pull Gemma 4 26BStep 4: Test the ModelStep 5: Configure Auto-Start on LoginStep 6: Verify Everything WorksAPI AccessUseful CommandsUninstall / Remove Auto-StartWhat's New in Ollama v0.19+ (March 31, 2026)NotesReferencesSort: