An open-source toolkit for fine-tuning Google's Gemma 4 and 3n models on text, images, and audio using LoRA on Apple Silicon Macs via PyTorch MPS. It supports local CSV datasets as well as streaming from GCS and BigQuery, making it possible to train on large datasets without copying terabytes locally. Key differentiators include being the only Apple-Silicon-native path for audio+text LoRA fine-tuning, no NVIDIA GPU required, and a guided CLI wizard for setup. Supported modalities include text-only (instruction/completion), image+text (captioning/VQA), and audio+text. The toolkit uses Hugging Face checkpoints with PEFT LoRA and exports merged HF/SafeTensors weights.
Table of contents
LoRA for Gemma 4 & 3n — why not just use…?What you can build with thisSupported modelsArchitecture (what actually calls what)RequirementsInstallationCLI cheat sheetText-only fine-tuningImage fine-tuningGemma 3n / Gemma 4 on Apple SiliconData: CSVs, GCS, BigQueryTraining visualizer (optional)NVIDIA Granary & streamingApple Silicon knobsCI & testsExperiment indexTroubleshootingContributingAcknowledgmentsLicenseSort: