Show HN: A developer built a tool to fine-tune Gemma 4 (Google's open model) on multimodal data using M-series Macs. It's clean, well-documented, and lets you adapt the base model to your own data without touching a GPU cluster. If you've wanted to customize a vision-language model but thought it was out of reach, this puts it within reach of anyone with an M-series Mac.