Running a Local LLM with Ollama and MCP — An Architecture Spike
AI inference doesn't have to mean a cloud API call. This post describes a spike I built to run a locally hosted language model behind a clean, layered architecture, using Ollama and the Model Context Protocol (MCP).
The full source is on GitHub. Contributions and feedback welcome.
