Running Your Coding Agent Locally: Lessons from a Real-World Experiment

VoxxedDays Zürich 2026 (joint talk with Alessio Soldano)

In this talk, we'll explore the practical journey of building and running a local AI coding setup: choosing models, hosting them on consumer hardware, connecting frontends like LM Studio, and evaluating what really works (and what doesn't). We'll discuss trade-offs in latency, memory, and tool integration, the role of KV cache and model routing, and how far open-source models can go in replicating commercial AI dev environments.

// video

// resources

📄

Slides

maeste.it/transformer-explainer-kit/

▶️

Video Recording

youtube.com/watch?v=DXEsG3Vo6F4

🔗

Talk Page

vdz26.voxxeddays.ch

← // all talks