Extreme Vibe Coding at XP Days Benelux

Sep 1, 2025 Willem van den Ende Subscribe to the RSS feed

Tags: LLM,

Wouter Lagerweij and I will be running the eXtreme Vibe Coding session in a one hour slot at XP Days Benelux 2025, in the Kapellerput in Heeze, NL on 27 or 28 November. We will be there for both days, and probably the pre-conference dinner.

What I like about XP days, is that it is still a place for experimental sessions like this one. And it makes space for hands-on sessions that are also somewhat technical in nature.

I got a message from the XP Days Program Committee last Saturday, asking if our session eXtreme Vibe Coding could be done in 60 instead of the proposed 90 minutes. It can, but it might be a bit more live demo than hands-on work by the participants.

A plenary session at XP Days Benelux 2023, someone presenting to a large gathering of onlookers, standing around.

We mentioned 60 minutes, but forgot to put the timetable for the 60 minutes version in the session description. We briefly discussed it while coming up with the session, and preferred more time for hands-on, so prioritised the 90 minutes version. Thinking about the committees’ message, I came up with doing a Birds of a Feather session in the evening to go more in-depth with those who are up for it. So here is hoping it gets scheduled for the first day of the conference.

We made the learning objectives several months ago. Incremental progress is hard to see day by day, but coming back to our tools and code after being away for weeks, the progress is noticeable. Both in what we have learned over time, e.g. guard rails and engineered prompts, small tools etc. as in the progress that mode and tools are making.

From the learning objectives, Strategic Refactoring and Re-prompting is still much a work in progress. I was reflecting on this with Stephan Eggermont earlier today. Stephan and I are bootstrapping, in stealth mode, on what can be described as a MOMCG (Massive Online Multiplayer Collaborative Game). Last week we have iterated on changes made by our coding assistant, having a close look at what works and what doesn’t. We have made progress in additional checks and balances. But there is a but.

While we are at a point where we can have the assistant churn through some larger plans with a number of refactoring steps and having all the unit and end-to end tests pass, there are still surprises. For instance, when the assistant successfully migrated a JavaScript front-end to typescript, it added several thousand lines of additional validation code that was not only unnecessary, but often wrong.

We also find that “Going fast with value delivery while staying in control of complexity” is non-trivial. Drafting some blog posts today, one of the things that emerged is that the cost-of-change curve is still a thing. We can now get both success and trouble faster. Our current hypothesis is that progress coding tools’ ability to evaluate and remediate code might be outstripped by the models increasing capability to produce large amounts of code in one go. We are working on ways to counter-act this, but this is an active area of research.

It is not yet clear if Going down the waterfall iteratively - moulding a view on our vibes with Stephan Eggermont has made it in for XP days. I hope it will, because these are quite complimentary. In AI terms, having good “evals” is essential, and visualisations are a key part to effectively and efficiently judge larger changes.

If you made it this far, I would appreciate your feedback, either privately or in the linked in comments for this post, once it goes up there. Several points in this posts could be written out to be more substantiated, it would be useful to know which ones pique your interest.