Concertor Blog
Technical writing on multi-model AI coding, verification, and how AI dev tools actually work under the hood.
-
Multi-Model Fusion
Why multi-model AI coding beats single-model: Claude + GPT together
When two models from different families disagree on the same coding problem, that divergence is not noise — it maps directly to genuine problem difficulty. Here is why multi-model AI coding is architecturally superior to running one model twice.
Read article →
-
Verification
Why AI coding verification fails when agents grade their own homework
LLM judges score roughly 50% on correctness evaluation — barely better than chance. The model that wrote the bug cannot reliably find the bug. Here is what real AI coding verification requires, and why execution tests are the only honest ground truth.
Read article →
-
Model Comparison
Claude vs GPT for coding: you should not have to choose
Claude and GPT fail on different problems. That is the whole point. An honest comparison of where each excels, where each breaks, and why routing to one or the other discards exactly the information you need most.
Read article →