Engineering

Concertor Blog

Technical writing on multi-model AI coding, verification, and how AI dev tools actually work under the hood.

Multi-Model Fusion

Why multi-model AI coding beats single-model: Claude + GPT together

When two models from different families disagree on the same coding problem, that divergence is not noise — it maps directly to genuine problem difficulty. Here is why multi-model AI coding is architecturally superior to running one model twice.

June 26, 2026 · 6 min read
Read article →
Verification

Why AI coding verification fails when agents grade their own homework

LLM judges score roughly 50% on correctness evaluation — barely better than chance. The model that wrote the bug cannot reliably find the bug. Here is what real AI coding verification requires, and why execution tests are the only honest ground truth.

June 26, 2026 · 6 min read
Read article →
Model Comparison

Claude vs GPT for coding: you should not have to choose

Claude and GPT fail on different problems. That is the whole point. An honest comparison of where each excels, where each breaks, and why routing to one or the other discards exactly the information you need most.

June 26, 2026 · 7 min read
Read article →