Technical writing on AI architecture, multi-model orchestration, and the systems we ship on. Public, plain language, archived here.
Given a task that decomposes into independent sub-tasks, a system of small models specialized to each sub-task returns more correct answers than a single general model of equal aggregate parameter count. The theoretical foundation behind YG3 platform orchestration.