We started Seleya Labs because we think the next decade of AI is
going to look very different from the one we're in.
The current story is simple. A few labs sit at the frontier.
Everyone else rents intelligence from them by the token. The labs
get smarter, the API gets better, and that's how AI happens.
The trouble is, this story assumes compute is abundant. It isn't.
Memory is sold out through 2026. Power, not GPUs, is now the
binding constraint on new data centers. The supply chain has hard
chokepoints at every layer, from rare earths to ASML, and none of
them resolve in under five years. The labs at the top of the stack
will get their compute. Most of us won't, at least not at any
price we want to pay.
Our bet is that what most people actually need from AI doesn't
require a frontier model. A correctly-sized model, fine-tuned on
your own work, with proper memory and grounding, will outperform a
frontier model at your work. The hard part isn't the model. It's
everything around the model: the runtime, the context, the memory,
the tooling, the way it deploys on hardware you control. That's
what we're building.
Spock is our first
product, and it's running in production today. A longer thesis is
publishing in 2026.