Blog
Research · Apr 16, 2026
Mission defines strategy, and strategy defines structure
How our five-phase pipeline revealed a bigger picture on its own, and what that teaches us about context engineering across agent boundaries
Research · Apr 10, 2026
Robustness through meaning - one triple at a time
Exploring how ontologies cross-validate with LLMs to enable robust failure detection in agentic systems, and why this approach differs from Palantir's operational ontology.
Research · Feb 27, 2026
Can Your Prompts Optimize Themselves?
Exploring how DSPy's declarative approach to prompt engineering replaces hand-crafted templates with Bayesian-optimized programs, and what happens when you apply it to a real failure-detection pipeline.
Research · Feb 6, 2026
Are Your AI Agents Reliable?
Exploring how frameworks like τ²-bench and Pydantic Evals are shaping the science of evaluating AI agent reliability in production.