Designing an Agent Reading Test
In which I try to give people tools to understand how agents read web content, and where they fail.
Tools, apps, and strong opinions about coffee
In which I revisit measuring agent web traffic, and dive deeper down the rabbit hole.
In which I share research about why LLM-generated output is hard to fact check.
In which verification is the hardest unsolved problem in AI content pipelines, and most organizations don't know it.
In which I find that a platform-published spec's omissions track financial incentives.
In which I use AI to help draft content, and discover its limitations.
In which I build a system to filter AI-related content.
In which someone asks for something in 'skill-validator' and I spin up a new community research project.
In which a platform breaks the Agent Skills spec for its own benefit.
In which I build a freshness check for llms.txt and discover that my tools were the problem.