Confident-Sounding Gibberish
In which I share research about why LLM-generated output is hard to fact check.
Tools, apps, and strong opinions about coffee
In which I share research about why LLM-generated output is hard to fact check.
In which verification is the hardest unsolved problem in AI content pipelines, and most organizations don't know it.
In which I find that a platform-published spec's omissions track financial incentives.
In which I use AI to help draft content, and discover its limitations.
In which I build a system to filter AI-related content.
In which someone asks for something in 'skill-validator' and I spin up a new community research project.
In which a platform breaks the Agent Skills spec for its own benefit.
In which I build a freshness check for llms.txt and discover that my tools were the problem.
In which I validate a 23.7k-star skill mega repo and discover problems the star count won't tell you.
In which I explain why a vibes-based approach to AI and docs ain't cutting it.