bisonbear's submissions

1.		I ran Opus 4.7 vs. Old Opus 4.6 vs. New Opus 4.6 on 28 Zod tasks (stet.sh)
		2 points by bisonbear 6 days ago
2.		Coding evals are broken. CI is green while AI code quality goes unmeasured (stet.sh)
		1 point by bisonbear 9 days ago
3.		Agents.md is the highest-leverage code you're not testing (stet.sh)
		1 point by bisonbear 14 days ago
4.		Your AI coding benchmark is hiding a 2x quality gap (stet.sh)
		3 points by bisonbear 41 days ago
5.		Things I Learned at the Claude Code NYC Meetup (benr.build)
		2 points by bisonbear 3 months ago
6.		Claude vs. Codex in the Messy Middle (benr.build)
		1 point by bisonbear 3 months ago
7.		Spacetime as a Neural Network (benr.build)
		11 points by bisonbear 3 months ago | 5 comments
8.		One agent isn't enough (benr.build)
		18 points by bisonbear 4 months ago | 2 comments
9.		Context Engineering: The New Skill for Working with AI Agents (benr.build)
		1 point by bisonbear 5 months ago
10.		The New Math of Building with AI (benr.build)
		2 points by bisonbear 6 months ago