Hacker Newsnew | past | comments | ask | show | jobs | submit | bisonbear's submissionslogin
1.I ran Opus 4.7 vs. Old Opus 4.6 vs. New Opus 4.6 on 28 Zod tasks (stet.sh)
2 points by bisonbear 6 days ago | past | discuss
2.Coding evals are broken. CI is green while AI code quality goes unmeasured (stet.sh)
1 point by bisonbear 9 days ago | past | discuss
3.Agents.md is the highest-leverage code you're not testing (stet.sh)
1 point by bisonbear 14 days ago | past
4.Your AI coding benchmark is hiding a 2x quality gap (stet.sh)
3 points by bisonbear 41 days ago | past
5.Things I Learned at the Claude Code NYC Meetup (benr.build)
2 points by bisonbear 3 months ago | past
6.Claude vs. Codex in the Messy Middle (benr.build)
1 point by bisonbear 3 months ago | past
7.Spacetime as a Neural Network (benr.build)
11 points by bisonbear 3 months ago | past | 5 comments
8.One agent isn't enough (benr.build)
18 points by bisonbear 4 months ago | past | 2 comments
9.Context Engineering: The New Skill for Working with AI Agents (benr.build)
1 point by bisonbear 5 months ago | past
10.The New Math of Building with AI (benr.build)
2 points by bisonbear 6 months ago | past

Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: