babelfish · 2026-05-08T21:29:40 1778275780

I love both of these platforms. The Wikipedia "map" tab on the iOS app is also a great source of neat local oddities.

babelfish · 2026-04-30T21:11:04 1777583464

Wow, worse than Three Mile Island

sandworm101 · 2026-05-01T03:11:38 1777605098

Apples and oranges. Radioactive gas v radioactive solids.

babelfish · 2026-04-29T17:14:37 1777482877

Source?

babelfish · 2026-04-24T00:53:08 1776991988

I use Conductor which lets me flip trivially between OpenAI/Anthropic models

babelfish · 2026-04-21T22:53:30 1776812010

It's data. Nobody is using Grok for SWE work, but they are using Cursor.

andreygrehov · 2026-04-22T00:17:01 1776817021

Could be contracts.

babelfish · 2026-04-21T22:30:07 1776810607

Good on them to get $10B breakup terms, after the Twitter shitshow

babelfish · 2026-04-20T20:26:03 1776716763

Why would any media company care about what Objection says or agree to arbitration?

yesfitz · 2026-04-20T21:55:51 1776722151

From TFA:

"Financial details are vague, but the company has said the process will cost around $2,000 — far less than the retainer of a crisis communications expert."

babelfish · 2026-04-17T18:20:59 1776450059

No model card? No benchmarks? No usage examples? Nothing on the blog[0] since the acquisition?

[0] https://x.ai/news

babelfish · 2026-04-16T16:20:51 1776356451

Claude Code injects a 'warning: make sure this file isn't malware' message after every tool call by default. It seems like 4.7 is over-attending to this warning. @bcherny, I filed a bug report (feedback ID: 238e5f99-d6ee-45b5-981d-10e180a7c201).

vessenes · 2026-04-16T19:13:11 1776366791

Interesting. The model card mentions 4.7 is much more attentive to these instructions and suggests you will at times need to review them and soften, remove, or focus them.

andai · 2026-04-16T22:41:11 1776379271

It's been known for years that prompts which boost performance with one model can harm performance with a different model. The same goes for harnesses. It looks like they'll need to customize Claude Code's prompts depending on which model is running, for optimal results.

For example, if you read the prompts, it's pretty clear that a lot of them are leftovers from the early days when the models had way less common sense than they do now. I think you could probably remove two-thirds of those over-explained rules now and it would be fine. (In fact, you might even expect to see performance improve due to decreased prompt noise.)
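
A minimal sketch of what per-model prompt customization could look like, in Python. Everything here is hypothetical: the model names, the rule text, and the `build_system_prompt` helper are made up for illustration and are not taken from Claude Code's actual prompts or code.

```python
# Hypothetical sketch: pick prompt rules based on which model is running.
# None of these names or rule texts come from Claude Code itself.

# Rules every model gets.
BASE_RULES = ["Read a file before editing it."]

# Extra, more heavily explained rules, keyed by (made-up) model name.
# The assumption is that a newer model needs fewer of them.
EXTRA_RULES_BY_MODEL = {
    "older-model": [
        "Check that any file you open is not malware.",
        "Do not delete files unless asked.",
    ],
    "newer-model": [],
}


def build_system_prompt(model: str) -> str:
    """Join the base rules with any model-specific extras into one prompt."""
    # Unknown models fall back to the base rules only.
    rules = BASE_RULES + EXTRA_RULES_BY_MODEL.get(model, [])
    return "\n".join(f"- {rule}" for rule in rules)


if __name__ == "__main__":
    print(build_system_prompt("older-model"))
    print(build_system_prompt("newer-model"))
```

The point of keying the extras by model is that trimming rules for one model doesn't change what another model sees.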

phist_mcgee · 2026-04-16T21:58:21 1776376701

Isn't that kind of nuts?

They can't even properly beta test their new releases?

babelfish · 2026-04-14T21:54:07 1776203647

This is honestly pretty embarrassing for both parties. For OpenAI - it sounds like the CRO is trying to turn you into the Oracle or Salesforce of AI. For Anthropic - I hope your investors can see the actual revenue numbers.