A field sobriety test is distinct from a chemical analysis (breathalyzer or otherwise).
In California, you are required to submit to chemical testing (breath, urine, or blood — I don’t recall the rules for which applies in which situations). However, you are not required to otherwise talk to or perform the absurd procedure of the field sobriety test (“you have the right to remain silent”).
> there should be verifiable reverse lookup to a valid registered number
This has existed for a very, very long time; its just a business feature you have to pay a lot of extra money for and that is generally unavailable to consumers.
Business customers get access to DNIS and ANI.
DNIS is the number that was dialed to reach the business. It is used for call routing and marketing analytics.
ANI is the phone number from which the call was made. It’s used again for call routing and analytics, but also billing. It’s how they knew where to send the bill back in the days of 1-900 numbers. Because real money is involved, it’s a pretty good bet it’s usually tied to the real entity.
You, as a mere consumer, are left with caller id, which no business user cares about.
It took you a month to revert after multiple complaints. You still blamed users for using the product exactly as you advertised it. And all of your official channels were completely quite for two months, whether it was about new draconian peak hour limits, or about the new defaults, or about exponentially increasing token costs.
People literally started seeing issues immediately as you changed the defaults: https://x.com/levelsio/status/2029307862493618290 And despite a huge amount of reports you still kept it for a whole month.
And then you shipped a completely untested feature with prompt cache misses and literally gaslit users and blamed users for using the product as advertised.
Now untold umber of people have been hit by these changes, so as an apology you reset usage limits three hours before they would reset anyway.
Good job.
Edit. By the way, a very telling sentence from the report:
--- start quote ---
We’ll ensure that a larger share of internal staff use the exact public build of Claude Code (as opposed to the version we use to test new features); and we'll make improvements to our Code Review tool that we use internally
--- end quote ---
Translation: no one is using or even testing the product we ship, and we blindly trust Claude Code to review and find bugs for us. Last one isn't even a translation: https://x.com/bcherny/status/2017742750473720121
> Nasdaq was the first to consider a rule change that would grant mega IPOs like SpaceX early admission to its flagship Nasdaq-100 index. The exchange and index provider began a consultation period in February to assess the viability of and industry response to a proposed “fast entry” rule. The change was approved on March 30 and will be effective on May 1.
I wouldn't be surprised if folks start complaining to California government agencies like the Department of Consumer Affairs, and they take it seriously.
There is a lot of political capital to be earned by appearing to be "tough" on AI companies.
pretty much none of these big providers are offering the guarantees needed to be taken seriously in workplaces right now. the technology itself isn't offering the deterministic guarantees that should warrant it in the workplace right now. problem is everyone's foot is just on the gas. even if your workplace isnt paying for it, people are just straight up rolling their own personal claude accounts to do work at orgs.
ive been trying to make the case all year that if we're going to let employees do shit with ai, lets try claude. in the past like.. 2-3 weeks all that goodwill has basically evaporated.
local inference needs to take off asap because all of these entities actually suck and i wouldn't trust a single sla with anthropic. they are not acting like a serious company right now, this is a joke.
Copilot's per-prompt pricing model is overwhelmingly the best value for money right now, although they more than doubled the price of Opus 4.7 compared to 4.6 and completely removed 4.5 and 4.6, which erodes their lead somewhat. Copilot restricts context much more than CC, but I still find it to be plenty capable. I've occasionally managed to give Copilot/Opus 4.6 a prompt that kept it productive for a full work day for a cost of just ~$0.10.
My company has Github Business, which provides all major models.
Claude runs out of tokens usually in the middle of the month. I can use it with opencode.
Then one extra Claude Pro.
Then also:
Codex business for a test month. gpt-5.4 was excellent.
And since kimi-2.6 I bought their monthly for testing. Works fine. Not as excellent as opus, but usable. Fixes tricky problems by its own.
Not an OpenRouter subscription yet. Those models are either free on opencode anyway, or not good enough.
The free MinMax 2.5 on opencode is pretty usable also.
I know of a very serious business that deployed Max to all of their developers. API pricing, from what I see, can become more expensive than just hiring another dev.
We're also not seeing much difference in real throughput at an agency. Everyone is getting decent results, output wise but it just doesn't seem to change the outcomes that much. There is also a mixed incentive at an agency, because a reduction in hours spent is a reduction in revenue.
It will be interesting to see how it all plays out, but I suspect if cost continues to increase and output only improves incrementally from here, that the cost will be the final decider rather than the competence.
I could see it being a thing we use only sometimes, for some things, but ultimately remain reliant on developers to get the work through the pipeline.
That's true but employees offer more than code output, and you still need people operating the "machine" at this stage.
I am interested in how corporate politics evolves in this new environment. Usually all the way up the chain, managers and directors use head count as a measure of power and influence (and compensation). Who's going to pay a director top level pay when all they're doing is funneling requirements to various agents? That seems like a technical role that isn't particularly aligned with the soft skills management excel with either.
Management seems disconnected from reality. Real employees accumulate tribal knowledge, have an almost infinite context, and don’t keep disabling unit tests because they don’t pass. They don’t really cost money if it’s information workers that build almost all of the modern service industry. It’s management that we should see as a cost center.
Well yes it is expensive, but companies are paying for that. It is far more expensive than the Max and it does go up to or more in some cases compared to the employee salary.
Larger companies are using Claude through AWS Bedrock and are willing to easily pay $5k+ per engineer per month for it.
The thinking appears to be that a model that can do the work of a developer must be worth a significant share of a developer salary. I think this idea is flawed.
Developer salaries are driven up by scarcity - scarcity of developer skills overall and scarcity of developer skills in specific places like California. If AI models destroy the scarcity then the price worth paying for a coding agent will drop dramatically.
Maybe Anthropic can get away with it for a couple of months. But this will not last.
But if e.g. a developer can do 50% more, shouldn't it be worth it to pay up to 50% of developer salary for the product?
So the % is debatable of course. There's cases where an AI agent can save weeks worth of investigation, there's cases where you are mainly blocked due to processes, and many different circumstances. It's up to every company on their own to decide it. But if they decide it's 50%, why shouldn't they spend 50% of salary on it?
Like imagine a large company with thousands of microservices. You need to build a feature, before you had to setup cross timezone team meetings to figure out who owns what, what is happening in each microservice, how it all connects together. But now you can essentially send an AI Agent to scour and prepare all this material for you, which theoretically in this planning could save hours of back and forth meetings.
If 1 hour / 1 eng costs $200, then a 10 people 1h meeting avoided would save $200 x 10 = $2000 alone.
I don't see it as a replacement for dev, it's more of a multiplier.
I believe what GP is saying is that there is a price calculation today, but then if enough devs become unemployed, their salary will go down, making them more competitive by finops calculations, at which point the Ai prices will have to come down as well. Where equilibrium is, no one knows
I think it's an interest hypothesis but I don't think it works out like that. AI prices aren't priced in relation to the work they do, they're priced in relation to tokens (input/output). As long as it's cheaper to use those tokens than it is to pay a dev, then dev salaries will likely fall. Whenever it becomes cheaper to hire a dev than to use AI, a company will likely just hire a dev. But AI prices won't fall just because dev salaries have.
Yeah, I mean I think there's just too much work and I think devs who are effective with AI won't become unemployed, but their productivity will be multiplied. More will be expected of companies in terms of output, so it will be just more output.
>But if e.g. a developer can do 50% more, shouldn't it be worth it to pay up to 50% of developer salary for the product?
That's the upper bound but it's not the market price.
Accounting software (+ hardware) doesn't cost nearly as much as the accountant hours it saves. Accountant salaries are simply not a relevant yardstick for the price that software vendors can charge for accounting software.
Equally, the market price for code generators will not stay anywhere near the price of developer hours it saves. It will be determined by competition.
Because accounting software is cheaper due to competition. In software eng Claude is currently strongest and there's higher costs involved than normal SaaS. There are many fields in which the tools/machinery cost more than the salaries of people.
>Because accounting software is cheaper due to competition. In software eng Claude is currently strongest and there's higher costs involved than normal SaaS.
Yes, competition not salaries determines the margins that software vendors can charge. That's exactly what I'm saying.
My expectation is that competition between coding agents will stay strong and costs for the current level of software engineering performance will fall.
>There are many fields in which the tools/machinery cost more than the salaries of people.
Of course not, but they cost more than the person using them, it's multiplying the productivity of that person, so if AI multiplied enough as well it would make sense.
The question was whether the salaries that would have been paid for the working hours replaced by the machine are a realistic yardstick for the market price of the machine.
I am pretty sure that a hole in the pocket in the order of 50 000 000 USD/month (assuming around 20 000 people using AI in not the smartest or most optimized way possible, therefore burning A LOT of tokens) will be noticeable by even the largest companies.
It is noticeable and even promoted, large companies do pay such sums for the API, like $5k+ per person per month. Not every eng is using AI that much already, but companies are clearly willing to pay those sums.
I just cancelled before seeing this news. i was already pissed about constantly hitting limits on the 20 a month plan and looking for alternatives and this seals the deal. Bye bye!
Yea, I've been fine so far, but something happened with Opus 4.6 and especially 4.7. I was able to do some actual work with a Pro plan before. Now it's just pure anxiety of hitting the limits.
With Sonnet it's a bit better, but I can get the same performance with GPT-5.4.
Now I'm pretty much paying the 20€ for Claude Pro so it can plan/review stuff and then I use pi.dev + GPT-5.4 for the actual work.
I just paid for Pro for the first time 24 hours ago. Its been great, but the limits are crazy. It's nice not dealing with ChatGPTs sycophantic gaslighting, and not having random bugs.
That said, I seem to be caught in that 2% test if I open in a private tab. What nonsense. I wouldn't be paying for Claude if it wasn't for its quality abilities, which necessarily includes Claude Code.
I can easily hit the weekly limit on Claude even on the $200 plan. I have yet to ever hit a rate limit on Codex $100. And the results are almost as good. And don't get me started on Anthropic's extra usage scam.
Not the op, but it’s fairly easy to hit if you automate a kanban and have some stuff you want to get done. All those little “wouldn’t it be great if” tasks that show up after doing a big task become very doable, it just soaks your tokens.
Look, the firearms industry has worked in the past to ban competitors but I really don't think they see 3d printed firearms as competitors. The market there is tiny. Meanwhile Everytown is a gun control organization that wants to ban all guns everywhere and again, is documented to be the one behind this push.
Is this not like a schizo conspiracy theory? Like why would the grocery chains fund the bag bans? So they can save a tiny amount of money on paying for bags?
But having to bring your own bags limits how much you can buy. If someone has a plan to just use their own bags, they will likely forgo purchases at a higher rate than if the bag is not in the equation for them.
It's not obvious to me that the buying limit effect sales decrease would not outweigh the savings on physical bag purchases. Maybe I'm not following?
The grocery chain campaign is well documented. Just search for it.
The short answer is that bags are a non-trivial cost for the larger chains. Now, they get to charge for them at an astounding markup and no longer have to compete with any grocery store on this point. All grocery stores are affected equally, which means it is disproportionately damaging to mom-and-pop stores and smaller chains.
Grocery stores _absolutely_ supported the bag bans, though they weren't the initial groups asking for them. Similar to how the cigarette companies liked the TV ad bans--if nobody could advertise on TV than the playing field would be level and their profits all went up from decreased costs.
Some of them supported them because they were pressured into it. Grocery bans of bags and payment etc. are a PITA for customers. No business in it's right mind would force that on their customer unless they were required to. Passing the cost on to their customer is not an issue. Supporting laws requiring payment etc. are cost benefit analysis. Is it worth fighting the bad PR etc or go along. But obviously they wouldn't have provided the bags in the first place if it was not a competitive benefit to them.
People here are talking about two kinds of laws: minimum bag charges and outright bag bans.
In some jurisdictions, a grocery store isn’t allowed to give you a traditional disposable bag at any price. In others, there’s either a bag tax or a minimum price, usually five or ten cents, a store must charge per bag.
I assumed that the grocers would want to offer bags. Making it more easy to drop in and buy something is going to be significantly more money than the cost of bags per a customer.
What percent of the overall purchase profit is 10 cents, and how much does it reduce in sales by adding friction? Surely there must be data on this, has nobody looked into it in public?
Also, it’s been awhile but don’t plastic bags make it easier to carry more things at once because the handles are so thin and flexible? And I don’t remember handles ever ripping on plastic grocery bags.
If the math works out in favor of charging for bags it would imply that the margin is incredibly thin in the literal sense of the word incredible. Like the average purchase has so little profit that 10 cents per bag is meaningful? What is the average profit on a bag of items or on an average purchase? Surely more than 10 cents, no? Like I know grocery stores are notoriously low margin, but that’s among businesses it’s not almost 0 in an absolute sense.
In California, you are required to submit to chemical testing (breath, urine, or blood — I don’t recall the rules for which applies in which situations). However, you are not required to otherwise talk to or perform the absurd procedure of the field sobriety test (“you have the right to remain silent”).
reply