More

keeganpoppen · 2026-02-05T16:46:34 1770309994

wtf are you conversing with LLMs that you regularly are running into "state owned propaganda" in the references? my "blatant nonsense" detector is going off...

olyjohn · 2026-02-05T17:41:58 1770313318

My favorite is when it cites 5 sources, and 1 of them is a real source, and then the other 4 are short form junk that point to the first one as the source. So basically its just picked one article and summarized it for you and not picked any info from any other places. Oh and bonus points when I type the exact same prompt into a search engine, and that 1 source is the top search result anyways.

sofixa · 2026-02-05T20:15:47 1770322547

https://www.euronews.com/next/2025/11/04/ai-chatbots-are-spe...

Original study: https://www.isdglobal.org/digital-dispatch/investigation-tal...

keeganpoppen · 2026-02-04T19:00:21 1770231621

that place is absolutely gorgeous. like it or not, The Sphere is an amazing… cultural artifact, and stuff like this is absolutely the future.

keeganpoppen · 2026-02-04T18:58:17 1770231497

if llm ads become a real thing, let’s acknowledge that this is exactly what will happen in no uncertain terms.

observationist · 2026-02-04T19:16:49 1770232609

The only chance of that happening is if Altman somehow feels sufficiently shamed into abandoning the lazy enshittification track to monetization.

I don't think they have an accurate model for what they're doing - they're treating it like just another app or platform, using tools and methods designed around social media and app store analytics. They're not treating it like what it is, which is a completely novel technology with more potential than the industrial revolution for completely reshaping how humans interact with each other and the universe, fundamentally disrupting cognitive labor and access to information.

The total mismatch between what they're doing with it to monetize and what the thing actually means to civilization is the biggest signal yet that Altman might not be the right guy to run things. He's savvy and crafty and extraordinarily good at the palace intrigue and corporate maneuvering, but if AdTech is where they landed, it doesn't seem like he's got the right mental map for AI, for all he talks a good game.

bluGill · 2026-02-04T19:57:52 1770235072

There are a number of different llms - no reason they all need to do things the same. If you are replacing web search then ads are probably how you earn money. However if you are replacing the work people do for a company it makes more sense to charge for the work. I'm not sure if their current token charges are the right one, but it seems like a better track.

keeganpoppen · 2026-02-04T18:57:05 1770231425

yeah it’s either that or openai has effected a massive own-goal… im leaning toward your view, but hoping that prediction does not manifest. i would be fine with all sorts of shit in life being more expensive but ad-free… but this is certainly a priviledged take and i recognize that.

keeganpoppen · 2026-02-04T18:51:55 1770231115

uhhh i cast doubt on multi-language support as affecting latency. model size, maybe, but what is the mechanism for making latency worse? i think of model latency as O(log(model size))… but i am open to being wrong / that being a not-good mental model / educated guess.

kergonath · 2026-02-04T20:07:51 1770235671

Even model size, it’s modest. There is a lot of machinery that is going to be common for all languages. You don’t multiply model size by 2 when you double the number of supported languages.

janalsncm · 2026-02-05T00:26:56 1770251216

Well for example the last step is to softmax over all output logits, which is the same as your vocab size. You need the sum of the exponentiated values of each logit to calculate the denominator which is O(N).

Bigger impact is before that you need to project the hidden state matrix to the vocab list. Something like 4096x250000. Bigger vocab=more FLOPs.

If you’re on a GPU things are parallelized so maybe it’s not quite linear if everything fits nicely. But on a cpu you’re going to struggle more.

This is why the juiciest target when shrinking models is the token embedding table. For example AlBERT factorized the whole embedding table to two low rank matrices.

ethmarks · 2026-02-04T21:55:28 1770242128

If encoding more learned languages and grammars and dictionaries makes the model size bigger, it will also increase latency. Try running a 1B model locally and then try to run a 500B model on the same hardware. You'll notice that latency has rather a lot to do with model size.

make3 · 2026-02-04T19:06:24 1770231984

model size directly affects latency

keeganpoppen · 2026-02-04T18:49:59 1770230999

it must have some exposure to bengali— just not enough for them to advertise it. otherwise it would have a damn hard time.

keeganpoppen · 2026-02-04T18:49:07 1770230947

what a great test hahah

keeganpoppen · 2026-02-04T00:42:49 1770165769

i have absolutely no idea whatsoever what this means

keeganpoppen · 2026-02-04T00:41:53 1770165713

common sense has the misfortune of being less "common" than we would all like it to be. because some breathless hucksters are overpromising and underdelivering in the present, we may as well throw out the baby, the bath water, and the bath tub itself! who even wants computers to think like humans and automate jobs that no human would want to do? don't you appreciate the self-worth that comes from menial labor? i don't even get why we use tractors to farm when we have perfectly good beasts of burden to do the same labor!

keeganpoppen · 2026-02-04T00:37:17 1770165437

still a skill issue, not a codex issue. sure, this line of critique is also one levied by tech bros who want to transfer your company's balance sheet from salaries to ai-SaaS(-ery), but in what world does that automatically make the tech fraudulent or even deficient? and since when is not wanting to develop a skill a reasonable substitute for anything? if my doctor decided they didn't want to keep up on medical advances, i would find a different doctor. but yet somehow finding fault with an ai because it can't read your mind and, in response to that adversity, refusing to introspect at all about why that might be and blaming it on the technology is a reasonable critique? somehow we have magically discovered a technology to manufacture cognition from nothing more than the intricate weaving of silicon, dopants, et al., and the takeaway is that it sucks because it is too slow, doesn't get everything exactly right, etc.? and the craziest part is that the more time you spend with it, the better intuition you get for getting whatever it is you want out of it. but, yeah... let's lend even more of an ear to the head-in-sand crowd-- that's where the real thought leaders are. you don't have to be an ai techno-utopian maximalist to see the profound worthiness and promise of the technology; these things are manifestly self-evident.