remilouf · 2026-05-07T09:16:00

Author here. Sorry my writing is tedious. Next time I’ll use AI to make it more readable.

remilouf · 2026-04-15T09:09:18

> Ironically LLMs solve the MxN problem he's complaining about

Enlighten me please

remilouf · 2026-04-14T17:05:53

Oops, sorry.

remilouf · 2026-04-14T16:14:51

Author here. You're right, it's not a hard problem, but a particularly annoying one.

remilouf · 2026-04-03T13:45:59

I haven't always done this, and the knowledge base used to visibly degrade over time. Reviewing a PR does not take a long time, maybe a few minutes, and this compounds over time.

remilouf · on Oct 20, 2024

This is actually pretty funny.

remilouf · on Sept 10, 2024

That’d be a pretty inefficient way to generate bullshit at scale

zero-sharp · on Sept 11, 2024

automating the creation of false testimonials is inefficient at scale? go on ...

what's the alternative?

remilouf · on May 2, 2024

LLM evaluations are very sensitive to the details of the prompt's structure. This post shows how using structured generation reduces both the variance of the results and the shifts in ranking.

remilouf · on April 5, 2024

Looks like it’s quite the opposite: http://blog.dottxt.co/performance-gsm8k.html

remilouf · on March 15, 2024

What do you mean by "semantic dimension"?