Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's simple. Don't ingest more than 40KB at a time into its LLM's RAG pipe and its hallucination goes way, way down.

Preferably like not at the start and best not to do more than 40KB at a time at all.

That's how I learned how to deal with nftables' 120KB parser_bison.y file by breaking them up into clean sections.

All of a sudden, a fully-deterministic LL(1) full semantic pathway of nftables' CLI syntax appears before my very eye (and spent hours validating it): 100% and test generators now can permutate crazy test cases with relative ease.

Cue in Joe Walsh's "Life's Been Good To Me".



Why 40kb?


Cheap public offering of their expensive data center is that sweet spot and cutoff at 40KB.


and doesn't it depend on the LLM?


If you have your Pro or private LLM, then it's a tad bit bigger.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: