Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

No base model? disappointed.


The base model is Llama 3.1 70B


It is probably the same base model as Llama 3.0.

They mention postraining improvements.


interesting comment... what are you doing with base models? Are you a "finetuner"? I have been trying my hand with finetunes on instruct models and the results have been ok, but not awesome. I have a base model downloading now to give that a proper shot.


I'm not them but I still prefer a text completion style of prompting rather than a baked in pre-prompt structure assuming only a 'chat' style metaphor of interaction.


Base models are useful in research to see the effect of instruction tuning




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: