
Coming back to this. LoRA training is applied only to the attention layers, and per the article that was sufficient for memorization. So we wouldn't be updating all of the model's weights in some kind of constant-context one-shot learning scheme.
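For anyone curious, restricting LoRA to attention is just a matter of which modules you target when attaching the adapters. A minimal sketch using Hugging Face PEFT (the base model and the projection-module names are assumptions; they vary by architecture):

  from transformers import AutoModelForCausalLM
  from peft import LoraConfig, get_peft_model

  # Hypothetical base model; any causal LM works here.
  model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

  # Attach LoRA adapters only to the attention projections.
  # MLP blocks and all base weights stay frozen.
  config = LoraConfig(
      r=8,
      lora_alpha=16,
      target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # attention only
      lora_dropout=0.05,
      task_type="CAUSAL_LM",
  )
  model = get_peft_model(model, config)
  model.print_trainable_parameters()  # only the adapter weights are trainable

With target_modules set like this, gradient updates flow only into the low-rank adapter matrices on the attention projections, which is the setup the article describes.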

