LoRA can work with big models. But I mean sample-efficient RL.