While the plugins do support the creation of RLHF datasets for model finetuning, the plugins themselves don't currently use a custom-trained model. They support all major LLM providers (including local). I've found that with the right prompts, the frontier models are shockingly effective. And they are progressing much faster than any custom training effort I could shoestring together. As the models improve, the plugins improve.
reply