RLHF (Reinforcement Learning from Human Feedback)
A technique for training AI models such as ChatGPT by incorporating human preferences into the learning process: human labelers rank candidate model outputs, a reward model is trained to predict those rankings, and the model is then fine-tuned with reinforcement learning (commonly PPO) to produce outputs that score highly under the learned reward.
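A minimal sketch of the reward-modeling stage, assuming PyTorch and the standard Bradley-Terry pairwise preference loss; the random embeddings and the `RewardModel` class here are illustrative stand-ins, not a real RLHF implementation:

```python
import torch
import torch.nn as nn

class RewardModel(nn.Module):
    """Hypothetical reward model: maps a response embedding to a scalar reward."""
    def __init__(self, dim: int):
        super().__init__()
        self.score = nn.Linear(dim, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.score(x).squeeze(-1)

model = RewardModel(dim=8)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Dummy batch: embeddings of responses a human labeler preferred vs. rejected.
chosen = torch.randn(4, 8)
rejected = torch.randn(4, 8)

# Pairwise preference loss: -log sigmoid(r_chosen - r_rejected).
# Minimizing it pushes the model to score human-preferred responses higher,
# turning preference rankings into a reward signal for the later RL stage.
loss = -torch.nn.functional.logsigmoid(model(chosen) - model(rejected)).mean()
loss.backward()
optimizer.step()
```

In a full pipeline, the trained reward model would then score the language model's generations during reinforcement-learning fine-tuning.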