AI · AWS AI · 3h ago
Reinforcement fine-tuning with LLM-as-a-judge
In this post, we take a deeper look at how reinforcement learning from AI feedback (RLAIF), or RL with LLM-as-a-judge, works effectively with Amazon Nova models.
Source: AWS AI