Improve LLM performance with human and AI feedback on Amazon SageMaker for Amazon Engineering
AWS Machine Learning - AI
APRIL 24, 2024
To increase training samples for better learning, we also used another LLM to generate feedback scores. We present the reinforcement learning process and the benchmarking results to demonstrate the LLM performance improvement. This method addressed the RAG limitation and further improved the bot response quality.
Let's personalize your content