Evaluating Large Language Models on Gender-Occupational Stereotypes Using the Wino Bias Test
John Snow Labs
NOVEMBER 1, 2023
In this blog post, we dive into testing the WinoBias dataset on LLMs, examining language models’ handling of gender and occupational roles, evaluation metrics, and the wider implications. It aims for consistent accuracy in coreference decisions across stereotypical and non-stereotypical scenarios by dividing data into two categories.
Let's personalize your content