Multi-Cultural Preference Optimization of Reward Models

Published in review, 2025

Minsik Oh, Advit Deepak, Sophie Wu, Douwe Kiela, Ekaterina Shutova [Paper]