Steerable Preference Optimization of Reward ModelsPublished in Pluralistic Alignment @ ICML, 2026Minsik Oh, Advit Deepak, Sophie Wu, Douwe Kiela, Ekaterina Shutova [Paper]Share on Twitter Facebook LinkedIn Previous Next