Poster
in
Workshop: Workshop on Behavioral Machine Learning
Selective Preference Aggregation
Shreyas Kadekodi · Hayden McTavish · Berk Ustun
Many tasks in machine learning are shaped by procedures where items are ordered based on the preferences of a group—whether for funding proposals, recommending products, or improving the helpfulness of responses from a large language model. In these settings, individuals express their preferences over items as votes, ratings, and rankings. Given a dataset of individual preferences, preference aggregation methods rank the items in a way that summarizes the group’s collective preferences. Standard methods for preference aggregation are designed to arbitrate dissent. When individuals express conflicting preferences between items, these methods will rank one item over another—resolving disagreement based on axioms of social choice. In this work, we introduce a paradigm for \emph{selective aggregation} in which we \emph{abstain} rather than arbitrate in cases of dissent. Given a dataset of ordinal preferences from a group of judges, we aggregate their preferences into a \emph{selective ranking}—i.e., a \emph{partial order} over items where every comparison is aligned with (1-\dissent{}\%) of judges. Our approach handles missing data effectively, ensuring robust performance across diverse settings, suited to alignment tasks. We develop an algorithm to construct selective rankings that achieve all possible trade-offs between comparability and disagreement.