Interpreting Language Model Preferences Through the Lens of Decision Trees

A decision-tree perspective for interpreting LLM preference mechanisms.

January 22, 2025 · 16 min · Min Li