Deep Dive into Preference Learning Agents