DHPA: Dynamic human preference analytics framework: A case study on taxi drivers' learning curve analysis

  • Menghai Pan
  • Weixiao Huang
  • Yanhua Li
  • Xun Zhou
  • Zhenming Liu
  • Rui Song
  • Hui Lu*
  • Zhihong Tian
  • Jun Luo

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Many real-world human behaviors can be modeled and characterized as sequential decision-making processes, such as a taxi driver's choices of working regions and times. Each driver possesses unique preferences over these sequential choices and improves working efficiency over time. Understanding the dynamics of such preferences helps accelerate the learning process of taxi drivers. Prior works on taxi operation management mostly focus on finding optimal driving strategies or routes, lacking in-depth analysis of what drivers learn during the process and how it affects their performance. In this work, we make the first attempt to establish Dynamic Human Preference Analytics (DHPA). We inversely learn the taxi drivers' preferences from data and characterize the dynamics of such preferences over time. We extract two types of features (i.e., profile features and habit features) to model the decision space of drivers. Then, through inverse reinforcement learning, we learn the preferences of drivers with respect to these features. The results illustrate that self-improving drivers tend to keep adjusting their preferences for habit features to increase their earning efficiency while keeping their preferences for profile features invariant. In contrast, experienced drivers have stable preferences over time, and exploring drivers tend to adjust their preferences randomly over time.

Original language: English
Article number: 8
Journal: ACM Transactions on Intelligent Systems and Technology
Volume: 11
Issue number: 1
State: Published - 17 Jan 2020
Externally published: Yes

Keywords

  • Urban computing
  • Inverse reinforcement learning
  • Preference dynamics
