TY - GEN
T1 - A Generic Framework for Evaluating Gaze Representations for Gaze Estimation
AU - Lin, Xinyu
AU - Liu, Buyu
AU - Zhu, Suguo
AU - Bao, Jun
N1 - Publisher Copyright:
© 2025 ACM.
PY - 2025/6/30
Y1 - 2025/6/30
N2 - While gaze representations have shown impressive performance in gaze estimation tasks, they face significant limitations, as most existing evaluations fail to consider the generalizability of these representations in broader settings. To address this gap, we propose a generic evaluation framework specifically designed to assess the generalizability of gaze representations within the context of gaze estimation. Our framework includes detailed spatial and temporal analyses. For the former, we introduce pairwise distance metrics and linearity assessments to measure the global structure of representations. For the latter, we incorporate metrics for fixation and saccade detection, along with Cohen's Kappa to evaluate agreement. Finally, we present a unified metric that integrates the perspectives mentioned earlier. We perform extensive comparisons across six publicly available baselines, including three supervised and three unsupervised or few-shot learning approaches. Experiments on three benchmarks show that our proposed metrics strongly correlate with representation generalizability, offering an interpretable and effective evaluation tool. When combined with the mutually informative conventional accuracy-based metrics, our approach enables more informed, long-term decision-making for tasks like efficient few-shot calibration or adaptation to unseen domains.
AB - While gaze representations have shown impressive performance in gaze estimation tasks, they face significant limitations, as most existing evaluations fail to consider the generalizability of these representations in broader settings. To address this gap, we propose a generic evaluation framework specifically designed to assess the generalizability of gaze representations within the context of gaze estimation. Our framework includes detailed spatial and temporal analyses. For the former, we introduce pairwise distance metrics and linearity assessments to measure the global structure of representations. For the latter, we incorporate metrics for fixation and saccade detection, along with Cohen's Kappa to evaluate agreement. Finally, we present a unified metric that integrates the perspectives mentioned earlier. We perform extensive comparisons across six publicly available baselines, including three supervised and three unsupervised or few-shot learning approaches. Experiments on three benchmarks show that our proposed metrics strongly correlate with representation generalizability, offering an interpretable and effective evaluation tool. When combined with the mutually informative conventional accuracy-based metrics, our approach enables more informed, long-term decision-making for tasks like efficient few-shot calibration or adaptation to unseen domains.
KW - evaluation metrics
KW - gaze representation
KW - long-term decision-making
UR - https://www.scopus.com/pages/publications/105011584256
U2 - 10.1145/3731715.3733261
DO - 10.1145/3731715.3733261
M3 - 会议稿件
AN - SCOPUS:105011584256
T3 - ICMR 2025 - Proceedings of the 2025 International Conference on Multimedia Retrieval
SP - 833
EP - 841
BT - ICMR 2025 - Proceedings of the 2025 International Conference on Multimedia Retrieval
PB - Association for Computing Machinery, Inc
T2 - 2025 International Conference on Multimedia Retrieval, ICMR 2025
Y2 - 30 June 2025 through 3 July 2025
ER -