Skip to main navigation Skip to search Skip to main content

CogBERT: Cognition-Guided Pre-trained Language Models

  • Harbin Institute of Technology

Research output: Contribution to journalConference articlepeer-review

Abstract

We study the problem of integrating cognitive language processing signals (e.g., eyetracking or EEG data) into pre-trained models like BERT. Existing methods typically fine-tune pre-trained models on cognitive data, ignoring the semantic gap between the texts and cognitive signals. To fill the gap, we propose CogBERT, a framework that can induce fine-grained cognitive features from cognitive data and incorporate cognitive features into BERT by adaptively adjusting the weight of cognitive features for different NLP tasks. Extensive experiments show that: (1) Cognition-guided pre-trained models can consistently perform better than basic pre-trained models on ten NLP tasks. (2) Different cognitive features contribute differently to different NLP tasks. Based on this observation, we give a fine-grained explanation of why cognitive data is helpful for NLP. (3) Different transformer layers of pre-trained models should encode different cognitive features, with word-level cognitive features at the bottom and semantic-level cognitive features at the top. (4) Attention visualization demonstrates that CogBERT can align with human gaze patterns and improves its natural language understanding ability.

Original languageEnglish
Pages (from-to)3210-3225
Number of pages16
JournalProceedings - International Conference on Computational Linguistics, COLING
Volume29
Issue number1
StatePublished - 2022
Event29th International Conference on Computational Linguistics, COLING 2022 - Hybrid, Gyeongju, Korea, Republic of
Duration: 12 Oct 202217 Oct 2022

Fingerprint

Dive into the research topics of 'CogBERT: Cognition-Guided Pre-trained Language Models'. Together they form a unique fingerprint.

Cite this