Skip to main navigation Skip to search Skip to main content

A convolutional attention model for text classification

  • Jiachen Du
  • , Lin Gui
  • , Ruifeng Xu*
  • , Yulan He
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen
  • Hong Kong Polytechnic University
  • Guangdong Provincial Engineering Technology Research Center for Data Science
  • Aston University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Neural network models with attention mechanism have shown their efficiencies on various tasks. However, there is little research work on attention mechanism for text classification and existing attention model for text classification lacks of cognitive intuition and mathematical explanation. In this paper, we propose a new architecture of neural network based on the attention model for text classification. In particular, we show that the convolutional neural network (CNN) is a reasonable model for extracting attentions from text sequences in mathematics. We then propose a novel attention model base on CNN and introduce a new network architecture which combines recurrent neural network with our CNN-based attention model. Experimental results on five datasets show that our proposed models can accurately capture the salient parts of sentences to improve the performance of text classification.

Original languageEnglish
Title of host publicationNatural Language Processing and Chinese Computing - 6th CCF International Conference, NLPCC 2017, Proceedings
EditorsXuanjing Huang, Jing Jiang, Dongyan Zhao, Yansong Feng, Yu Hong
PublisherSpringer Verlag
Pages183-195
Number of pages13
ISBN (Print)9783319736174
DOIs
StatePublished - 2018
Externally publishedYes
Event6th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2017 - Dalian, China
Duration: 8 Nov 201712 Nov 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10619 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2017
Country/TerritoryChina
CityDalian
Period8/11/1712/11/17

Fingerprint

Dive into the research topics of 'A convolutional attention model for text classification'. Together they form a unique fingerprint.

Cite this