Skip to main navigation Skip to search Skip to main content

Fact Discovery for Text Data

  • Chen Ye*
  • , Hongzhi Wang
  • , Guojun Dai
  • *Corresponding author for this work
  • Hangzhou Dianzi University
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

Abstract

Fact extraction, which aims to extract (entity, attribute, value)-tuples from massive text corpora, is crucial in text data mining. Recent approaches focus on extracting facts by mining textual patterns with semantic types, where the quality of a pattern is evaluated based on content-based criteria, such as frequency. However, these approaches overlook the dimension of pattern reliability, which reflects how likely the extracted facts are correct. As a result, a pattern of good content quality (e.g., high frequency) may still extract incorrect facts. In this chapter, we consider both pattern reliability and fact trustworthiness in addressing the pattern-based fact extraction problem [1]. We give a motive example and the problem definition in Sects. 5.1 and 5.2, respectively. We detail the CNN-LSTM model design and present the experimental results in Sect. 5.3. Next, we conclude in Sect. 5.4.

Original languageEnglish
Title of host publicationSpringerBriefs in Computer Science
PublisherSpringer
Pages69-83
Number of pages15
DOIs
StatePublished - 2022
Externally publishedYes

Publication series

NameSpringerBriefs in Computer Science
ISSN (Print)2191-5768
ISSN (Electronic)2191-5776

Keywords

  • Fact extraction
  • Pattern discovery
  • Text data

Fingerprint

Dive into the research topics of 'Fact Discovery for Text Data'. Together they form a unique fingerprint.

Cite this