Skip to main navigation Skip to search Skip to main content

Discovering frequent subgraphs over uncertain graph databases under probabilistic semantics

  • Zhaonian Zou*
  • , Hong Gao
  • , Jianzhong Li
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Frequent subgraph mining has been extensively studied on certain graph data. However, uncertainties are inherently accompanied with graph data in practice, and there is very few work on mining uncertain graph data. This paper investigates frequent subgraph mining on uncertain graphs under probabilistic semantics. Specifically, a measure called φ-frequent probability is introduced to evaluate the degree of recurrence of subgraphs. Given a set of uncertain graphs and two numbers 0 < φ, τ < 1, the goal is to quickly find all subgraphs with φ-frequent probability at least τ. Due to the NP-hardness of the problem, an approximate mining algorithm is proposed for this problem. Let 0 < δ < 1 be a parameter. The algorithm guarantees to find any frequent subgraph S with probability at least (1-δ/2)s, where s is the number of edges of S. In addition, it is thoroughly discussed how to set δ to guarantee the overall approximation quality of the algorithm. The extensive experiments on real uncertain graph data verify that the algorithm is efficient and that the mining results have very high quality.

Original languageEnglish
Title of host publicationKDD'10 - Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data
Pages633-642
Number of pages10
DOIs
StatePublished - 2010
Externally publishedYes
Event16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD-2010 - Washington, DC, United States
Duration: 25 Jul 201028 Jul 2010

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD-2010
Country/TerritoryUnited States
CityWashington, DC
Period25/07/1028/07/10

Keywords

  • Frequent subgraph
  • Probabilistic semantics
  • Uncertain graph

Fingerprint

Dive into the research topics of 'Discovering frequent subgraphs over uncertain graph databases under probabilistic semantics'. Together they form a unique fingerprint.

Cite this