Skip to main navigation Skip to search Skip to main content

Approximate neural subgraph counting for similar queries

  • Bin Yang
  • , Zhaonian Zou*
  • , Jianxiong Ye
  • *Corresponding author for this work
  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Existing work on approximate subgraph counting mainly focuses on isolate-query settings, where queries are independent. However, in some scenarios, similar queries can be processed frequently due to user preferences or application requirements. The approximate subgraph counting problem in similar query scenarios has not been studied yet. In this paper, we study the approximate subgraph counting problem in similar query scenarios and propose a lightweight learning-based subgraph counting framework, SubMB, to solve this problem. SubMB can estimate the count for arbitrary-size similar query patterns effectively within polynomial time complexity, and avoids complicated computations on the data graph. SubMB adopts a multi-level substructure extraction method and an edge-based query transformation method to get information related to the count and uses a graph attention blocks estimator to estimate query counts. We conduct extensive experiments on 7 real-world datasets with varying sizes ranging from 103 to 106 to evaluate the effectiveness and efficiency of SubMB. The experimental results show that SubMB can reduce the q-error by 1-3 orders of magnitude on multiple percentiles (including the median, 75%, 85%, 90% percentile) and by 35%–72% on the max q-error compared to the baseline methods. In addition, SubMB can reduce the offline training time by 79%–98% compared to the state-of-the-art methods.

Original languageEnglish
Article number104142
JournalInformation Processing and Management
Volume62
Issue number4
DOIs
StatePublished - Jul 2025

Keywords

  • Deep learning
  • Maximum common subgraph
  • Similar queries
  • Subgraph counting

Fingerprint

Dive into the research topics of 'Approximate neural subgraph counting for similar queries'. Together they form a unique fingerprint.

Cite this