TY - GEN
T1 - A novel approach for efficient supergraph query processing on graph databases
AU - Zhang, Shuo
AU - Li, Jianzhong
AU - Gao, Hong
AU - Zou, Zhaonian
PY - 2009
Y1 - 2009
N2 - In recent years, large amount of data modeled by graphs, namely graph data, have been collected in various domains. Efficiently processing queries on graph databases has attracted a lot of research attentions. Supergraph query is a kind of new and important queries in practice. A supergraph query, Paq;, on a graph database D is to retrieve all graphs in D such that Paq; is a supergraph of them. Because the number of graphs in databases is large and subgraph isomorphism testing is NP-complete, efficiently processing such queries is a big challenge. This paper first proposes an optimal compact method for organizing graph databases. Common subgraphs of the graphs in a database are stored only once in the compact organization of the database, in order to reduce the overall cost of subgraph isomorphism testings from stored graphs to queries during query processing. Then, an exact algorithm and an approximate algorithm for generating significant feature set with optimal order are proposed to construct indices on graph databases. The optimal order on the feature set is to reduce the number of subgraph isomorphism testings during query processing. Based on the compact organization of graph databases, a novel algorithm of testing subgraph isomorphisms from multiple graphs to one graph is presented. Finally, based on all these techniques, a query processing method is proposed. Analytical and experimental results show that the proposed algorithms outperform the existing similar algorithms by one to two orders of magnitude.
AB - In recent years, large amount of data modeled by graphs, namely graph data, have been collected in various domains. Efficiently processing queries on graph databases has attracted a lot of research attentions. Supergraph query is a kind of new and important queries in practice. A supergraph query, Paq;, on a graph database D is to retrieve all graphs in D such that Paq; is a supergraph of them. Because the number of graphs in databases is large and subgraph isomorphism testing is NP-complete, efficiently processing such queries is a big challenge. This paper first proposes an optimal compact method for organizing graph databases. Common subgraphs of the graphs in a database are stored only once in the compact organization of the database, in order to reduce the overall cost of subgraph isomorphism testings from stored graphs to queries during query processing. Then, an exact algorithm and an approximate algorithm for generating significant feature set with optimal order are proposed to construct indices on graph databases. The optimal order on the feature set is to reduce the number of subgraph isomorphism testings during query processing. Based on the compact organization of graph databases, a novel algorithm of testing subgraph isomorphisms from multiple graphs to one graph is presented. Finally, based on all these techniques, a query processing method is proposed. Analytical and experimental results show that the proposed algorithms outperform the existing similar algorithms by one to two orders of magnitude.
UR - https://www.scopus.com/pages/publications/70349160269
U2 - 10.1145/1516360.1516385
DO - 10.1145/1516360.1516385
M3 - 会议稿件
AN - SCOPUS:70349160269
SN - 9781605584225
T3 - Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
SP - 204
EP - 215
BT - Proceedings of the 12th International Conference on Extending Database Technology
T2 - 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
Y2 - 24 March 2009 through 26 March 2009
ER -