TY - CHAP
T1 - Web information integration based on compressed XML
AU - Wang, Hongzhi
AU - Li, Jianzhong
AU - He, Zhenying
AU - Luo, Jizhou
PY - 2003
Y1 - 2003
N2 - Nowadays, information integration to web data sources and XML becomes a favorite information exchange format New application motivates the problems that massive information is often transmitted in network and must be processed in limited buffer in mediator. To process query on massive data from web data source effectively, we present a method of XML compression based on edit distance for information transmission in information integration. By compressing XML, this method can reduce both the transmission time and buffer space. Two different strategies of XML compression for transmission and process in mediator are designed. Optimization of the combination of these strategies is discussed. We also propose the query execution algorithms on compressed XML data in buffer of mediator. We focus on main operators of data from wrapper in mediator, namely sort, union, join and aggregation. Implementation of these operators on compressed data using two different methods is described in this paper.
AB - Nowadays, information integration to web data sources and XML becomes a favorite information exchange format New application motivates the problems that massive information is often transmitted in network and must be processed in limited buffer in mediator. To process query on massive data from web data source effectively, we present a method of XML compression based on edit distance for information transmission in information integration. By compressing XML, this method can reduce both the transmission time and buffer space. Two different strategies of XML compression for transmission and process in mediator are designed. Optimization of the combination of these strategies is discussed. We also propose the query execution algorithms on compressed XML data in buffer of mediator. We focus on main operators of data from wrapper in mediator, namely sort, union, join and aggregation. Implementation of these operators on compressed data using two different methods is described in this paper.
UR - https://www.scopus.com/pages/publications/35248880581
U2 - 10.1007/978-3-540-39845-5_11
DO - 10.1007/978-3-540-39845-5_11
M3 - 章节
AN - SCOPUS:35248880581
SN - 3540201114
SN - 9783540201113
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 122
EP - 137
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
A2 - Bianchi-Berthouze, Nadia
PB - Springer Verlag
ER -