TY - GEN
T1 - Resume Extraction and Validation from Public Information
AU - Kong, Xiangyi
AU - Gu, Zhaoquan
AU - Wang, Le
AU - Yin, Lihua
AU - Li, Shudong
AU - Han, Weihong
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2020/12/4
Y1 - 2020/12/4
N2 - With the development of information technologies, knowledge graphs have attracted much attention from more and more fields, due to their usage in information retrieval, recommendation, etc. However, the scale of many knowledge graphs is relatively small due to lack of huge data, it is difficult to perform text understanding and knowledge reasoning on them. Consequently, it is necessary to combine different knowledge graphs. Nevertheless, in some scenarios, an attribute may have multiple values in different knowledge graphs, and some values may be wrong. Knowledge validation is a core and difficult point in knowledge graph fusion. In this paper, we study resume extraction and validation from many public resources. Specifically, we extracted personnel information from four encyclopedia websites; we also propose a new multi-truth discovery and validation method, which evaluates the credibility of attribute values through Internet public information. The proposed method can not only be applied to the research of knowledge graph fusion, but also can be widely adopted in various fields such as rumor discovery, text understanding, knowledge reasoning.
AB - With the development of information technologies, knowledge graphs have attracted much attention from more and more fields, due to their usage in information retrieval, recommendation, etc. However, the scale of many knowledge graphs is relatively small due to lack of huge data, it is difficult to perform text understanding and knowledge reasoning on them. Consequently, it is necessary to combine different knowledge graphs. Nevertheless, in some scenarios, an attribute may have multiple values in different knowledge graphs, and some values may be wrong. Knowledge validation is a core and difficult point in knowledge graph fusion. In this paper, we study resume extraction and validation from many public resources. Specifically, we extracted personnel information from four encyclopedia websites; we also propose a new multi-truth discovery and validation method, which evaluates the credibility of attribute values through Internet public information. The proposed method can not only be applied to the research of knowledge graph fusion, but also can be widely adopted in various fields such as rumor discovery, text understanding, knowledge reasoning.
KW - Extraction
KW - Knowledge Graphs
KW - Multi-truth Discovery
KW - Validation
UR - https://www.scopus.com/pages/publications/85098962632
U2 - 10.1145/3444370.3444597
DO - 10.1145/3444370.3444597
M3 - 会议稿件
AN - SCOPUS:85098962632
T3 - ACM International Conference Proceeding Series
SP - 355
EP - 359
BT - Proceedings of the 2020 International Conference on Cyberspace Innovation of Advanced Technologies, CIAT 2020
PB - Association for Computing Machinery
T2 - 2020 International Conference on Cyberspace Innovation of Advanced Technologies, CIAT 2020
Y2 - 4 December 2020 through 6 December 2020
ER -