TY - GEN
T1 - Chinese document image retrieval based on recognition candidates
AU - Jia, Xuhui
AU - Xia, Yong
AU - Zhou, Rui
AU - Liang, Hongwei
PY - 2012
Y1 - 2012
N2 - Because the recognition rate for degraded Chinese documents is low, retrieval performance is poor when it relies directly on OCR results. This paper proposes an indexing method that combines n-grams with recognition candidates to improve retrieval performance. To simplify testing, the paper also presents a method to automatically generate the ground truth of document images, synthesized degraded document images, and the ground truth of recognition candidates. Several large-scale synthesized document image collections are built and used, and the experimental results show that retrieval performance improves for collections with both high and low OCR error rates.
AB - Because the recognition rate for degraded Chinese documents is low, retrieval performance is poor when it relies directly on OCR results. This paper proposes an indexing method that combines n-grams with recognition candidates to improve retrieval performance. To simplify testing, the paper also presents a method to automatically generate the ground truth of document images, synthesized degraded document images, and the ground truth of recognition candidates. Several large-scale synthesized document image collections are built and used, and the experimental results show that retrieval performance improves for collections with both high and low OCR error rates.
KW - Chinese document image retrieval
KW - indexing method with n-gram and recognition candidates
KW - synthesized degraded document image
UR - https://www.scopus.com/pages/publications/84872897159
U2 - 10.1109/FSKD.2012.6233763
DO - 10.1109/FSKD.2012.6233763
M3 - Conference contribution
AN - SCOPUS:84872897159
SN - 9781467300223
T3 - Proceedings - 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2012
SP - 2892
EP - 2897
BT - Proceedings - 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2012
T2 - 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2012
Y2 - 29 May 2012 through 31 May 2012
ER -