Skip to main navigation Skip to search Skip to main content

Gabor filter based text extraction from digital document images

  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents an algorithm that can automatically detect and extract text in digital document images. Firstly, we process and fuse Gabor filtered images at different orientations and scales and obtain an image that reflects the layout of the document image. Then, potential text regions are directly extracted from the resulting image. Finally, two criteria based on the geometrical property and high frequency content are adopted to kick-out those non-text regions. The experiments are performed on some representative images with different styles and with texts in different languages and fonts. Experimental results show that the algorithm works well on document images from a wide variety of source.

Original languageEnglish
Pages (from-to)2387-2390
Number of pages4
JournalTien Tzu Hsueh Pao/Acta Electronica Sinica
Volume34
Issue numberSUPPL.
StatePublished - Dec 2006

Keywords

  • Digital document images
  • Gabor filter
  • Text extraction

Fingerprint

Dive into the research topics of 'Gabor filter based text extraction from digital document images'. Together they form a unique fingerprint.

Cite this