Authors
Enas M. Elgbbas, Mahmoud I. Khalil, Hazem Abbas, Ain Shams University, Egypt
Abstract
This paper introduces a method for binarizing historical document images that suffer from non-uniform background, faint text, low contrast, stains, bleed-through, or shadows. The proposed method adaptively detects the non-uniform background in the document image and eliminates it. Areas that contain missing text are adaptively identified and reprocessed separately. Stain and bleed-through objects are detected based on stroke width and binarized locally. Shadows are detected based on the image contrast. Multilevel Otsu thresholding is applied for binarization. The method is tested on the DIBCO series.
Keywords
Document image binarization, Otsu multilevel
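
As a rough illustration of the multilevel Otsu step named in the abstract, the sketch below searches exhaustively for two thresholds that split a grayscale histogram into three classes with maximal between-class variance. It is not the authors' implementation: the function names `multilevel_otsu_two_thresholds` and `binarize_darkest_class`, the choice of three classes, and the rule that the darkest class is treated as text are all assumptions made for this example.

```python
import numpy as np


def multilevel_otsu_two_thresholds(image, bins=256):
    """Return (t1, t2) splitting gray levels into three classes:
    level <= t1, t1 < level <= t2, level > t2.

    Hypothetical helper: exhaustive three-class Otsu on an 8-bit image.
    """
    hist = np.bincount(image.ravel(), minlength=bins).astype(np.float64)
    levels = np.arange(bins, dtype=np.float64)
    n = np.cumsum(hist)            # cumulative pixel counts
    s = np.cumsum(hist * levels)   # cumulative sums of gray levels
    n_total, s_total = n[-1], s[-1]

    best_score, best_t1, best_t2 = -1.0, 0, 0
    for t1 in range(bins - 2):
        n0, s0 = n[t1], s[t1]
        if n0 == 0:
            continue
        for t2 in range(t1 + 1, bins - 1):
            n1, s1 = n[t2] - n0, s[t2] - s0
            n2, s2 = n_total - n[t2], s_total - s[t2]
            if n1 == 0 or n2 == 0:
                continue
            # The global mean is constant, so maximizing the sum of
            # n_k * mean_k**2 is equivalent to maximizing the
            # between-class variance.
            score = s0 * s0 / n0 + s1 * s1 / n1 + s2 * s2 / n2
            if score > best_score:
                best_score, best_t1, best_t2 = score, t1, t2
    return best_t1, best_t2


def binarize_darkest_class(image):
    """Assumption for this sketch: only the darkest class is text."""
    t1, _ = multilevel_otsu_two_thresholds(image)
    return image <= t1  # True marks pixels labeled as text
```

On a scanned page, the middle class would typically collect intermediate intensities such as faint ink or bleed-through. How those pixels are assigned is a design decision that this sketch does not settle.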