A new fast DBSCAN using dual-space analysis and colour integral volume for document image segmentation

No Thumbnail Available

Date

2025

Journal Title

Journal ISSN

Volume Title

Publisher

Inderscience Publishers

Abstract

The segmentation of the colour document images is an essential step allowing facilitating and improving the stages of characterisation and interpretation of the information contained in these documents. Recent systems of automatic processing and recognition of document images, which use separation of colorimetric layers, are more efficient compared to conventional systems, only based on binary or grey levels images. This task requires non-supervised pixel segmentation or clustering techniques to separate the document image to a variable and unknown number of colour layers. The methods based on density are widely used in this context at pixel level, such as the DBSCAN method and its different variants, very robust to the noise and more adapted to the degradations present on document images, but who suffer from a great complexity. In this context, we propose a new faster DBSCAN variant using the volume integral in colorimetric space for the first time to significantly reduce calculation time. The combination of the two spaces, Cartesian and colorimetric has also accelerated the method and improved its performance on document images with different challenges. The results obtained show the effectiveness of the proposed approach, which was marked by significant improvement in the quality of segmentation and reduction in computed time

Description

Keywords

Clustering, DBSCAN, Region growing, Document image segmentation, Integral volume

Citation

Endorsement

Review

Supplemented By

Referenced By