Abstract

This study investigates table recognition techniques for digital documents, focusing on the challenges posed by diverse invoice layouts. A comparative evaluation of traditional pattern recognition and deep learning approaches highlighted their respective strengths and limitations. Special attention was given to ProjectionP, a proprietary lightweight method for resource-constrained environments, which combines morphological line extraction with pixel-based thresholding. A modified evaluation procedure adapted from ICDAR2013 was introduced to better balance over- and under-segmentation errors.

Comparative analysis showed that while Camelot leverages PDF metadata, it struggles with visual segmentation. Nanonets achieve high grid detection accuracy but can misplace text in complex tables. ProjectionP, optimized for desktop hardware, delivered competitive results, outperforming Camelot and matching Nanonets in specific cases.

Recommended Citation

Marczak, P. & Omieljanowicz, M. (2025). A Lightweight Approach to Table Recognition in Digital InvoicesIn I. Luković, S. Bjeladinović, B. Delibašić, D. Barać, N. Iivari, E. Insfran, M. Lang, H. Linger, & C. Schneider (Eds.), Empowering the Interdisciplinary Role of ISD in Addressing Contemporary Issues in Digital Transformation: How Data Science and Generative AI Contributes to ISD (ISD2025 Proceedings). Belgrade, Serbia: University of Gdańsk, Department of Business Informatics & University of Belgrade, Faculty of Organizational Sciences. ISBN: 978-83-972632-1-5. https://doi.org/10.62036/ISD.2025.31

Paper Type

Short Paper

DOI

10.62036/ISD.2025.31

Share

COinS
 

A Lightweight Approach to Table Recognition in Digital Invoices

This study investigates table recognition techniques for digital documents, focusing on the challenges posed by diverse invoice layouts. A comparative evaluation of traditional pattern recognition and deep learning approaches highlighted their respective strengths and limitations. Special attention was given to ProjectionP, a proprietary lightweight method for resource-constrained environments, which combines morphological line extraction with pixel-based thresholding. A modified evaluation procedure adapted from ICDAR2013 was introduced to better balance over- and under-segmentation errors.

Comparative analysis showed that while Camelot leverages PDF metadata, it struggles with visual segmentation. Nanonets achieve high grid detection accuracy but can misplace text in complex tables. ProjectionP, optimized for desktop hardware, delivered competitive results, outperforming Camelot and matching Nanonets in specific cases.